2 months ago

Generative AI Inference Engineer

Stability AI

Hybrid

Full Time

$150,000

Job Overview

Job Title: Generative AI Inference Engineer

Job Type: Full Time

Offered Salary: $150,000

Location: Hybrid

Job Description

About the role

Stability AI is seeking passionate Machine Learning Engineers to join its Inference team focused on creative applications of generative AI models. As a Generative AI Inference Engineer, you will leverage cutting-edge technology to push the boundaries of multi-modal inference optimization.

Responsibilities

Drive the design and development of multi-modal ML inference systems.
Collaborate with Platform and Inference teams on model optimization and deployment.
Partner with cloud providers to deliver hosted Stability AI inference solutions.
Act as a strategic thought partner driving business impact through ML.
Prototype and productionize enhancements and new inference features.

Qualifications

7+ years of experience productionizing machine learning systems.
Expertise in building Python services at scale and in the scientific Python stack, including PyTorch.
Proficiency in at least one high-performance inference framework like Triton or TensorRT.
Deep understanding of diffusion architectures and profiling tools for Nvidia GPUs.
Experience with Kubernetes, major cloud providers (AWS, GCP, Azure), Docker and open-source ML ecosystems.
Strong communication, collaboration, and documentation skills.

Key skills/competency

Machine Learning
Inference Systems
Multi-modal Models
Python
PyTorch
Docker
Cloud Deployment
Diffusion Architecture
Optimization
High-performance Computing

Equal Employment Opportunity

Stability AI is an equal opportunity employer and does not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

Tags:

Generative AI Inference Engineer

Multi-modal

Inference

Optimization

Python

PyTorch

Cloud

Diffusion

Kubernetes

Docker

Nvidia

Profiling

High-performance

Deployment

Research

Collaboration

Prototyping

Production

How to Get Hired at Stability AI

Customize your resume: Highlight ML deployment and optimization skills.
Research Stability AI: Understand their culture, projects, and innovations.
Prepare projects: Showcase work with multi-modal inference systems.
Practice technical questions: Focus on Python, PyTorch, and diffusion architectures.

Frequently Asked Questions

Find answers to common questions about this job opportunity

01. What is expected from a Generative AI Inference Engineer at Stability AI?

02. How important is experience with diffusion architectures for Stability AI?

03. What programming skills are required for the Generative AI Inference Engineer position at Stability AI?

04. Can candidates with cloud deployment experience apply for Stability AI's role?

This job post expired on March 13, 2026
