13 days ago

Generative AI Inference Engineer

Stability AI

Hybrid
Full Time
$150,000
Hybrid

Job Overview

Job TitleGenerative AI Inference Engineer
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$150,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About the role

Stability AI is seeking passionate Machine Learning Engineers to join its Inference team focused on creative applications of generative AI models. As a Generative AI Inference Engineer, you will leverage cutting-edge technology to push the boundaries of multi-modal inference optimization.

Responsibilities

  • Drive the design and development of multi-modal ML inference systems.
  • Collaborate with Platform and Inference teams on model optimization and deployment.
  • Partner with cloud providers to deliver hosted Stability AI inference solutions.
  • Act as a strategic thought partner driving business impact through ML.
  • Prototype and productionize enhancements and new inference features.

Qualifications

  • 7+ years of experience productionizing machine learning systems.
  • Expert in Python services at scale and scientific stack including pyTorch.
  • Proficiency in at least one high-performance inference framework like Triton or TensorRT.
  • Deep understanding of diffusion architectures and profiling tools for Nvidia GPUs.
  • Experience with Kubernetes, major cloud providers (AWS, GCP, Azure), Docker and open-source ML ecosystems.
  • Strong communication, collaboration, and documentation skills.

Key skills/competency

  • Machine Learning
  • Inference Systems
  • Multi-modal Models
  • Python
  • pyTorch
  • Docker
  • Cloud Deployment
  • Diffusion Architecture
  • Optimization
  • High-performance Computing

Equal Employment Opportunity

Stability AI is an equal opportunity employer and does not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

Tags:

Generative AI Inference Engineer
Multi-modal
Inference
Optimization
Python
pyTorch
Cloud
Diffusion
Kubernetes
Docker
Nvidia
Profiling
High-performance
Deployment
Research
Collaboration
Prototyping
Production

Share Job:

How to Get Hired at Stability AI

  • Customize your resume: Highlight ML deployment and optimization skills.
  • Research Stability AI: Understand their culture, projects, and innovations.
  • Prepare projects: Showcase work with multi-modal inference systems.
  • Practice technical questions: Focus on Python, pyTorch and diffusion architectures.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background