Software Engineer Model Serving Systems ML Plat... @ Netflix
Your Application Journey
Email Hiring Manager
Job Details
About Netflix
Netflix is one of the world's leading entertainment services with 283 million paid memberships in over 190 countries. Members enjoy a diverse range of TV series, films, and games regardless of location.
Role Overview
The Software Engineer Model Serving Systems ML Platform role is crucial in developing the computational platform powering almost all consumer and studio-facing ML/AI applications at Netflix. The position focuses on building scalable model serving infrastructure for real-time ML model inference including innovations in LLMs and other large models.
What You Will Do
- Develop and expand compute infrastructure for ML/AI initiatives
- Build systems for online ML model inference and serving
- Optimize latency, performance, and cost for high-traffic distributed services
- Collaborate cross-functionally with engineers, product managers, and data scientists
- Deploy and maintain ML models using tools such as Triton Inference Server and Docker
- Ensure best practices in production hosting, capacity planning, and performance tuning
What We Are Looking For
Netflix seeks engineers with hands-on experience in building scalable distributed systems for ML model serving. Candidates should be proficient in object-oriented programming (preferably Java), have experience with public cloud services (AWS, Azure, or GCP), and exhibit strong communication skills. A BS/MS in Computer Science, Applied Math, Engineering, or a related field is required.
Benefits & Culture
Netflix offers comprehensive benefits including Health Plans, Mental Health support, 401(k) with employer match, Stock Option Programs, Disability Programs, and generous paid time off. Netflix values diversity and inclusion and strives to provide an accessible interview process for all candidates.
Key skills/competency
- ML infrastructure
- Distributed systems
- Real-time inference
- LLM
- Java
- Triton Inference Server
- Docker
- Cloud (AWS/Azure/GCP)
- Capacity planning
- Observability
How to Get Hired at Netflix
🎯 Tips for Getting Hired
- Customize your resume: Tailor experiences to ML infrastructure roles.
- Highlight technical skills: Emphasize Java, cloud, and Docker.
- Prepare for system design: Showcase distributed systems experience.
- Practice behavioral interviews: Share teamwork and problem-solving stories.