Software Engineer Model Serving Systems ML Plat...
@ Netflix

Hybrid
$720,000
Hybrid
Full Time
Posted 10 hours ago

Your Application Journey

Personalized Resume
Apply
Email Hiring Manager
Interview

Email Hiring Manager

XXXXXXXXX XXXXXXXXXXXXX XXXXXXXXX****** @netflix.com
Recommended after applying

Job Details

About Netflix

Netflix is one of the world's leading entertainment services with 283 million paid memberships in over 190 countries. Members enjoy a diverse range of TV series, films, and games regardless of location.

Role Overview

The Software Engineer Model Serving Systems ML Platform role is crucial in developing the computational platform powering almost all consumer and studio-facing ML/AI applications at Netflix. The position focuses on building scalable model serving infrastructure for real-time ML model inference including innovations in LLMs and other large models.

What You Will Do

  • Develop and expand compute infrastructure for ML/AI initiatives
  • Build systems for online ML model inference and serving
  • Optimize latency, performance, and cost for high-traffic distributed services
  • Collaborate cross-functionally with engineers, product managers, and data scientists
  • Deploy and maintain ML models using tools such as Triton Inference Server and Docker
  • Ensure best practices in production hosting, capacity planning, and performance tuning

What We Are Looking For

Netflix seeks engineers with hands-on experience in building scalable distributed systems for ML model serving. Candidates should be proficient in object-oriented programming (preferably Java), have experience with public cloud services (AWS, Azure, or GCP), and exhibit strong communication skills. A BS/MS in Computer Science, Applied Math, Engineering, or a related field is required.

Benefits & Culture

Netflix offers comprehensive benefits including Health Plans, Mental Health support, 401(k) with employer match, Stock Option Programs, Disability Programs, and generous paid time off. Netflix values diversity and inclusion and strives to provide an accessible interview process for all candidates.

Key skills/competency

  • ML infrastructure
  • Distributed systems
  • Real-time inference
  • LLM
  • Java
  • Triton Inference Server
  • Docker
  • Cloud (AWS/Azure/GCP)
  • Capacity planning
  • Observability

How to Get Hired at Netflix

🎯 Tips for Getting Hired

  • Customize your resume: Tailor experiences to ML infrastructure roles.
  • Highlight technical skills: Emphasize Java, cloud, and Docker.
  • Prepare for system design: Showcase distributed systems experience.
  • Practice behavioral interviews: Share teamwork and problem-solving stories.

📝 Interview Preparation Advice

Technical Preparation

Review Java and OOP principles.
Practice cloud deployment scenarios.
Study distributed systems design.
Refresh containerization with Docker.

Behavioral Questions

Describe a challenging team project.
Explain your problem-solving process.
Detail a time you learned new skills.
Discuss effective cross-team communication.

Frequently Asked Questions