Systems Software Engineer AI Infrastructure
@ NVIDIA

Hybrid
$250,000
Hybrid
Full Time
Posted 22 days ago

Your Application Journey

Personalized Resume
Apply
Email Hiring Manager
Interview

Email Hiring Manager

XXXXXXXXX XXXXXXXXXXX XXXXXXX****** @nvidia.com
Recommended after applying

Job Details

About NVIDIA

NVIDIA is one of the technology world’s most desirable employers. With a legacy of transforming computer graphics, PC gaming, and accelerated computing, NVIDIA is now defining the next era of computing through AI Infrastructure.

What You Will Be Doing

As a Systems Software Engineer AI Infrastructure, you will:

  • Develop and maintain large-scale systems for critical AI use cases.
  • Implement SRE fundamentals including incident management and performance optimization.
  • Build tools for improved observability and actionable reliability metrics.
  • Establish frameworks for operational maturity with blameless postmortems.
  • Collaborate with engineering teams, mentor peers, and contribute to hiring.

What We Need To See

Candidates should have:

  • A degree in Computer Science or related field with 8+ years of experience.
  • Proficiency in Python and at least one language such as C/C++, Go, Perl, or Ruby.
  • Strong expertise in systems engineering, Linux/Windows environments, and cloud platforms.
  • A deep understanding of SRE principles including error budgets, SLOs, and SLAs.
  • Experience with Infrastructure as Code, observability platforms, and CI/CD systems.
  • Excellent communication skills and commitment to diversity and continuous improvement.

Ways To Stand Out

Additional advantages include:

  • Experience in AI training, inferencing, and data infrastructure services.
  • Proficiency in deep learning frameworks like PyTorch, TensorFlow, JAX, and Ray.
  • A background in hardware health monitoring and system reliability.
  • Expertise in managing incidents and scaling distributed systems with stringent SLAs.

Compensation & Benefits

Your base salary will be determined based on location, experience, and similar roles, ranging from approximately 184,000 to 356,500 USD. Equity and benefits are also part of the package.

Key skills/competency

  • AI Infrastructure
  • SRE
  • Systems Engineering
  • Python
  • Cloud Platforms
  • Observability
  • CI/CD
  • Incident Management
  • Automation
  • Deep Learning

How to Get Hired at NVIDIA

🎯 Tips for Getting Hired

  • Research NVIDIA's culture: Review mission, values, and recent innovations.
  • Customize your resume: Tailor it to highlight relevant SRE and systems skills.
  • Demonstrate hands-on expertise: Show projects in AI infrastructure and cloud environments.
  • Prepare for technical interviews: Practice coding and systems design challenges.
  • Emphasize communication: Be clear on your collaboration and mentoring experience.

📝 Interview Preparation Advice

Technical Preparation

Review cloud platforms and Linux systems.
Practice SRE principles and incident management.
Work on coding in Python and C++.
Study Infrastructure as Code and observability tools.

Behavioral Questions

Describe a past incident resolution experience.
Explain how you mentor team members.
Discuss adapting to technical challenges.
Detail collaboration in cross-functional projects.

Frequently Asked Questions