Systems Software Engineer AI Infrastructure @ NVIDIA
placeHybrid
attach_money $250,000
businessHybrid
scheduleFull Time
Posted 22 days ago
Your Application Journey
Interview
Email Hiring Manager
******  @nvidia.com
Recommended after applying
Job Details
About NVIDIA
NVIDIA is one of the technology world’s most desirable employers. With a legacy of transforming computer graphics, PC gaming, and accelerated computing, NVIDIA is now defining the next era of computing through AI Infrastructure.
What You Will Be Doing
As a Systems Software Engineer AI Infrastructure, you will:
- Develop and maintain large-scale systems for critical AI use cases.
- Implement SRE fundamentals including incident management and performance optimization.
- Build tools for improved observability and actionable reliability metrics.
- Establish frameworks for operational maturity with blameless postmortems.
- Collaborate with engineering teams, mentor peers, and contribute to hiring.
What We Need To See
Candidates should have:
- A degree in Computer Science or related field with 8+ years of experience.
- Proficiency in Python and at least one language such as C/C++, Go, Perl, or Ruby.
- Strong expertise in systems engineering, Linux/Windows environments, and cloud platforms.
- A deep understanding of SRE principles including error budgets, SLOs, and SLAs.
- Experience with Infrastructure as Code, observability platforms, and CI/CD systems.
- Excellent communication skills and commitment to diversity and continuous improvement.
Ways To Stand Out
Additional advantages include:
- Experience in AI training, inferencing, and data infrastructure services.
- Proficiency in deep learning frameworks like PyTorch, TensorFlow, JAX, and Ray.
- A background in hardware health monitoring and system reliability.
- Expertise in managing incidents and scaling distributed systems with stringent SLAs.
Compensation & Benefits
Your base salary will be determined based on location, experience, and similar roles, ranging from approximately 184,000 to 356,500 USD. Equity and benefits are also part of the package.
Key skills/competency
- AI Infrastructure
- SRE
- Systems Engineering
- Python
- Cloud Platforms
- Observability
- CI/CD
- Incident Management
- Automation
- Deep Learning
How to Get Hired at NVIDIA
🎯 Tips for Getting Hired
- Research NVIDIA's culture: Review mission, values, and recent innovations.
- Customize your resume: Tailor it to highlight relevant SRE and systems skills.
- Demonstrate hands-on expertise: Show projects in AI infrastructure and cloud environments.
- Prepare for technical interviews: Practice coding and systems design challenges.
- Emphasize communication: Be clear on your collaboration and mentoring experience.
📝 Interview Preparation Advice
Technical Preparation
circle
Review cloud platforms and Linux systems.
circle
Practice SRE principles and incident management.
circle
Work on coding in Python and C++.
circle
Study Infrastructure as Code and observability tools.
Behavioral Questions
circle
Describe a past incident resolution experience.
circle
Explain how you mentor team members.
circle
Discuss adapting to technical challenges.
circle
Detail collaboration in cross-functional projects.
Frequently Asked Questions
What qualifications does NVIDIA seek for Systems Software Engineer roles?
keyboard_arrow_down
What technical skills are required for this Systems Software Engineer AI Infrastructure?
keyboard_arrow_down
How important is SRE experience in this position at NVIDIA?
keyboard_arrow_down
Does NVIDIA value diversity in hiring for this role?
keyboard_arrow_down
What sets the AI Infrastructure team apart at NVIDIA?
keyboard_arrow_down