Senior AI Performance and Efficiency Engineer
NVIDIA
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
Senior AI Performance and Efficiency Engineer, GPU Clusters
NVIDIA is seeking a highly skilled Senior AI/ML Performance and Efficiency Engineer to join our AI Efficiency team. In this role, you will significantly enhance the efficiency of our researchers by implementing improvements across the entire technology stack. You will collaborate closely with customers to identify and resolve infrastructure and application deficiencies, enabling groundbreaking AI and ML research on GPU Clusters. Together, we will build powerful, efficient, and scalable solutions that shape the future of AI/ML technology!
What You Will Be Doing
- Collaborate with AI/ML researchers to optimize ML models for improved productivity and cost savings.
- Develop tools, frameworks, and apply ML techniques to detect and analyze efficiency bottlenecks.
- Engage with researchers on diverse ML workloads including Robotics, Autonomous Vehicles, LLMs, and Video.
- Partner with engineering teams to optimize hardware, software, and infrastructure usage.
- Monitor fleet-wide utilization, analyze inefficiency patterns, and implement scalable solutions.
- Stay current with AI/ML technologies and advocate for their adoption within the organization.
What We Need To See
- BS or equivalent in Computer Science or a related field, or equivalent experience.
- Minimum 5 years of experience designing and operating large-scale compute infrastructure.
- Strong understanding of modern ML techniques and tools.
- Experience in investigating and resolving training and inference performance issues.
- Debugging and optimization experience with NSight Systems and NSight Compute.
- Experience debugging large-scale distributed training using NCCL.
- Proficiency in Python, Go, Bash, and familiarity with cloud platforms (AWS, GCP, Azure).
- Experience with parallel computing frameworks and paradigms.
- Commitment to continuous learning in AI/ML infrastructure.
- Excellent communication and collaboration skills.
Ways To Stand Out From The Crowd
- Background with NVIDIA GPUs, CUDA Programming, NCCL, and MLPerf benchmarking.
- Experience with Machine Learning and Deep Learning concepts, algorithms, and models.
- Familiarity with InfiniBand with IBOP and RDMA.
- Understanding of fast, distributed storage systems (Lustre, GPFS) for AI/HPC.
- Familiarity with deep learning frameworks like PyTorch and TensorFlow.
About NVIDIA
NVIDIA offers competitive salaries and a comprehensive benefits package. Our engineering teams are growing rapidly due to outstanding expansion. If you're a passionate and independent engineer with a love for technology, we want to hear from you.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4. You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until March 23, 2026. This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Key skills/competency
- Senior AI Performance Engineer
- Machine Learning Efficiency
- GPU Cluster Optimization
- Distributed Systems
- Performance Debugging
- NVIDIA GPUs
- CUDA Programming
- Python
- ML Frameworks
- Infrastructure Optimization
How to Get Hired at NVIDIA
- Tailor your resume: Highlight experience with AI/ML performance, GPU clusters, and NVIDIA technologies.
- Showcase relevant skills: Emphasize Python, distributed training, and performance debugging tools like NSight.
- Quantify achievements: Use data to demonstrate productivity improvements and cost savings from your optimizations.
- Prepare for technical interviews: Be ready to discuss ML concepts, system design, and debugging scenarios.
- Research NVIDIA's impact: Understand their role in AI advancement and how your skills contribute.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background