15 days ago

Senior AI Performance and Efficiency Engineer

NVIDIA

On Site
Full Time
$241,500
New York, NY
Apply

Job Overview

Job TitleSenior AI Performance and Efficiency Engineer
Job TypeFull Time
Offered Salary$241,500
LocationNew York, NY

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

Senior AI Performance and Efficiency Engineer, GPU Clusters

NVIDIA is seeking a highly skilled Senior AI/ML Performance and Efficiency Engineer to join our AI Efficiency team. In this role, you will significantly enhance the efficiency of our researchers by implementing improvements across the entire technology stack. You will collaborate closely with customers to identify and resolve infrastructure and application deficiencies, enabling groundbreaking AI and ML research on GPU Clusters. Together, we will build powerful, efficient, and scalable solutions that shape the future of AI/ML technology!

What You Will Be Doing

  • Collaborate with AI/ML researchers to optimize ML models for improved productivity and cost savings.
  • Develop tools, frameworks, and apply ML techniques to detect and analyze efficiency bottlenecks.
  • Engage with researchers on diverse ML workloads including Robotics, Autonomous Vehicles, LLMs, and Video.
  • Partner with engineering teams to optimize hardware, software, and infrastructure usage.
  • Monitor fleet-wide utilization, analyze inefficiency patterns, and implement scalable solutions.
  • Stay current with AI/ML technologies and advocate for their adoption within the organization.

What We Need To See

  • BS or equivalent in Computer Science or a related field, or equivalent experience.
  • Minimum 5 years of experience designing and operating large-scale compute infrastructure.
  • Strong understanding of modern ML techniques and tools.
  • Experience in investigating and resolving training and inference performance issues.
  • Debugging and optimization experience with NSight Systems and NSight Compute.
  • Experience debugging large-scale distributed training using NCCL.
  • Proficiency in Python, Go, Bash, and familiarity with cloud platforms (AWS, GCP, Azure).
  • Experience with parallel computing frameworks and paradigms.
  • Commitment to continuous learning in AI/ML infrastructure.
  • Excellent communication and collaboration skills.

Ways To Stand Out From The Crowd

  • Background with NVIDIA GPUs, CUDA Programming, NCCL, and MLPerf benchmarking.
  • Experience with Machine Learning and Deep Learning concepts, algorithms, and models.
  • Familiarity with InfiniBand with IBOP and RDMA.
  • Understanding of fast, distributed storage systems (Lustre, GPFS) for AI/HPC.
  • Familiarity with deep learning frameworks like PyTorch and TensorFlow.

About NVIDIA

NVIDIA offers competitive salaries and a comprehensive benefits package. Our engineering teams are growing rapidly due to outstanding expansion. If you're a passionate and independent engineer with a love for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 23, 2026. This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Key skills/competency

  • Senior AI Performance Engineer
  • Machine Learning Efficiency
  • GPU Cluster Optimization
  • Distributed Systems
  • Performance Debugging
  • NVIDIA GPUs
  • CUDA Programming
  • Python
  • ML Frameworks
  • Infrastructure Optimization

Tags:

AI Engineer
Machine Learning Engineer
Performance Engineer
Efficiency Engineer
GPU
NVIDIA
CUDA
Python
Distributed Systems
Cloud Computing
MLOps

Share Job:

How to Get Hired at NVIDIA

  • Tailor your resume: Highlight experience with AI/ML performance, GPU clusters, and NVIDIA technologies.
  • Showcase relevant skills: Emphasize Python, distributed training, and performance debugging tools like NSight.
  • Quantify achievements: Use data to demonstrate productivity improvements and cost savings from your optimizations.
  • Prepare for technical interviews: Be ready to discuss ML concepts, system design, and debugging scenarios.
  • Research NVIDIA's impact: Understand their role in AI advancement and how your skills contribute.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background