Manager, Software Verification

NVIDIA

Hybrid
Full Time
$300,000

Job Overview

Job Title: Manager, Software Verification
Job Type: Full Time
Category: Commerce
Experience: 5 Years
Degree: Master
Offered Salary: $300,000
Location: Hybrid

Job Description

About the Role: Manager, Software Verification at NVIDIA

As the world pivots towards Generative AI, the network is now the computer. NVIDIA is seeking a visionary leader to head its USA Networking Cluster Validation team. In this pivotal role, you will simulate the world’s largest AI data centers, ensuring that NVIDIA's InfiniBand and Ethernet solutions define the next era of computing. This is an unparalleled opportunity to be at the forefront of the AI revolution, pushing the limits of hyper-scale networking.

What You’ll Be Doing

  • Lead a high-performance engineering team dedicated to the qualification and integration of groundbreaking Networking AI/HPC cluster solutions.
  • Direct the design and testing of massive NVIDIA setups that simulate the production workloads of the world’s largest AI data center customers.
  • Partner with R&D to review architectural designs and requirements for next-generation features across the entire Ethernet and InfiniBand portfolio (switches and network adapters).
  • Oversee the creation of complex network topologies to ensure comprehensive product coverage, emphasizing the emulation of complex customer environments at scale.
  • Drive the roadmap for the testing automation team, ensuring seamless integration of new features into the software release cycles for data center products.
  • Serve as the primary Engineering Lead (person in charge, PIC) for full verification cycles; assist in debugging complex customer use cases and perform root-cause analysis for critical system issues.
  • Manage comprehensive testing scopes including Regression, Performance, Functional, and Scale, providing executive-level summary reports on release readiness.
  • Foster a culture of technical excellence by mentoring team members and driving professional growth within the organization.

What We Need To See

  • Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or equivalent experience.
  • 8+ years of overall technical experience in networking or systems engineering.
  • 5+ years of experience in a formal team leadership or engineering management role.
  • Proven ability to multi-task, drive people toward deadlines, and manage high-priority tasks in a fast-paced environment.
  • Excellent communication and technical presentation skills; the ability to explain complex technical concepts to both R&D and executive stakeholders.
  • Strong debugging, analytical, and problem-solving skills, and the ability to learn quickly.

Ways To Stand Out From The Crowd

  • Proven experience in testing and qualifying AI cluster infrastructure, including performance tuning for large-scale GPU-to-GPU communication.
  • Deep experience with technologies like KVM, Hyper-V, or Kubernetes.
  • Advanced knowledge of InfiniBand and Ethernet protocols (RDMA, RoCE).
  • Hands-on experience in programming or scripting (Python, Bash) for automated validation frameworks.

Key Skills/Competencies

  • Software Verification
  • Networking Engineering
  • AI Data Centers
  • InfiniBand Protocols
  • Ethernet Protocols
  • Test Automation
  • Team Leadership
  • Debugging & Analysis
  • Performance Tuning
  • Cluster Validation

Tags:

Software Verification Manager
Networking
AI
HPC
Verification
Testing
Leadership
Automation
Debugging
Validation
Performance
InfiniBand
Ethernet
RDMA
RoCE
KVM
Hyper-V
Kubernetes
Python
Bash
Data Center

How to Get Hired at NVIDIA

  • Research NVIDIA's AI Vision: Understand NVIDIA's leadership in AI, HPC, and networking solutions.
  • Tailor Your Resume: Highlight experience in networking verification, AI/HPC clusters, and team leadership.
  • Showcase Technical Expertise: Emphasize InfiniBand, Ethernet, KVM, Kubernetes, Python, and Bash skills.
  • Prepare for Technical Deep Dives: Be ready to discuss complex networking concepts and debugging strategies.
  • Demonstrate Leadership & Communication: Prepare examples of managing teams, driving projects, and presenting complex solutions.
