8 hours ago

Developer Technology Engineer - AI

NVIDIA

On Site
Full Time
$180,000
Beijing, Beijing, China

Job Overview

Job TitleDeveloper Technology Engineer - AI
Job TypeFull Time
Offered Salary$180,000
LocationBeijing, Beijing, China

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About the Developer Technology Engineer - AI Role at NVIDIA

NVIDIA is looking for a passionate, world-class computer scientist to work in its Compute Developer Technology (DevTech) team. In this role, you will research and develop techniques to GPU-accelerate leading applications in high performance computing fields within machine and deep learning, scientific computing, and data processing, performing in-depth analysis and optimization to ensure the best possible performance on current- and next-generation GPU architectures.

What You Will Be Doing

  • Working directly with key application developers (especially LLM) to understand the current and future problems they are solving, creating and optimizing core parallel algorithms and data structures to provide the best solutions using GPUs, through both library development and direct contribution to the applications. This includes training and inference optimization for large language models, directly contributing to frameworks such as Megatron and TRTLLM, SGLang, vLLM.
  • Collaborating closely with the architecture, research, libraries, tools, and system software teams at NVIDIA to influence the design of next-generation architectures, software platforms, and programming models, including by investigating impact on application performance and developer productivity.
  • Engaging in deep optimization of high-performance operators, involving but not limited to CUDA deep optimization, instruction and compiler optimization. These optimizations will directly support customers or be integrated into products like cuDNN, cuBLAS, and CUTLASS.
  • Some travel is required for conferences and for on-site visits with developers.

What We Need To See

  • A degree from university in an engineering or computer science related discipline (BS; MS or PhD preferred).
  • 2+ working experience is required.
  • Strong knowledge of C/C++ and/or Fortran.
  • Deep knowledge of software design, programming techniques, and algorithms.
  • Expert knowledge of LLM training/inference optimization, including but not limited to development and optimization experience in distributed training/inference, NCCL, NVSHMEM, IB, RoCE, etc.
  • Strong mathematical fundamentals, including linear algebra and numerical methods.
  • Experience with parallel programming, ideally CUDA C/C++ and OpenACC.
  • Good communication and organization skills, with a logical approach to problem solving, good time management, and task prioritization skills.

Key skills/competency

  • GPU Acceleration
  • Machine Learning
  • Deep Learning
  • High-Performance Computing (HPC)
  • CUDA
  • LLM Optimization
  • C/C++ Programming
  • Parallel Algorithms
  • Distributed Training
  • Performance Tuning

Tags:

Developer Technology Engineer
AI
Machine Learning
Deep Learning
GPU Acceleration
HPC
Optimization
Algorithms
Distributed training
Performance analysis
Software development
CUDA
C++
Fortran
OpenACC
LLM
Megatron
cuDNN
cuBLAS
CUTLASS
NCCL

Share Job:

How to Get Hired at NVIDIA

  • Research NVIDIA's culture: Study their mission, values, recent AI innovations, and employee testimonials on LinkedIn and Glassdoor to understand their drive for excellence.
  • Tailor your resume: Highlight extensive C/C++, CUDA, parallel programming, and LLM optimization experience to match the Developer Technology Engineer - AI role's requirements.
  • Showcase GPU and HPC expertise: Provide concrete examples of projects where you optimized applications for GPUs or high-performance computing environments.
  • Prepare for technical deep-dives: Expect rigorous questions on algorithms, data structures, distributed training, and low-level optimization techniques specific to GPU architecture.
  • Demonstrate collaborative problem-solving: Be ready to discuss how you've partnered with developers and internal teams to solve complex technical challenges and influence product design.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background