Question 1

What is the primary focus of an AI Training Optimization Engineer at AMD?

Accepted Answer

The core focus of this role at AMD is to enhance the performance and efficiency of AI model training on AMD GPUs by identifying bottlenecks, optimizing critical kernels, and strengthening the overall training ecosystem for customers.

Question 2

What technical skills are crucial for the AI Training Optimization Engineer role at AMD?

Accepted Answer

Key technical skills include hands-on experience with HIP, CUDA, and Triton for GPU programming, a strong understanding of Transformer models and attention mechanisms, and expertise in profiling and optimizing kernels with low-level tools.

Question 3

How does AMD's Training Optimization Team support customers?

Accepted Answer

The team directly supports customers by ensuring smooth AI model training on AMD GPUs, which involves diagnosing performance bottlenecks and delivering precise, kernel-level performance improvements.

Question 4

What kind of kernel development is involved for an AI Training Optimization Engineer at AMD?

Accepted Answer

Kernel development involves designing and optimizing 'hot' operators using technologies like HIP, CUDA, and Triton for real-world AI training workloads, and also prototyping next-generation kernel techniques such as sparse or linear attention operations.

Question 5

Does the AI Training Optimization Engineer at AMD work with distributed training?

Accepted Answer

Yes, a significant part of the role involves optimizing distributed training performance across multi-GPU and multi-node clusters, focusing on improving communication/compute overlap and parallelism strategies like DP/TP/PP/ZeRO.

Question 6

What academic background is preferred for this AMD AI engineering role?

Accepted Answer

Candidates are typically expected to have a Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or a closely related and equivalent field.

Question 7

What frameworks and libraries are important for this AI Training Optimization Engineer role at AMD?

Accepted Answer

Familiarity with PyTorch internals, Megatron-LM, DeepSpeed, or other large-training frameworks is preferred, along with experience contributing to kernel libraries such as CUTLASS or CK, Triton, or broader ML compiler ecosystems.

Question 8

How does an AI Training Optimization Engineer contribute to AMD's AI ecosystem?

Accepted Answer

This role contributes by filling functional gaps, improving framework integration, enhancing ROCm-based training performance, and driving upstream improvements through collaboration with GPU library and open-source teams.

This job post expired on March 19, 2026

AI Training Optimization Engineer

AMD

Job Overview

Who's the hiring manager?

Job Description

About AMD

The Role: AI Training Optimization Engineer

The Person

Key Responsibilities

Preferred Experience

Academic Credentials

Key skills/competency

Tags:

How to Get Hired at AMD

Frequently Asked Questions