Question 1

What are the key technical skills required for the AI/ML Validation Engineer role at AMD?

Accepted Answer

For the AI/ML Validation Engineer position at AMD, key technical skills include experience with AI infrastructure (GPUs, networking, ROCEv2), distributed training and inference frameworks (PyTorch, Tensorflow, vLLM), automation scripting (Python, Golang), and familiarity with schedulers like Kubernetes and Slurm. Experience with performance profiling and debugging complex compute, network, and storage issues is also crucial.

Question 2

What kind of AI workloads will I be working with as an AI/ML Validation Engineer at AMD?

Accepted Answer

As an AI/ML Validation Engineer at AMD, you will be working with complex AI solutions, focusing on distributed training and inference workloads. This includes training Large Language Models (LLMs), Mixture-of-Experts (MoE) models, Image Generation, and recommendation models, as well as running inference benchmarks for various AI applications.

Question 3

How important is experience with AMD ROCM for this AI/ML Validation Engineer role?

Accepted Answer

Experience with AMD ROCM is considered an added advantage for the AI/ML Validation Engineer role. While not strictly mandatory, having experience validating AI solutions with AMD's ROCM software will be highly beneficial and can set your application apart.

Question 4

What academic background is preferred for the AI/ML Validation Engineer position at AMD?

Accepted Answer

AMD prefers candidates for the AI/ML Validation Engineer position to hold a Bachelor’s or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or an equivalent field. This academic background provides a strong foundation for the technical demands of the role.

Question 5

Can you provide insights into the interview process for the AI/ML Validation Engineer job at AMD?

Accepted Answer

The interview process for an AI/ML Validation Engineer at AMD typically involves technical assessments focusing on your validation, automation, and AI/ML infrastructure knowledge. Behavioral questions will also be used to assess your problem-solving skills, communication abilities, and how you collaborate with cross-functional teams.

AI ML Validation Engineer

AMD

Job Overview

Who's the hiring manager?

Job Description

About AMD

The Role

The Person

Key Responsibilities

Preferred Experience

Preferred Academic Credentials

Key skills/competency

Tags:

How to Get Hired at AMD

Frequently Asked Questions