14 days ago

Software Engineer AI/ML AWS Neuron

Amazon Web Services (AWS)

On Site
Full Time
$165,200
Cupertino, CA

Job Overview

Job TitleSoftware Engineer AI/ML AWS Neuron
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$165,200
LocationCupertino, CA

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

Overview

The Software Engineer AI/ML AWS Neuron role at Amazon Web Services (AWS) focuses on developing, enabling, and optimizing large-scale machine learning model training across diverse architectures. You will work on designing, implementing, and improving distributed training solutions for modern ML models on AWS Trainium systems.

Key Responsibilities

  • Design, implement, and optimize distributed training solutions for large-scale ML models.
  • Extend and optimize distributed training frameworks like FSDP, torchtitan, and Hugging Face libraries.
  • Profile, analyze, and tune end-to-end training pipelines for optimal performance.
  • Collaborate with hardware, compiler, runtime teams, and AWS solution architects.
  • Engage with customers to deploy and optimize training workloads at scale.

About the Team & Culture

Join a team that fosters an inclusive culture, work/life balance and career growth. Benefit from mentorship opportunities, flexible working hours, and a collaborative environment that values diverse perspectives.

Basic & Preferred Qualifications

Basic qualifications include 3+ years of professional software development and design experience, programming proficiency, and deep learning algorithm skills. Preferred qualifications involve advanced experience with full development cycles, deep learning frameworks like PyTorch, Jax or Tensorflow, and distributed libraries.

Compensation & Benefits

The base salary range is provided along with additional benefits such as sign-on payments, RSUs, comprehensive health insurance, 401(k) matching, paid time off, and parental leave.

Key skills/competency

  • Distributed Training
  • ML Model Optimization
  • Deep Learning
  • Software Development
  • High Performance Computing
  • Programming
  • Optimization Techniques
  • AWS Trainium
  • Frameworks
  • Collaboration

Tags:

Software Engineer AI/ML AWS Neuron
distributed training
machine learning
deep learning
AWS Trainium
optimization
Pytorch
Tensorflow
scalability
performance

Share Job:

How to Get Hired at Amazon Web Services (AWS)

  • Research AWS culture: Study AWS mission and leadership principles.
  • Customize your resume: Highlight distributed training and ML skills.
  • Network on LinkedIn: Connect with current AWS employees.
  • Prepare for technical interviews: Practice coding and ML system design.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background