13 days ago

Machine Learning Engineer

Red Hat

Hybrid
Full Time
$150,000

Job Overview

Job Title: Machine Learning Engineer
Job Type: Full Time
Category: Commerce
Experience: 5 Years
Degree: Master
Offered Salary: $150,000
Location: Hybrid

Job Description

About Red Hat

Red Hat is the world's leading provider of enterprise open source software solutions, renowned for its community-powered approach to delivering high-performing Linux, cloud, container, and Kubernetes technologies. Operating in over 40 countries, the company offers flexible work options including in-office, hybrid, and fully remote arrangements.

About the Role

The Machine Learning Engineer will join the AI Inference Engineering team working on distributed vLLM infrastructure within the llm-d project. The role involves developing and optimizing scalable inference systems for large language models (LLMs) using Kubernetes-native deployments and collaborating with an open-source community.

Responsibilities

  • Design, develop, and test new features for the AI Inference platform.
  • Engage actively with upstream open-source communities.
  • Develop distributed inference infrastructure leveraging Kubernetes APIs and operators.
  • Implement system components in Go and/or Rust and integrate with vLLM.
  • Optimize resource utilization, fault tolerance, and stability of inference systems.
  • Participate in technical design discussions and conduct code reviews.

Qualifications

  • Proficiency in Python and/or GoLang.
  • Experience with cloud-native Kubernetes and service mesh technologies.
  • Understanding of networking protocols such as HTTP/2 and gRPC, and of API gateways.
  • Knowledge of LLM serving runtimes including vLLM and TensorRT-LLM.
  • Excellent communication skills and ability to work in a dynamic environment.
  • Optional: Experience with C, C++, or Rust, and familiarity with GPU benchmarking tools.

Benefits

Comprehensive medical, dental, and vision coverage, retirement plans with employer matching, paid time off, parental leave, stock purchase plans, tuition reimbursement, and more.

Key skills/competency

  • Python
  • GoLang
  • Kubernetes
  • vLLM
  • Cloud-native
  • Distributed Systems
  • Networking
  • Open Source
  • Deep Learning
  • Fault Tolerance

Tags:

Machine Learning Engineer
Python
GoLang
Kubernetes
Distributed Systems
vLLM
Networking
Open Source
Inference
LLM
cloud-native
API

How to Get Hired at Red Hat

  • Research Red Hat's culture: Study their mission, values, and recent innovations.
  • Customize your resume: Highlight open-source contributions and Kubernetes skills.
  • Emphasize technical expertise: Showcase programming and distributed systems experience.
  • Prepare for interviews: Practice discussing scalable AI and inference systems.
