9 days ago

Backend Software Engineer, Evals

OpenAI

On Site
Full Time
$300,000
San Francisco, CA

Job Overview

Job TitleBackend Software Engineer, Evals
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$300,000
LocationSan Francisco, CA

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About The Team

The Support Automation team at OpenAI scales the organization by applying cutting-edge AI models to real-world challenges, automating and enhancing work across the organization. From customer operations to engineering, we develop an ecosystem of automation products that empower our colleagues and drive impact. We're passionate about crafting products that serve those around us, blending rapid prototyping with a focus on long-term quality and reliability. By creating reusable solutions, we create patterns that can be applied across diverse domains within OpenAI.

This team leverages OpenAI technology to improve OpenAI, providing the opportunity to leverage the full extent of our tech (both public and pre-released) to accomplish this mission.

About The Role

We’re looking for a Backend Software Engineer, Evals with experience working in ML/LLM-heavy domains to help to design and build an evals infrastructure that measures the quality of OpenAI’s support automation. This is a deeply technical and highly cross-functional role where you’ll build robust systems and backend services that serve as the foundation for how knowledge is created, accessed, and applied across OpenAI. The role will especially focus on working closely with Data Science and Research partners to design and build evals at scale.

In This Role, You Will

  • Design eval pipelines that are reliable, reproducible, and extendable.
  • Build the infrastructure for continuous eval monitoring frameworks (regression/drift monitoring, building robust golden datasets) along with feedback loops that ultimately strengthen support automation.
  • Design, build, and maintain backend services and APIs to support intelligent automation and knowledge systems.
  • Integrate and structure data across internal platforms, transforming it into formats optimized for use by downstream systems and AI workflows.
  • Collaborate closely with data, research, and engineering teams to integrate OpenAI models into high-leverage workflows.
  • Own the full development lifecycle of new backend systems and internal platform capabilities.
  • Build with scale and maintainability in mind, while rapidly iterating on new ideas.

You Might Be a Great Fit If You Have

  • 4+ years of backend engineering experience at product-driven companies (excluding internships).
  • Proficiency in backend technologies. Our tech stack includes Python, FastAPI, and Postgres.
  • Experience designing and scaling distributed systems, APIs, or data processing pipelines.
  • Experience building AI agents or applications, including designing evals and improving performance through prompting or scaffolding.
  • Familiarity with evaluation methods for LLMs and experience with patterns like multi-agent workflows, tool use, or long context.
  • Experience creating production evals and/or measuring performance of ML/LLM models at scale.
  • A pragmatic mindset. You’re comfortable shipping iteratively while building toward a long-term vision.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

Key skills/competency

  • Backend Engineering
  • ML/LLM Domain Expertise
  • Evals Infrastructure Design
  • Distributed Systems
  • API Development
  • Data Integration
  • Python
  • FastAPI
  • Postgres
  • AI Agents
  • Continuous Monitoring

Tags:

Backend Software Engineer
ML Engineering
LLM Evaluation
Distributed Systems
API Development
Data Integration
Automation
Python
FastAPI
Postgres
AI Agents
System Design
Continuous Monitoring
Scalability
Software Development
Data Pipelines
Research Collaboration
Internal Tools
Product Development
Machine Learning

Share Job:

How to Get Hired at OpenAI

  • Research OpenAI's mission: Study their dedication to general AI benefits, safety, and core values.
  • Tailor your resume: Highlight backend engineering, ML/LLM experience, and distributed systems expertise.
  • Showcase eval experience: Emphasize designing and building evaluation frameworks for AI models.
  • Understand OpenAI's tech stack: Prepare for questions on Python, FastAPI, Postgres, and AI applications.
  • Demonstrate cross-functional skills: Be ready to discuss collaboration with data, research, and engineering teams.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background