Staff Infrastructure Engineer, Discovery Team
@ Anthropic

San Francisco, CA
$382,500
On Site
Full Time
Posted 21 days ago

Your Application Journey

Personalized Resume
Apply
Email Hiring Manager
Interview

Email Hiring Manager

XXXXXXXXX XXXXXXXXXXXXX XXXXXX****** @anthropic.com
Recommended after applying

Job Details

About Anthropic

Anthropic is dedicated to creating reliable, interpretable, and steerable AI systems that are safe and beneficial for society.

About The Team

The Discovery Team is focused on building an AI scientist capable of solving long-term reasoning challenges and advancing scientific workflows through improved model capabilities and robust infrastructure.

About The Role

As a Staff Infrastructure Engineer on our team, you will work end to end to identify and remove key infrastructure blockers on the path to scientific AGI. You will design, develop, and optimize large-scale systems that support AI scientist training, evaluation, and deployment across distributed environments.

Responsibilities

  • Design and implement large-scale infrastructure systems for AI scientist workflows.
  • Identify and resolve infrastructure bottlenecks affecting model performance.
  • Develop evaluation frameworks to measure progress toward scientific AGI.
  • Build scalable VM/sandboxing/container architectures for long-horizon tasks.
  • Collaborate with researchers to translate experimental requirements into production-ready systems.
  • Develop and optimize large-scale data pipelines for language model training.

You May Be a Good Fit If You

  • Have 6+ years of experience in infrastructure engineering.
  • Are proficient with performance optimization and distributed systems.
  • Have experience with containerization (Docker, Kubernetes) and orchestration.
  • Have built large-scale data pipelines and distributed storage systems.
  • Communicate effectively and work collaboratively across the ML stack.

Strong Candidates May Also Have

  • Experience with language model training and distributed ML frameworks.
  • Background in building infrastructure for AI research labs or large-scale ML organizations.
  • Knowledge of GPU/TPU architectures and cloud platforms (AWS, GCP).
  • Experience with workflow orchestration tools and experiment management systems.

Additional Information

The role offers competitive compensation, equity, benefits, and a hybrid working arrangement with at least 25% office attendance. Visa sponsorship is available under certain conditions.

Key skills/competency

  • Infrastructure Engineering
  • Distributed Systems
  • Performance Optimization
  • Containerization
  • Data Pipelines
  • VM/Sandboxing
  • ML Workloads
  • System Architecture
  • Cloud Platforms
  • Collaboration

How to Get Hired at Anthropic

🎯 Tips for Getting Hired

  • Customize your resume: Tailor your experience to distributed systems and ML.
  • Highlight key projects: Showcase large-scale infrastructure achievements.
  • Research Anthropic: Understand their mission and recent innovations.
  • Practice technical interviews: Prepare for system design and performance questions.

📝 Interview Preparation Advice

Technical Preparation

Review distributed system design patterns.
Practice container orchestration exercises.
Optimize performance in simulated environments.
Study large-scale data pipeline construction.

Behavioral Questions

Describe a challenging project collaboration.
Explain how you resolve complex conflicts.
Discuss past innovative problem-solving approaches.
Share experiences adapting to team feedback.

Frequently Asked Questions