Research Compute Operations
Anthropic
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About The Research Compute Operations Role
Anthropic's researchers use internal tooling and infrastructure to run the experiments that advance AI safety and capability. This role owns the researcher experience with that tooling — both the day-to-day support and the longer-term product vision. You'll be the person researchers come to when they need help, and the person driving improvements and automation to make that manual help unnecessary over time.
This role sits on the Capacity Operations team at the intersection of research and infrastructure.
Responsibilities
- Serve as a primary point of contact for researchers using internal compute infrastructure, including triaging access issues, resolving researcher requests, and real-time monitoring
- Proactively monitor usage patterns and work with researchers to optimize their workloads
- Help design the product roadmap for research inference tooling. You will gather user feedback, prioritize improvements, and drive execution
- Prototype better tools: dashboards, automations, self-service workflows, and more intuitive interfaces for complex systems
- Build automations (using Claude) for common operational workflows
You may be a good fit if you
- Have an engineering background (or equivalent technical depth) and have transitioned into or are drawn to product management, technical operations, or systems design work
- Can query data, understand infrastructure, debug issues, and build tools and scripts to prototype solutions quickly
- Are a systems-thinker: when a researcher hits a confusing error, you don't just fix it, you ask why the system produced it and how to prevent it for everyone
- Are comfortable navigating ambiguity across teams and context-switching between tactical support and strategic design
- Use Claude or other AI tools daily and are excited to teach others your best practices
Strong candidates may also have
- An understanding of compute infrastructure and familiarity with concepts like rate limiting, autoscaling, and request prioritization
- Background in ML infrastructure, ML engineering, or research engineering
- Experience with large-scale accelerator clusters (TPUs, GPUs, or similar)
- Familiarity with ML training pipelines and how they consume inference capacity
- Track record of building internal tools or developer platforms that people actually love using
- Experience in developer experience (DevEx) or platform engineering
Compensation and Logistics
The annual compensation range for this role is $270,000—$290,000 USD.
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. We do sponsor visas and our headquarters are in San Francisco.
How We're Different
At Anthropic, we believe the highest-impact AI research will be big science. We work as a single cohesive team on a few large-scale research efforts, valuing impact in advancing our long-term goals of steerable, trustworthy AI. We view AI research as an empirical science, emphasizing collaboration and communication skills.
Our research directions include GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. We are a public benefit corporation headquartered in San Francisco, offering competitive compensation, benefits, and flexible working hours.
Key skills/competency
- Compute Infrastructure
- ML Operations
- Systems Design
- Automation
- User Support
- Product Management
- Debugging
- Workload Optimization
- AI Tools (Claude)
- Developer Experience
How to Get Hired at Anthropic
- Research Anthropic's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume: Highlight experience in compute infrastructure, ML operations, and systems design specific to research environments.
- Showcase problem-solving: Prepare examples demonstrating systems thinking, debugging complex issues, and proactive optimization.
- Emphasize AI tool proficiency: Discuss your experience with Claude or other AI tools, showcasing how you leverage them for automation and best practices.
- Demonstrate collaborative impact: Share instances where you contributed to large-scale research efforts or improved developer experience.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background