
Senior Site Reliability Engineer, Core AI Infrastructure
Coinbase · United States
- Hybrid
- Full-time
- $200,000 / year
Job highlights
- Own AI infrastructure reliability and automation.
- Build automation and tooling for IT workflows.
- Partner with infrastructure, security, and compliance.
- Strengthen observability and documentation standards.
- Develop full-stack applications with Go or Python.
About the role
About Coinbase
Ready to do the most impactful work of your career? At Coinbase, we are uncompromising on our mission to increase economic freedom. The bar is high, the environment is intense, and we like it that way. This isn't a place for complacency; it’s a place to be pushed past your perceived limits. If you're ready to build the future of finance alongside people who refuse to settle for "good enough," you belong here. Coinbase is a remote-first, but not remote-only, company. Expect to get together quarterly for intense in-person working sessions called “surges.”
Job Overview: Senior Site Reliability Engineer, Core AI Infrastructure
You'll join a high-performing team of engineers driving AI transformation at Coinbase as a Senior Site Reliability Engineer on the IT Operations team. This team builds and scales the infrastructure powering Coinbase's AI products, with direct exposure to senior leadership in a fast-paced, incubator-style environment. You'll own the reliability and automation of critical AI infrastructure, ensuring our systems are resilient, observable, and secure at scale.
What you’ll be doing:
- Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros.
- Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments.
- Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines.
- Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence.
- Develop full-stack applications that power internal AI products and infrastructure with Go or Python.
What we look for in you:
- 5+ years of experience automating and supporting cloud infrastructure (AWS) and network environments, with hands-on use of infrastructure-as-code tools (Terraform, Ansible, Chef, Puppet, or Salt).
- Proven experience deploying, managing, and troubleshooting containerized workloads using Docker and Kubernetes in production environments.
- Proficiency in at least one scripting or programming language (Python, Bash, Ruby, or Go) and version control workflows using Git-based CI/CD pipelines.
- Track record of leading incident response in environments with strict SLAs, including root cause analysis, blameless retros, and measurable reliability improvements.
- Responsible use of generative AI, maintaining human oversight to deliver business-ready outputs and drive measurable improvements in workflow efficiency, cost, and quality.
Nice to haves:
- Expertise with Linux, Bash, Ruby, Python and/or Go
- Expertise automating EC2 or container deployments with Terraform
- Strong network security fundamentals
- Experience managing and leveraging log aggregation
- Experience working in a highly regulated environment
- Experience in a fast-paced, high-growth company
- Experience in a remote-first IT environment
Key skills/competency
- Site Reliability Engineering
- AI Infrastructure
- AWS
- Kubernetes
- Docker
- Terraform
- CI/CD
- Python
- Go
- Incident Response
How to get hired
- Tailor your resume: Highlight experience with AWS, Kubernetes, Terraform, and CI/CD pipelines, aligning with the Senior Site Reliability Engineer role.
- Showcase incident response: Detail your track record of leading incident response, root cause analysis, and blameless retros in environments with strict SLAs.
- Emphasize automation skills: Clearly state your proficiency in scripting/programming languages like Python or Go and IaC tools such as Terraform.
- Prepare for technical questions: Be ready to discuss your experience with containerized workloads, monitoring solutions, and developing full-stack applications.
- Understand the culture: Research Coinbase's mission and values, and emphasize your drive for impact and continuous improvement in your application and interviews.
Frequently asked questions
- What are the salary expectations for a Senior Site Reliability Engineer at Coinbase?
- The annual base salary range for a Senior Site Reliability Engineer at Coinbase is $186,065 to $218,900 USD. This range excludes potential equity and bonus eligibility, as well as benefits like medical, dental, vision, and 401(k).
- How does Coinbase handle remote work for this Senior Site Reliability Engineer role?
- Coinbase operates as a remote-first company. While the role is primarily remote, expect quarterly in-person working sessions called "surges" for team collaboration and strategic alignment.
- What specific cloud infrastructure experience is required for the Senior Site Reliability Engineer position?
- The role requires at least 5 years of experience automating and supporting cloud infrastructure, specifically AWS, and network environments. Proficiency with infrastructure-as-code tools like Terraform is also essential.
- Can you provide more details on the AI aspect of the Senior Site Reliability Engineer role?
- As a Senior Site Reliability Engineer on the Core AI Infrastructure team, you will be responsible for the reliability and automation of the infrastructure powering Coinbase's AI products. This includes developing applications with Go or Python to support these internal AI products and infrastructure.
- What are the key programming languages and tools for this Senior Site Reliability Engineer job?
- Proficiency in at least one scripting or programming language such as Python, Bash, Ruby, or Go is required. Experience with version control using Git and CI/CD pipelines is also crucial. Expertise with Docker and Kubernetes for containerized workloads is necessary.
- What is Coinbase's approach to incident response for this role?
- Coinbase looks for a strong track record of leading incident response in environments with strict SLAs. This includes expertise in root cause analysis, conducting blameless retrospectives, and implementing measurable reliability improvements for AI infrastructure services.
- How does Coinbase use AI in its hiring process for the Senior Site Reliability Engineer role?
- For select roles, Coinbase pilots AI tools for initial screening interviews and interview note summarization. While AI assists in these processes, human recruiters and interviewers will review responses and make employment decisions.
