6 days ago

Cloud Infrastructure Engineer

Alchemy

On Site
Full Time
$187,500
San Francisco Bay Area

Job Overview

Job Title: Cloud Infrastructure Engineer
Job Type: Full Time
Offered Salary: $187,500
Location: San Francisco Bay Area

Job Description

About The Role

As an engineer in the Infrastructure department at Alchemy, you will design, deploy, and continuously improve the infrastructure powering our blockchain developer platform, which serves 100+ chains, billions of daily requests, and over $150B in annual transactions. The Infrastructure team provides the infrastructure, tooling, and expertise that let Alchemy engineers ship, scale, and operate high-quality products quickly, safely, and cost-efficiently.

What You'll Do

  • Architect and operate scalable, self-healing infrastructure leveraging Kubernetes, Terraform, and cloud-native tools across multi-region deployments.
  • Drive AI enablement across engineering — ensuring repos, tooling, and workflows are optimized for agentic development with tools like Claude Code, Cursor, and Codex.
  • Build AI-powered infrastructure tooling and automation (e.g., automated K8s upgrades, IaC plan analysis, cost optimization advisors, MCP servers, n8n workflows).
  • Build and maintain internal developer platform (IDP) capabilities for self-service deployments, observability, and reliability.
  • Develop observability frameworks using Prometheus and Grafana for metrics, dashboards, and alerting.
  • Lead incident management with blameless post-mortems; define and enforce SLIs, SLOs, and error budgets across services.
  • Design and manage multi-cloud, multi-region network architecture: VPC design, IPAM, DNS (Cloudflare), cross-cloud connectivity, security groups, and edge-proxy/Istio gateway configuration.
  • Collaborate with security teams to embed compliance into infrastructure, including IaC scanning and runtime protection.
  • Provide technical leadership and mentorship to elevate the team's operational capabilities.

What We're Looking For

  • 5+ years as an Infrastructure Engineer focused on reliability (SRE, Production Engineer, Platform Engineer).
  • Experience driving company-wide reliability efforts, including SLO frameworks and error budget policies.
  • Strong proficiency with observability stacks: OpenTelemetry, Prometheus/Grafana.
  • Deep experience with cloud infrastructure (AWS/GCP), Kubernetes, and multi-region architectures.
  • Skilled with Terraform, Helm, and GitOps workflows (e.g., ArgoCD) with an automation-first mindset.
  • Experience leveraging agentic development tools (Claude Code, Cursor, Codex) and workflow automation (n8n) to accelerate IaC and build internal tooling is a strong plus.
  • Solid networking fundamentals: VPC design, DNS, IPAM, security groups, and cross-cloud connectivity. Service mesh experience (e.g., Istio) is a plus.
  • Calm and effective incident responder with a focus on systemic improvement.
  • Strong cross-functional communicator across SRE, security, and product engineering.
  • Experience with blockchain infrastructure, distributed systems, or high-throughput RPC is a plus but not required.

Benefits and Perks

  • Medical, Dental, & Vision
  • Gym Reimbursement
  • Home Office Build-out Budget
  • In-Office Group Meals
  • Wellbeing & Mental Health Perks
  • Learning & Development Stipend
  • Company Sponsored Conferences & Events
  • HSA and FSA Plans
  • Fertility Benefits

More on the Role

Alchemy is committed to offering competitive compensation, including base salary as well as equity. Additionally, Alchemy offers comprehensive medical, dental, and vision coverage, as well as other benefits such as a 401(k) and unlimited flexible time off. The base salary range for this position is estimated to be between $135,000 and $240,000 annually. Please note this range reflects base salary only and does not include bonus, equity, or benefits. Your salary will be determined by various factors, including relevant experience, skill set, qualifications, and other business needs.

Key skills/competencies

  • Cloud Infrastructure Engineer
  • Kubernetes
  • Terraform
  • Observability (Prometheus, Grafana, OpenTelemetry)
  • Networking (VPC, DNS, IPAM, Istio)
  • Site Reliability Engineering (SRE)
  • GitOps (ArgoCD)
  • AI Tools (Claude Code, Cursor, Codex)
  • Incident Management
  • IaC (Infrastructure as Code)

Tags:

Cloud Infrastructure Engineer
Kubernetes
Terraform
AWS
GCP
SRE
DevOps
Site Reliability Engineer
Platform Engineer
Observability
Prometheus
Grafana
OpenTelemetry
Networking
VPC
DNS
Istio
GitOps
ArgoCD
AI Engineering
Automation
Incident Management
Infrastructure as Code

How to Get Hired at Alchemy

  • Tailor your resume: Highlight 5+ years of infrastructure engineering, SRE, or platform engineering experience, emphasizing reliability efforts, SLOs, and error budgets.
  • Showcase technical skills: Detail your proficiency with Kubernetes, Terraform, cloud platforms (AWS/GCP), observability tools (Prometheus, Grafana, OpenTelemetry), and networking fundamentals.
  • Emphasize automation and AI: Mention experience with GitOps, IaC automation, and any familiarity with agentic development tools like Claude Code or workflow automation tools like n8n.
  • Demonstrate incident response: Prepare to discuss your calm and effective incident response approach, focusing on systemic improvements and blameless post-mortems.
  • Communicate effectively: Be ready to showcase strong cross-functional communication skills, working with SRE, security, and product engineering teams.
