14 days ago

Site Reliability Engineer

Joveo AI

Hybrid
Full Time
$150,000

Job Overview

Job Title: Site Reliability Engineer
Job Type: Full Time
Offered Salary: $150,000
Location: Hybrid

Job Description

About Joveo

Every company says they're "AI-first." We actually are. Joveo's recruitment advertising platform processes millions of hiring decisions through machine learning, real-time bidding, and predictive analytics - helping the world's largest employers find the right people, faster and fairer. But we're not done. Not even close.

Role: Site Reliability Engineer (SRE)

Location: Remote

Role Overview:

We are hiring a Site Reliability Engineer to own the availability, performance, and scalability of Joveo's production systems. You will apply software engineering principles to infrastructure and operations - reducing toil, improving observability, and keeping our platform at the reliability levels our clients depend on.

Key Responsibilities:

  • Define and maintain SLOs, SLIs, and error budgets for critical services
  • Lead incident response, blameless postmortems, and reliability improvements
  • Build internal tooling and automation to reduce operational toil
  • Partner with engineering teams to bake reliability into system design
  • Implement and evolve observability stacks — metrics, logs, and traces
  • Manage on-call rotations and build scalable incident runbooks

Required Skills & Qualifications:

  • Strong software engineering background with SRE or production ops experience
  • Proficiency in Python, Go, or similar for automation and tooling
  • Experience with observability platforms (Datadog, New Relic, Prometheus/Grafana)
  • Deep understanding of distributed systems, failure modes, and reliability patterns
  • Experience with Kubernetes, container orchestration, and cloud-native infrastructure
  • Strong incident management skills and a calm, structured approach to outages

Equal Opportunity Employer:

Joveo is an equal opportunity employer. We are committed to building an inclusive workplace and welcome applications from all qualified individuals regardless of race, color, ethnicity, nationality, gender, gender identity or expression, sexual orientation, age, religion, disability, marital status, or any other characteristic protected by applicable law. All hiring decisions are made solely on the basis of qualifications, skills, and demonstrated ability.

If your dream job is one that doesn’t fit neatly into a job title — apply. Joveo. Where AI meets the future of work.

Key skills/competencies

  • Site Reliability Engineering
  • Kubernetes
  • Python
  • Go
  • Observability
  • Distributed Systems
  • Incident Management
  • Cloud-Native Infrastructure
  • Automation
  • SRE

Tags:

Site Reliability Engineer
SRE
Python
Go
Kubernetes
Observability
Distributed Systems
Cloud
Automation
Production Operations

How to Get Hired at Joveo AI

  • Tailor your resume: Highlight SRE experience, Python/Go proficiency, and Kubernetes skills.
  • Craft a strong cover letter: Emphasize your understanding of distributed systems and incident management.
  • Prepare for technical interviews: Review SRE principles, automation challenges, and observability concepts.
  • Showcase problem-solving: Be ready to discuss past incidents and your approach to resolving them.
  • Research Joveo AI: Understand their AI-first mission and how reliability supports it.
