Site Reliability Engineer
Joveo AI
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About Joveo
Every company says they're "AI-first." We actually are. Joveo's recruitment advertising platform processes millions of hiring decisions through machine learning, real-time bidding, and predictive analytics - helping the world's largest employers find the right people, faster and fairer. But we're not done. Not even close.
Role: Site Reliability Engineer (SRE)
Location: Remote
Role Overview:
We are hiring a Site Reliability Engineer to own the availability, performance, and scalability of Joveo's production systems. You will apply software engineering principles to infrastructure and operations - reducing toil, improving observability, and keeping our platform at the reliability levels our clients depend on.
Key Responsibilities:
- Define and maintain SLOs, SLIs, and error budgets for critical services
- Lead incident response, blameless postmortems, and reliability improvements
- Build internal tooling and automation to reduce operational toil
- Partner with engineering teams to bake reliability into system design
- Implement and evolve observability stacks — metrics, logs, and traces
- Manage on-call rotations and build scalable incident runbooks
Required Skills & Qualifications:
- Strong software engineering background with SRE or production ops experience
- Proficiency in Python, Go, or similar for automation and tooling
- Experience with observability platforms (Datadog, New Relic, Prometheus/Grafana)
- Deep understanding of distributed systems, failure modes, and reliability patterns
- Experience with Kubernetes, container orchestration, and cloud-native infrastructure
- Strong incident management skills and a calm, structured approach to outages
Equal Opportunity Employer:
Joveo is an equal opportunity employer. We are committed to building an inclusive workplace and welcome applications from all qualified individuals regardless of race, color, ethnicity, nationality, gender, gender identity or expression, sexual orientation, age, religion, disability, marital status, or any other characteristic protected by applicable law. All hiring decisions are made solely on the basis of qualifications, skills, and demonstrated ability.
If your dream job is one that doesn’t fit neatly into a job title — apply. Joveo. Where AI meets the future of work.
Key skills/competency
- Site Reliability Engineering
- Kubernetes
- Python
- Go
- Observability
- Distributed Systems
- Incident Management
- Cloud-Native Infrastructure
- Automation
- SRE
How to Get Hired at Joveo AI
- Tailor your resume: Highlight SRE experience, Python/Go proficiency, and Kubernetes skills.
- Craft a strong cover letter: Emphasize your understanding of distributed systems and incident management.
- Prepare for technical interviews: Review SRE principles, automation challenges, and observability concepts.
- Showcase problem-solving: Be ready to discuss past incidents and your approach to resolving them.
- Research Joveo AI: Understand their AI-first mission and how reliability supports it.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background