Senior Site Reliability Engineer
@ Rithum

Hybrid
Hybrid
Posted 5 days ago

Your Application Journey

Personalized Resume
Apply
Email Hiring Manager
Interview

Email Hiring Manager

XXXXXXXXXX XXXXXXXXX XXXXXX******* @rithum.com
Recommended after applying

Job Details

Overview

Rithum™ is the world’s most trusted commerce network, accelerating how brands, suppliers, and retailers work together. As a Senior Site Reliability Engineer, you will build and run large-scale, distributed, fault-tolerant systems and ensure they meet uptime and availability targets.

Responsibilities

  • Collaborate with developers and cross-functional teams on automation and reliability.
  • Design, implement, and maintain observability systems with AI/ML for anomaly detection.
  • Analyze and resolve issues in both legacy and modern environments.
  • Participate in a rotating on-call schedule to manage incidents.
  • Drive automation and operational efficiency.

Qualifications

Minimum: 3+ years as an SRE, DevOps Engineer or related role. Experience with AWS, logging and monitoring systems (CloudWatch, Grafana, Prometheus), high-level and scripting languages (Python, Bash, Typescript), IaC tools (CDK, Terraform, Ansible), and containerization (EKS, ECS).

Preferred: Bachelor’s degree or equivalent experience in Computer Science, collaborative work experience, and excellent communication skills.

What It’s Like to Work at Rithum

You will work with smart risk-takers and courageous collaborators in an inclusive, remote-first culture. Enjoy work-life balance, competitive compensation, wellness and professional development benefits, and flexible working conditions.

Benefits

  • Enhanced Private Medical Insurance and Health Cash Back Plan.
  • Life insurance & disability benefits.
  • Pension plan with 4% Company match.
  • Competitive time off package including PTO, holidays, wellness days, and volunteer day.
  • Flexible work locations – home, London office, or both.
  • Professional development stipend and learning offerings.
  • Charitable contribution match per team member.

Key skills/competency

  • Site Reliability
  • Automation
  • Observability
  • Cloud Computing
  • Scripting
  • IaC
  • Monitoring
  • AI/ML
  • DevOps
  • Incident Response

How to Get Hired at Rithum

🎯 Tips for Getting Hired

  • Research Rithum's culture: Review mission, values, and recent news.
  • Customize your resume: Highlight SRE and AWS experience.
  • Demonstrate technical skills: Emphasize AI/ML and monitoring projects.
  • Prepare for behavioral questions: Focus on collaboration and mentorship.

📝 Interview Preparation Advice

Technical Preparation

Review AWS architecture and multi-region strategies.
Practice IaC with Terraform and Ansible.
Study monitoring tools like Grafana and Prometheus.
Brush up on scripting in Python and Bash.

Behavioral Questions

Describe a time you managed a system outage.
Explain collaboration with cross-functional teams.
Discuss a mentoring experience with junior staff.
Share how you prioritize tasks independently.

Frequently Asked Questions