24 days ago

Staff Site Reliability Engineer

Qonto

Hybrid
Full Time
$150,000
Hybrid
Apply

Job Overview

Job TitleStaff Site Reliability Engineer
Job TypeFull Time
Offered Salary$150,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

Mission

Join Qonto as a Staff Site Reliability Engineer to be the strongest technical voice on our Platform Reliability team. Help us scale a reliable infrastructure as Qonto grows toward 1 million customers across Europe.

Impact

As a Staff SRE, you will play a key role in shaping how our platform evolves. You will frame complex infrastructure challenges, drive architectural decisions, and enable the entire tech department to ship faster and more reliably. You'll be a key technical reference, a mentor for junior engineers, and an active contributor to our knowledge-sharing culture. The SRE department is divided into 2 teams (18 talented engineers): Platform and Storage. You will join the Platform Reliability team, who believe that reliability is built before problems happen, not after.

Responsibilities

  • Frame complex infrastructure problems, propose clear solutions, and drive projects end-to-end on your own initiative.
  • Work with backend, data, security, and engineering efficiency teams to design, deploy, and maintain our infrastructure.
  • Spend 20 to 40% of your time writing Go services, tools, and APIs, adhering to the same standards as our backend engineers.
  • Automate repetitive tasks to minimize toil.
  • Ensure the platform is visible and debuggable through logs, metrics, and tracing.
  • Participate in the on-call rotation, lead post-incident reviews, and implement lasting fixes for incidents.
  • Share knowledge, challenge ideas, and mentor teammates.

Tech Stack

  • Cloud Services: AWS, EKS
  • Container Technology: Kubernetes, Docker
  • CI/CD: GitLab CI & ArgoCD
  • Monitoring: Prometheus, Thanos, OpenTelemetry, OpsGenie, Elasticsearch, Loki
  • Databases & Messaging: AWS RDS PostgreSQL, SQS, Redis, Kafka
  • Programming Languages: Go & Python
  • Infrastructure as Code: Terraform

What You Can Expect

  • Design robust solutions at scale: managing 25k pods, 86 microservices, and 1300 deployments per month.
  • Full autonomy to propose and drive your own ideas using lean methodologies and a bottom-up approach.
  • Modern ways of working: GitOps, AI-assisted engineering, and the freedom to challenge the technical status quo.
  • Deep cross-team collaboration: participate in spec reviews, brainstorming sessions, and problem-solving with various teams.

About You

  • Experience: Strong hands-on experience with cloud-native infrastructure in production, including managing Kubernetes clusters at scale.
  • Programming Skills: Solid Go experience (or equivalent) and comfort building tools, services, and automation.
  • AI-driven Engineering: Active use of AI tools to enhance speed, code quality, and problem-solving; curiosity for future AI advancements.
  • Problem Solver: Understand the full system, dependencies, trade-offs, and long-term impact before proposing solutions; anticipate problems.
  • Team Player: Naturally share knowledge, provide clear feedback, and help less experienced engineers grow.
At Qonto, we understand that true diversity isn't just about ticking boxes on a hiring checklist. Apply regardless of the boxes you tick! Who knows? You may have the missing piece of the puzzle we've been searching for all along. Our hiring process typically lasts 20 working days. More information on our candidate journey is available [here](link-to-candidate-journey). Recruitment scams are on the rise. We will never work with third-party platforms or agencies that request payment from candidates. If you receive a suspicious message claiming to be from Qonto, please report it right away to support@qonto.com.

Key skills/competency

Staff Site Reliability Engineer, Platform Reliability, Infrastructure, AWS, EKS, Kubernetes, Go, Python, Terraform, CI/CD

Tags:

Staff Site Reliability Engineer
SRE
Platform Reliability
Infrastructure
AWS
Kubernetes
Go
Python
Terraform
CI/CD
Observability
Automation
Cloud Native
Microservices

Share Job:

How to Get Hired at Qonto

  • Tailor your resume: Highlight your experience with Kubernetes, Go, and cloud-native infrastructure. Emphasize your AI-driven engineering skills and problem-solving approach.
  • Showcase your impact: Quantify your achievements in previous roles, especially in scaling infrastructure or improving reliability.
  • Prepare for technical interviews: Be ready to discuss complex infrastructure challenges, system design, and Go programming concepts. Practice coding exercises.
  • Demonstrate team collaboration: Prepare examples of how you've mentored engineers, shared knowledge, and worked effectively in cross-functional teams.
  • Understand Qonto's culture: Research their mission, values, and commitment to AI. Show how your skills and mindset align with their goals.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background