13 hours ago

Sustaining Operations Engineer

Canonical

Hybrid
Full Time
$120,000
Hybrid

Job Overview

Job TitleSustaining Operations Engineer
Job TypeFull Time
Offered Salary$120,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About Canonical

Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world on a daily basis. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do.

Canonical has been a remote-first company since its inception in 2004. Work at Canonical is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game. Canonical provides a unique window into the world of 21st-century digital business.

The Role: Sustaining Operations Engineer

This is a fast-paced engineering role in Linux-based software-defined infrastructure and applications, covering all layers of the stack, including bare metal, virtualization (KVM) and containerization (Docker and LXC/LXD), storage (Ceph and Linux filesystems), networking (OVS, OVN and Core networking), up to OpenStack and Kubernetes, and the open source applications running on top of them.

This role is an opportunity for a technologist with a passion for Linux and open source to build a career with Canonical and drive success for our customers, community, and the company. If you have great communication skills, and a passion for troubleshooting and fixing issues in technology used by millions across the world, then you will enjoy working with some of the best people in the industry at Canonical.

This is a globally remote role.

This role deals with critical issues in the open source stack that require deep debugging and troubleshooting skills. Our engineers have to be able to work productively at any level of the stack above the kernel, in a wide range of applications, to understand and address the software issues at hand. Our group is critical to the success of our enterprise customers, partners, and Ubuntu itself.

You will be the final point of escalation for operational troubleshooting and driving issues to resolution with workarounds, guidance, and fixes to be released upstream and in Ubuntu.

What your day will look like

  • Resolve complex customer problems related to Ubuntu, OpenStack, Ceph, and/or Kubernetes
  • Maintain a close working relationship with Canonical's field, support, and product engineering teams
  • Participate in upstream communities
  • Debug issues, propose workarounds, liaise with Software Engineers on producing a patch
  • Demonstrate good judgment in technical methods and techniques
  • Prioritize work and manage your time effectively against priorities
  • Participate in team activities to improve processes, tools, and documentation
  • Maintain clear, technical, and concise communications
  • Participate in a regular weekend working rotation
  • Provide subject matter expertise as the final point of escalation on operational issues
  • Work from home and travel internationally up to 10% of work time for team meetings, events, and conferences

What we are looking for in you

  • Professional experience troubleshooting advanced Linux issues
  • Background in Computer Science, STEM, or similar
  • Exceptionally strong experience with either Linux, LXD, OpenStack, Ceph, or Kubernetes
  • Strong debugging experience with Python, Go, C, or C++ on Linux
  • Ability to troubleshoot with gdb, pdb, tcpdump, or other tools
  • Familiarity with git source code repositories and branches
  • An exceptional academic track record from both high school and preferably university
  • Willingness to travel up to 4 times a year for internal events

Additional Skills That You Might Also Bring

  • You love technology and working with brilliant people
  • You are curious, flexible, articulate, and accountable
  • You value soft skills and are passionate, enterprising, thoughtful, and self-motivated
  • You have interest in, and experience with most of the following: Ubuntu Linux - kernel or userspace, Kubernetes, OpenStack, Ceph, QEMU/KVM, LXC/LXD, Python, Go, C, Postgresql, Mongo, Debian packaging, distributed systems

What we offer you

We consider geographical location, experience, and performance in shaping compensation worldwide. We revisit compensation annually (and more often for graduates and associates) to ensure we recognise outstanding performance. In addition to base pay, we offer a performance-driven annual bonus. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.

  • Distributed work environment with twice-yearly team sprints in person - we’ve been working remotely since 2004!
  • Personal learning and development budget of USD 2,000 per year
  • Annual compensation review
  • Recognition rewards
  • Annual holiday leave
  • Maternity and paternity leave
  • Employee Assistance Programme
  • Opportunity to travel to new locations to meet colleagues from your team and others
  • Priority Pass for travel and travel upgrades for long haul company events

Key skills/competency

  • Advanced Linux Troubleshooting
  • OpenStack & Kubernetes Expertise
  • Ceph Storage Management
  • Virtualization Technologies (KVM, LXD)
  • Containerization (Docker, LXC)
  • Networking Protocols (OVS, OVN)
  • Debugging Tools (gdb, pdb, tcpdump)
  • Scripting/Programming (Python, Go, C/C++)
  • Git Source Control
  • Distributed Systems Acumen

Tags:

Sustaining Operations Engineer
troubleshooting
debugging
incident resolution
technical support
problem solving
system maintenance
infrastructure operations
root cause analysis
software patching
customer liaison
Linux
OpenStack
Kubernetes
Ceph
KVM
Docker
Python
Go
C
Git

Share Job:

How to Get Hired at Canonical

  • Research Canonical's culture: Study their mission, values, remote-first approach, and open-source contributions to align your application.
  • Tailor your resume: Highlight extensive experience with Linux, OpenStack, Kubernetes, and deep debugging skills relevant to a Sustaining Operations Engineer.
  • Showcase open source involvement: Emphasize any contributions to open-source projects, community participation, or relevant personal projects.
  • Prepare for technical depth: Expect in-depth questions on Linux internals, virtualization, containerization, networking, and scripting/debugging.
  • Demonstrate problem-solving: Practice articulating complex troubleshooting scenarios, root cause analysis, and effective resolution strategies clearly.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background