Cloud Operations Administrator
DigitalOcean
Job Description
About DigitalOcean
Dive in and do the best work of your career at DigitalOcean. Journey alongside a strong community of top talent who are relentless in their drive to build the simplest scalable cloud. If you have a growth mindset, naturally like to think big and bold, and are energized by the fast-paced environment of a true industry disruptor, you’ll find your place here. We value winning together—while learning, having fun, and making a profound difference for the dreamers and builders in the world.
We want people who are passionate about troubleshooting complex problems with systems, networking and storage at scale.
We are looking for a seasoned system administrator to help us keep the cloud running smoothly. Reporting to the manager of Cloud Operations, the Cloud Operations Administrator monitors and provides first response to all cloud health issues that impact, or could potentially impact, the customer experience, whether internal or external. You will interface with teams across the organization to research and troubleshoot issues ranging from single droplets to cloud-wide disturbances. Our workweek spans five days, which may include weekends.
What You'll Be Doing
- Ensuring maximum uptime for our global infrastructure
- Automating processes and building tools to improve operational efficiency
- Coordinating operational work across teams to improve the platform with minimal impact
What You'll Add To DigitalOcean
- Solid experience with Linux operating systems or networking, including day-to-day upkeep
- Familiarity with virtualization technologies and troubleshooting virtual machine instances
- Familiarity with containerization technologies and troubleshooting containers
- Familiarity with IPv4 networking and troubleshooting (CCNA or equivalent)
- Familiarity with basic storage concepts and technologies
- Experience with monitoring systems and incident management
- Experience scripting in one or more of the following languages: Bash, Python, or Go
- Experience with GPU hardware or AI/ML, as well as Kubernetes
- A passion for good documentation and open communication
- Proven ability to learn!
Key skills/competency
- Cloud Operations
- System Administration
- Linux
- Networking
- Virtualization
- Containerization
- Incident Management
- Scripting (Bash, Python, Go)
- Kubernetes
- Troubleshooting
How to Get Hired at DigitalOcean
- Research DigitalOcean's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume: Highlight your cloud operations, Linux, networking, and scripting expertise, aligning with the Cloud Operations Administrator role.
- Showcase problem-solving: Prepare examples of how you've troubleshot complex system and network issues at scale, demonstrating your technical depth.
- Emphasize automation skills: Be ready to discuss your experience automating tasks and improving operational efficiency with Bash, Python, or Go.
- Demonstrate continuous learning: Express your passion for learning new technologies, especially around virtualization, containerization, AI/ML, and Kubernetes.