DevOps Engineer
Globaltize
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
Introduction
Work Schedule: Full-Time daytime role and 24/7 on-call/on-alert availability for critical incidents, outages, or platform failures.Location: Remote – Latam, South America (Preferred)Compensation: Based on Experience
Overview
Build the future of resilient, global SaaS infrastructure. Globaltize is a high-growth technology organization dedicated to building and operating resilient SaaS infrastructure with a focus on high availability, disaster recovery, and business continuity. Our platform is currently Multi-AZ, and we are on an ambitious journey to evolve into a Multi-region and eventually Multi-cloud architecture to meet strict customer SLAs. We value a proactive monitoring mindset, reliability, and ownership. In this DevOps Engineer role, you will be a key player in ensuring our platform remains fault-tolerant and scalable while working with a modern cloud-native stack.
Key Responsibilities
- Platform Monitoring: Proactively monitor platform and client deployments to identify, troubleshoot, and resolve issues across production and non-production environments.
- Infrastructure Maintenance: Maintain and upgrade core components, including EKS clusters, networking, compute, and storage.
- Vulnerability Management: Oversee infrastructure vulnerability management and patching.
- Systems Design: Design, implement, and maintain end-to-end monitoring and alerting systems for infrastructure, applications, and security events.
- Architecture Evolution: Evolve the platform towards high resilience and fault tolerance, moving from Multi-AZ to Multi-region and Multi-cloud.
- Terraform Refactoring: Refactor and enhance Terraform modules for VPN, networking, IAM, EKS, and CI/CD to improve reliability and scalability.
- Deployment Optimization: Support and optimize client deployment processes, including Helm charts and configuration management.
- Cross-Functional Collaboration: Partner with engineering, security, and compliance teams to ensure secure and compliant infrastructure.
- Incident Management: Participate in incident response, post-mortems, DR testing, and failover exercises.
Must-Have Requirements
- SaaS Infrastructure Expertise: Proven experience building and operating resilient SaaS infrastructure including high availability and disaster recovery.
- Cloud & Automation: Production experience with cloud-native architectures, automation, and GitOps.
- Terraform & Kubernetes: Strong expertise in Terraform and Kubernetes cluster management, including scaling, observability, and troubleshooting.
- AWS Networking: Deep knowledge of AWS (VPC design, networking, routing, load balancers) and services like EC2, EKS, ECS/Fargate, Lambda, and S3.
- Compliance Knowledge: Experience working with ISO 27001 or SOC 2 compliance requirements.
- Tooling & Databases: Proficiency with Helm chart development, Docker, PostgreSQL, MongoDB, and DynamoDB.
- Monitoring Skills: Experience with CloudWatch, WAF, and building alerting systems.
- Professional Competencies: Strong problem-solving skills, bilingual English communication, and the ability to handle incident response under pressure.
- Availability: Accountability for 24/7 on-call availability for critical incidents.
Nice-to-Have Requirements
- Advanced Architectures: Specific experience refactoring Terraform modules for complex CI/CD and networking environments.
- Resilience Planning: Background in conducting high-stakes DR testing and failover exercises.
- Future Tech: Experience or strong interest in evolving platforms from Multi-AZ to Multi-cloud designs.
Why Join Us
This is an opportunity to take true ownership of a platform's reliability and play a foundational role in its global evolution. You will work in a continuous improvement environment where your analytical thinking and attention to detail directly impact our path toward a Multi-cloud future. We offer a culture of accountability and collaboration, giving you the chance to work across cross-functional teams to solve complex infrastructure challenges. If you are ready to apply your expertise in cloud-native automation to build something truly resilient, we invite you to take the next step in your career with us.
Key skills/competency
- SaaS Infrastructure
- Cloud-Native Architectures
- Terraform
- Kubernetes
- AWS Networking
- GitOps
- Monitoring Systems
- Incident Response
- ISO 27001 / SOC 2 Compliance
- PostgreSQL / MongoDB / DynamoDB
How to Get Hired at Globaltize
- Research Globaltize's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Customize your resume: Highlight proven SaaS infrastructure, Terraform, Kubernetes, and AWS expertise.
- Showcase cloud-native skills: Emphasize experience with GitOps, automation, and multi-cloud evolution.
- Prepare for technical deep-dives: Expect questions on AWS networking, EKS, monitoring, and incident response.
- Demonstrate proactive ownership: Be ready to discuss 24/7 on-call availability and problem-solving under pressure.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background