Site Reliability Engineer
@ Taskify

Hybrid
$160,000
Hybrid
Part Time
Posted 1 day ago

Your Application Journey

Personalized Resume
Apply
Email Hiring Manager
Interview

Email Hiring Manager

XXXXXXXXX XXXXXXXXXXX XXXXXXXX****** @taskify.com
Recommended after applying

Job Details

Site Reliability Engineer

Role: Build and automate systems to keep the platform reliable, scalable, and performant.

Responsibilities:

  • Automate reliability checks, capacity planning, and service-level monitoring.
  • Mentor engineers on observability, alert management, and instrumentation.
  • Lead incident response from triage through post-mortem and remediation.
  • Own and improve load-testing, disaster recovery, and chaos engineering programs.
  • Collaborate with product and platform teams to design for reliability and scalability.

Requirements:

  • Strong background in Site Reliability Engineering.
  • Proficiency with Terraform, Python, and Go.
  • Experience with AWS infrastructure.
  • Ability to solve complex problems in distributed systems.
  • Passion for observability, metrics, and dashboards.

Nice-to-Have:

  • Experience with MySQL, MongoDB, Redis, and Snowflake.
  • Background in high-growth startup environments.

About Taskify: Work remotely with a hyper-growth AI startup shaping the future of recruitment.

Key skills/competency

  • Site Reliability Engineering
  • Terraform
  • Python
  • Go
  • AWS
  • Observability
  • Distributed Systems
  • Load Testing
  • Automation
  • Scalability

How to Get Hired at Taskify

🎯 Tips for Getting Hired

  • Research Taskify culture: Study mission and recent news.
  • Tailor your resume: Highlight SRE skills and projects.
  • Showcase cloud expertise: Emphasize AWS, Terraform, Python, and Go.
  • Prepare for technical interviews: Review distributed systems problems.

📝 Interview Preparation Advice

Technical Preparation

Review Terraform, Python, and Go scripts.
Study AWS infrastructure and cloud setups.
Practice load testing and chaos engineering.
Brush up on distributed system design principles.

Behavioral Questions

Describe a past system outage response.
Explain mentor experiences with team members.
Discuss handling high-pressure incidents calmly.
Share collaboration examples with cross-functional teams.

Frequently Asked Questions