Platform Engineer II (Observability)
@ Iterable

Hybrid
Posted 9 days ago

Job Details

Company Overview

Iterable is the leading AI-powered customer engagement platform, helping brands like Redfin, SeatGeek, Priceline, Calm, and Box create dynamic, individualized experiences at scale. Our platform empowers organizations to activate customer data, design seamless cross-channel interactions, and optimize engagement, all with enterprise-grade security and compliance.

Today, nearly 1,200 brands across 50+ countries rely on Iterable to drive growth, deepen customer relationships, and deliver joyful customer experiences.

Our success is powered by extraordinary people who bring our core values (Trust, Growth Mindset, Balance, and Humility) to life. We foster a culture of innovation, collaboration, and inclusion, where ideas are valued and individuals are empowered to do their best work. That’s why we’ve been recognized as one of Inc’s Best Workplaces and Fastest Growing Companies, and named to Forbes’ list of America’s Best Startup Employers in 2022.

Iterable has also been listed on Wealthfront’s Career Launching Companies List and has held a top 10 ranking on the Top 25 Companies Where Women Want to Work. With a global presence—including offices in San Francisco, New York, Denver, London, and Lisbon, plus remote employees worldwide—we are committed to building a diverse and inclusive workplace. We welcome candidates from all backgrounds and encourage you to apply.

Learn more about our story and mission on our Culture and About Us pages. Let’s shape the future of customer engagement together!

How You Will Make an Impact

At Iterable, the Observability team enables engineering teams to measure, diagnose, and improve system health. We own and evolve Iterable’s monitoring, logging, tracing, and metrics platforms—turning raw telemetry into actionable insight. As a Platform Engineer II – Observability on our tight-knit team, you’ll drive reliability by implementing modern monitoring, automation, and orchestration practices that keep our systems performing at their best.

What You’ll Do

  • Own the full observability stack (Datadog, Prometheus, Grafana, Elasticsearch, Quickwit, OpenTelemetry)—design, deploy, and scale it to support petabyte-scale telemetry.
  • Instrument and automate monitoring, logging, tracing, and metrics to ensure system visibility across 100+ services and multiple Kubernetes clusters.
  • Ship platform features—contribute code that boosts reliability, performance, and developer experience across Iterable.
  • Partner with engineering teams to improve instrumentation, refine dashboards/alerts, and embed observability into their SDLC.
  • Reduce MTTR & cost—design cost-effective telemetry pipelines and create high-signal, low-noise alerting strategies.
  • Participate in our on-call rotation that prioritizes recovery, postmortems, and continuous improvement.

What We’re Looking For

  • 2+ years of professional software, infrastructure, or SRE experience.
  • Hands-on work with Kubernetes (and Docker) in production.
  • Deep experience with at least one cloud provider (AWS preferred) and Infrastructure-as-Code (Terraform, Helm, GitOps).
  • Strong programming/scripting skills in Python, Go, or similar.
  • Experience using or supporting observability platforms (Datadog, Prometheus, Elastic, OpenTelemetry, etc.) in a production environment.
  • Familiarity with CI/CD pipelines and modern DevOps practices.
  • A growth mindset, humility, and a desire to elevate those around you.
  • Bachelor’s degree in CS/Engineering — or the equivalent real-world experience.

Bonus Points

  • Built or run OpenTelemetry Collectors at scale.
  • Operated large K8s clusters or written controllers/operators.
  • Experience with GitOps.
  • Designed and executed observability cost optimization initiatives.
  • Experience in distributed tracing and high-cardinality metrics strategies.

Perks & Benefits

  • Paid parental leave.
  • Competitive salaries, meaningful equity, & 401(k) plan.
  • Medical, dental, vision, & life insurance.
  • Balance Days (additional paid holidays).
  • Fertility & Adoption Assistance.
  • Paid Sabbatical.
  • Flexible PTO.
  • Monthly Employee Wellness allowance.
  • Monthly Professional Development allowance.
  • Pre-tax commuter benefits.
  • Complete laptop workstation.

Compensation

The US base salary range for this position at the start of employment is $114,000 - $188,000. Within this range, individual pay is determined by specific US work location, as well as additional factors, including job-related skills, experience, relevant education or training, and internal equity considerations. Please note that the range listed above reflects only base salary. The total compensation package includes variable pay (where applicable), equity, plus a range of benefits, including medical, dental, vision, and financial.

In addition, we offer perks such as generous stipends for health & fitness and learning & development, among others.

How to Get Hired at Iterable

🎯 Tips for Getting Hired

  • Tailor your resume: Customize your resume to reflect relevant experience and skills for the Platform Engineer II role.
  • Network: Connect with current Iterable employees on LinkedIn for insights.
  • Prepare for technical interviews: Brush up on Kubernetes, Python, and observability tools.
  • Demonstrate your values: Share examples showcasing Iterable's values of Trust, Growth Mindset, Balance, and Humility.

📝 Interview Preparation Advice

Technical Preparation

  • Familiarize yourself with Datadog monitoring tools.
  • Practice Kubernetes administration and orchestration patterns.
  • Study Infrastructure-as-Code tools like Terraform.
  • Review Python and Go programming best practices.

Behavioral Questions

  • Prepare examples of teamwork in technical projects.
  • Think of times you resolved system performance issues.
  • Reflect on how you handle constructive criticism.
  • Consider scenarios where you took initiative to improve.
