11 days ago

AI Observability Engineer

Bytespoke

Hybrid
Contractor
$150,000
Hybrid
Apply

Job Overview

Job TitleAI Observability Engineer
Job TypeContractor
Offered Salary$150,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

Observability Engineer

We are seeking an experienced Observability Engineer with 7+ years of experience to design, implement, and govern observability solutions across modern distributed systems. The ideal candidate will have strong hands-on experience with OpenTelemetry, be capable of executing and validating observability-related test cases, and define standards, best practices, and reusable blueprints. Familiarity with Arize AI / Arize AX is a strong plus, especially in environments involving ML or AI-powered systems.

Key Responsibilities

Observability Design & Implementation
  • Design, implement, and maintain observability solutions using OpenTelemetry for metrics, logs, and traces.
  • Support use case needs and custom demands.
  • Instrument applications and services across microservices, cloud-native, and hybrid environments.
  • Ensure consistent telemetry data collection aligned with architectural and organizational standards.
Standards, Blueprints & Governance
  • Define and maintain observability standards, conventions, and naming strategies.
  • Create reusable blueprints, reference architectures, and dashboards for application teams.
  • Collaborate with platform, SRE, and engineering teams to enforce observability best practices.

Qualifications

  • Good understanding of LLM and AI Applications.
  • Proficiency in at least one programming language (e.g., Python, or JavaScript).
  • Experience with Arize AI / Arize AX for ML observability.
  • Hands-on experience with OpenTelemetry (OTel) SDKs, collectors, and pipelines.
  • Experience with observability backends (e.g., mainly Arize or Prometheus, Grafana, Azure Monitor, Datadog, New Relic, Elastic, etc.).

Key skills/competency

  • Observability Engineer
  • OpenTelemetry
  • Arize AI / Arize AX
  • ML Observability
  • AI Applications
  • Distributed Systems
  • Telemetry Data
  • Metrics, Logs, Traces
  • Python/JavaScript
  • SRE

Tags:

Observability Engineer
AI Observability
ML Observability
OpenTelemetry
Arize AI
Arize AX
Distributed Systems
Cloud-Native
Microservices
SRE
Python
JavaScript
Telemetry
Metrics
Logs
Traces
Observability
Platform Engineering

Share Job:

How to Get Hired at Bytespoke

  • Tailor your resume: Highlight your 7+ years of experience in observability, OpenTelemetry, and AI/ML systems. Emphasize Arize AI/AX if applicable.
  • Showcase technical skills: Detail your experience with metrics, logs, traces, and specific observability backends like Prometheus, Grafana, Datadog, or Elastic.
  • Demonstrate governance experience: Provide examples of how you've defined standards, best practices, and reusable blueprints for observability.
  • Prepare for technical interviews: Be ready to discuss your approach to designing and implementing observability solutions for distributed systems, including AI/ML applications.
  • Understand company needs: Research Bytespoke's focus on modern distributed systems and AI, and how your skills align with their goals.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background