Job Overview
Job Title: AI Observability Engineer
Job Type: Contractor
Offered Salary: $150,000
Location: Hybrid

Job Description
Observability Engineer
We are seeking an experienced Observability Engineer with 7+ years of experience to design, implement, and govern observability solutions across modern distributed systems. The ideal candidate will have strong hands-on experience with OpenTelemetry, be capable of executing and validating observability-related test cases, and define standards, best practices, and reusable blueprints. Familiarity with Arize AI / Arize AX is a strong plus, especially in environments involving ML or AI-powered systems.
Key Responsibilities
Observability Design & Implementation
- Design, implement, and maintain observability solutions using OpenTelemetry for metrics, logs, and traces.
- Support use-case-specific needs and custom requirements.
- Instrument applications and services across microservices, cloud-native, and hybrid environments.
- Ensure consistent telemetry data collection aligned with architectural and organizational standards.
Standards, Blueprints & Governance
- Define and maintain observability standards, conventions, and naming strategies.
- Create reusable blueprints, reference architectures, and dashboards for application teams.
- Collaborate with platform, SRE, and engineering teams to enforce observability best practices.
Qualifications
- Good understanding of LLM and AI applications.
- Proficiency in at least one programming language (e.g., Python or JavaScript).
- Experience with Arize AI / Arize AX for ML observability.
- Hands-on experience with OpenTelemetry (OTel) SDKs, collectors, and pipelines.
- Experience with observability backends (primarily Arize; also Prometheus, Grafana, Azure Monitor, Datadog, New Relic, or Elastic).
Key skills/competency
- Observability Engineer
- OpenTelemetry
- Arize AI / Arize AX
- ML Observability
- AI Applications
- Distributed Systems
- Telemetry Data
- Metrics, Logs, Traces
- Python/JavaScript
- SRE
How to Get Hired at Bytespoke
- Tailor your resume: Highlight your 7+ years of experience in observability, OpenTelemetry, and AI/ML systems. Emphasize Arize AI/AX if applicable.
- Showcase technical skills: Detail your experience with metrics, logs, traces, and specific observability backends like Prometheus, Grafana, Datadog, or Elastic.
- Demonstrate governance experience: Provide examples of how you've defined standards, best practices, and reusable blueprints for observability.
- Prepare for technical interviews: Be ready to discuss your approach to designing and implementing observability solutions for distributed systems, including AI/ML applications.
- Understand company needs: Research Bytespoke's focus on modern distributed systems and AI, and how your skills align with their goals.
Frequently Asked Questions
01. What specific OpenTelemetry components are most critical for this AI Observability Engineer role at Bytespoke?
02. How important is Arize AI / Arize AX experience for the Observability Engineer job at Bytespoke?
03. What programming languages are preferred for the Observability Engineer position at Bytespoke?
04. What kind of distributed systems will an AI Observability Engineer at Bytespoke work with?
05. What does 'governance' entail for an Observability Engineer at Bytespoke?