14 days ago

Site Reliability Engineer II- Data Platforms

UNFI

Hybrid
Full Time
$120,000
Hybrid

Job Overview

Job TitleSite Reliability Engineer II- Data Platforms
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$120,000
LocationHybrid

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

Overview

The Data Platform Reliability Engineer role at UNFI is responsible for ensuring the stability, performance, and operational reliability of UNFI’s cloud-based and legacy data platforms. In this role, you will monitor, troubleshoot, and automate operational workflows for Databricks, AWS, and various enterprise ingestion and BI tools.

Job Responsibilities

Platform Reliability & Monitoring: Monitor Databricks clusters, jobs, and workflows; maintain dashboards, alerts and logs; respond to incidents and perform root cause analysis.

Cost and Performance Management: Optimize platform costs, implement cost-control measures and monitor spend for Databricks and AWS services.

Monitoring and Observability: Build dashboards, configure alerts and tune thresholds for improved signal-to-action ratio.

External Support Team & Vendor Management: Coordinate with external partners and vendors to troubleshoot, maintain SLAs and update runbooks.

Continuous Improvement & BI Platform Operations: Drive automation, optimize resource utilization and support BI tools like Power BI, Tableau, and Alteryx.

Job Requirements

Education/Certifications: Bachelor’s degree in Computer Science, Data Analytics, Systems Analysis, or related field.

Experience: 3+ years in data platform operations or reliability engineering with hands-on experience in Databricks and AWS production environments. Familiarity with tools such as Fivetran, AWS DMS, DataStage, Informatica and BI platforms.

Knowledge/Skills: Strong troubleshooting, incident management, and knowledge of governance, security, and RBAC principles. Ability to work independently and collaborate in a remote setting.

Work Environment

This is a remote position, although occasional visits to an office or UNFI locations may be required. The role demands long periods at a desk and occasional physical movement.

Key skills/competency

  • Databricks
  • AWS
  • Monitoring
  • Incident Management
  • Cost Optimization
  • Dashboarding
  • Automation
  • BI Tools
  • Vendor Coordination
  • Data Platforms

Tags:

Site Reliability Engineer II- Data Platforms
Databricks
AWS
Monitoring
Automation
Data Platforms
BI Tools
Troubleshooting
Vendor Management
Cost Optimization
Cloud
Ingestion Tools
Power BI
Tableau
Alteryx
Fivetran
AWS DMS
Informatica
DataStage

Share Job:

How to Get Hired at UNFI

  • Customize your resume: Highlight relevant data platform experience.
  • Showcase cloud skills: Emphasize AWS and Databricks expertise.
  • Demonstrate problem-solving: Share incident management success stories.
  • Prepare for technical questions: Study cost optimization and automation.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background