
Director, Data Center Facility Operations - Saline Township, MI
Oracle · United States
- Hybrid
- Full-time
- $215,000 / year
- United States
Job highlights
- Lead 24/7 data center operations and uptime.
- Govern MCO operating model and incident response.
- Define enterprise strategy for monitoring and reliability.
- Oversee capacity, resiliency, and site readiness.
- Drive automation and operational tooling adoption.
About the role
Director, Data Center Facility Operations
Leads enterprise-wide performance monitoring and real-time operational governance, ensuring standardized processes for shift operations, event management, escalation, incident command, and communications. Oversees capacity and readiness for critical infrastructure (power, cooling, controls, life safety, and physical security), ensuring sites are resilient, compliant, and audit-ready.
Partners with executive leadership on multi-year operational, reliability, and financial targets; drives adoption of automation, telemetry, and predictive maintenance to reduce risk and improve mean time to restore (MTTR). Establishes crisis management standards, continuous improvement mechanisms, and a culture of operational excellence, knowledge sharing, and accountability.
Leads major expansion and transformation initiatives impacting operational readiness, serves as senior liaison across regions, and oversees the full lifecycle of critical infrastructure and hardware assets—including install, maintenance strategy, spares, vendor performance, and investment governance—to optimize reliability, security, and scalability.
Responsibilities
Key Responsibilities
24/7 Mission Critical Operations Leadership
- Owns 100% uptime operations for a portfolio of very large/complex data center sites, ensuring consistent execution of shift coverage, operational handoffs, and standardized runbooks.
- Establishes and governs the Mission Critical Operations (MCO) operating model: command structure, on-call rotations, escalation paths, and service-impacting event response.
- Ensures operational readiness for high-severity incidents through drills/tabletops, incident commander training, and continuous improvement of response playbooks.
Performance Monitoring, Controls, and Reliability
- Defines the enterprise strategy for real-time monitoring and operational health across the portfolio (BMS/EPMS/SCADA/telemetry), aligning KPIs to uptime, reliability, safety, and customer outcomes.
- Drives operating rhythms for reviewing: availability, MTTR/MTBF, alarm quality, repeat events, maintenance effectiveness, and risk posture.
- Establishes standards for preventive and predictive maintenance, MOP/SOP/EOP quality, change control, and operational compliance.
Incident, Problem, and Crisis Management
- Governs standards for event triage, incident command, escalation, stakeholder communications, and customer-impacting notifications.
- Leads post-incident reviews for P1/P0 events, ensuring root cause analysis (RCA) quality, corrective/preventive actions (CAPA), and verified closure.
- Operates as executive escalation point for highly complex incidents and cross-regional reliability risks.
Capacity, Resiliency, and Site Readiness
- Oversees evaluation of power, cooling, physical space, network/support infrastructure, and security capacity, ensuring readiness for load growth and peak conditions.
- Ensures resiliency standards are met (redundancy, maintenance windows, failover testing, generator/UPS readiness, fuel strategy as applicable).
- Directs operational risk assessments and ensures sites remain audit-ready and compliant with applicable standards and internal controls.
Automation and Operational Tooling
- Drives adoption of automation for alarm correlation, workflow orchestration, remote operations, and predictive analytics to reduce human error and improve response times.
- Standardizes data quality and instrumentation required for high-confidence operational decision-making.
Expansion, Launch, and Transformation (Operational Readiness Focus)
- Leads operational support for expansions/new builds/site launches, ensuring Day-0/Day-1 readiness, staffing, training, spares, procedures, and turnover acceptance criteria.
- Partners with engineering and construction to embed operability, maintainability, and safety into design and commissioning.
Asset Lifecycle, Vendors, and Investment Governance
- Oversees lifecycle strategy for critical infrastructure and supporting hardware assets: installation, maintenance, spares, logistics, inventory, and decommissioning.
- Establishes enterprise standards for vendor performance, SLAs, service quality, and compliance; drives corrective actions where performance gaps exist.
- Approves and manages multi-million dollar investments in upgrades, capacity expansion, reliability improvements, and risk remediation.
Core Leadership Responsibilities (unchanged But Aligned To 24/7 Ops)
Planning & Execution
Provides strategic oversight for mission-critical operational initiatives, ensuring priorities reflect reliability risk, customer impact, and compliance needs.
Collaboration & Partnership
Sets direction and builds strong partnerships with engineering, construction, security, network/IT, program management, and business stakeholders to ensure reliable 24/7 delivery.
Problem Solving
Serves as escalation for complex operational/technical issues; drives disciplined, data-driven resolution and prevention of recurrence.
Continuous Learning / Improvement
Champions operational excellence through training programs, certifications, drills, and a sustained improvement roadmap aligned to availability and risk reduction.
Performance and Development
Builds and develops a high-performing 24/7 operations organization, including shift leaders, incident commanders, and regional operations management. This role supports a 24/7/365 environment and will require participation and managing incident and team management across all shifts. Safety emphasis: explicit accountability for life safety and safe work practices (LOTO, energized work policies as applicable).
Qualifications
Disclaimer: Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $139,400 - $291,800 per year. May be eligible for bonus, equity, and compensation deferral.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business. Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following: Medical, dental, and vision insurance, including expert medical opinion Short term disability and long term disability Life insurance and AD&D Supplemental life insurance (Employee/Spouse/Child) Health care and dependent care Flexible Spending Accounts Pre-tax commuter and parking benefits 401(k) Savings and Investment Plan with company match Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation. 11 paid holidays Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours. Paid parental leave Adoption assistance Employee Stock Purchase Plan Financial planning and group legal Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - M4
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Key skills/competency
- Data Center Operations Leadership
- Performance Monitoring
- Reliability Engineering
- Incident Management
- Crisis Management
- Capacity Planning
- Automation Strategy
- Asset Lifecycle Management
- Vendor Management
- Operational Excellence
Skills & topics
- Director of Data Center Operations
- Data Center Management
- Facility Operations
- Critical Infrastructure
- Reliability Engineering
- Performance Monitoring
- Incident Management
- Crisis Management
- Capacity Planning
- Automation
How to get hired
- Tailor your resume: Highlight experience in 24/7 operations, critical infrastructure management, and reliability engineering relevant to Oracle's needs.
- Showcase leadership skills: Emphasize your ability to lead teams, manage complex incidents, and drive operational excellence in a data center environment.
- Quantify achievements: Use data to demonstrate your impact on uptime, MTTR reduction, cost savings, and process improvements.
- Research Oracle: Understand their commitment to AI, cloud solutions, and operational excellence to align your application with their mission.
- Prepare for interviews: Be ready to discuss your experience with incident command, crisis management, and strategic planning for data center facilities.
Technical preparation
Behavioral questions
Frequently asked questions
- What are the key responsibilities for a Director of Data Center Facility Operations at Oracle?
- The Director of Data Center Facility Operations at Oracle is responsible for leading 24/7 mission-critical operations, ensuring 100% uptime, managing incident and crisis response, overseeing capacity and resiliency, driving automation, and managing the lifecycle of critical infrastructure assets. This role also involves strategic planning, collaboration with various departments, and developing a high-performing operations team.
- What is the salary range for this Director, Data Center Facility Operations role at Oracle in the US?
- The hiring range for the Director, Data Center Facility Operations role in the US is $139,400 - $291,800 per year, with potential eligibility for bonus, equity, and compensation deferral. Actual compensation will depend on factors like experience, skills, and location.
- What benefits does Oracle offer for this position?
- Oracle offers a comprehensive benefits package including medical, dental, and vision insurance, disability coverage, life insurance, flexible spending accounts, a 401(k) plan with company match, paid time off (vacation, holidays, sick leave), paid parental leave, adoption assistance, and an Employee Stock Purchase Plan.
- What kind of technical skills are essential for this role?
- Essential technical skills include expertise in data center infrastructure (power, cooling, controls), performance monitoring tools (BMS/EPMS/SCADA/telemetry), incident management systems, automation, predictive maintenance, and asset lifecycle management. A strong understanding of reliability engineering principles and change control processes is also crucial.
- How does Oracle approach diversity and inclusion for this role?
- Oracle is committed to diversity and inclusion, encouraging all qualified applicants regardless of race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status. They also consider qualified applicants with arrest and conviction records and provide accessibility assistance for individuals with disabilities.
- What does 'MCO' stand for in the context of Oracle's data center operations?
- MCO stands for Mission Critical Operations. In this role, it refers to the operating model established and governed by the Director, Data Center Facility Operations, encompassing the command structure, on-call rotations, escalation paths, and response protocols for critical events.