Data Center Engineering Operations Engineer
Amazon Web Services (AWS)
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About AWS Infrastructure Services
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. We are the people who keep the cloud running, supporting all AWS data centers and the servers, storage, networking, power, and cooling equipment essential for continuous customer access to innovation. We tackle the most challenging problems with thousands of variables impacting the supply chain and seek talented individuals to help.
You will join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, and operations managers. Collaboration across AWS is key to delivering the highest standards for safety and security, providing seemingly infinite capacity at the lowest possible cost for our customers. You will experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
The Role: Data Center Engineering Operations Engineer
We are seeking a Data Center Engineering Operations Engineer to serve as a technical resource supporting Amazon within our mission-critical Data Center. This position helps ensure overall availability and reliability in all electrical and mechanical infrastructure within the data center environment in a specific location. The equipment supports mission-critical servers and must maintain better than 99.999% uptime. This self-managed role requires initiative and provides effective solutions for complex technical problems. Successful candidates will have a direct and immediate impact on improving the resiliency, efficiency, and capacities of our facilities, helping Amazon's customers by maintaining, operating, and troubleshooting Mission Critical Facilities. This includes stand-by diesel generators and related fuel systems, three-phase electrical systems (switch gear, UPSs, PDUs, wet cell batteries), CRAC, centrifugal chillers, cooling towers/water chemical systems, air handlers, pumps, and motors.
Core Tasks and Responsibilities
- Operate and maintain all mechanical, electrical, and HVAC equipment within the Data Center/Facility.
- Supervise contractors performing servicing or preventive maintenance.
- Develop work plans for emergency repair of critical assets.
- Operate under minimal supervision and work on-call/rotating schedules as needed.
- Perform basic support concepts such as ticketing systems, root cause analysis, and task prioritization.
- Perform shift duty as required and fully comply with all physical security procedures and policies.
- Ensure all safety procedures are adhered to while performing work.
- Perform rack power installs, rack PDU, and rack ATS replacements.
- Verify electrical/cooling capacity before new rack installation.
- Create and close Work Orders with appropriate data, including labor hours, equipment maintenance, and parts used, into the asset management system.
- Maintain changes in state in Mission Critical infrastructure in support of corrective/preventive maintenance.
- Test quality, performance, safety, and reliability of products, equipment, and processes.
- Serve as a front-line responder for hands-on electrical and mechanical equipment troubleshooting, including AHUs, chillers, cooling towers, chemical treatment systems, pumps, motors, VFDs, and building automation systems.
Required Competencies & Behaviors
- Ability to solve problems through root-cause elimination, understanding the broader context.
- Ensure all safety procedures are adhered to while performing work.
- Aptitude for troubleshooting and problem-solving complex issues.
- Ability to follow procedures, system documentation, and track issues through appropriate entries into a Trouble Ticket system.
- Demonstrate good judgment and instincts in decision-making.
- Prioritize appropriately in a complex, fast-paced environment.
- Willingness to take ownership for technical issues.
- Utilize EPMS and BMS to manage building workflows.
Physical Requirements
- Work on a 24x7 schedule (where applicable).
- Work at heights and from ladders.
- Perform physical tasks during the shift and coordinate body movements when using tools or equipment.
- Work in a noisy environment with ear protection.
Key Job Responsibilities
- Setting and maintaining the highest standards for safety and actively promoting a world-class safety culture in all operational procedures.
- Driving continuous improvement efforts on infrastructure through standardization of procedures and policies, while delivering performance against agreed metrics.
- Enabling the operations organization to deliver 100% uptime on all customer-supporting infrastructure.
- Collaborating effectively with internal & external stakeholders to deliver operational excellence for all AWS customers.
- Responsible for ensuring that the preventive maintenance of site-critical facility infrastructure is planned and executed to the highest standards, in accordance with AWS procedures.
- In charge of facility monitoring & supervision (via BMS, PMS, Walkthrough, etc.).
- Participation in the successful delivery of build-out and retrofit of Data Center infrastructure.
- Ensuring organizational capability to react & respond appropriately to any potential customer-impacting event on any component of electrical or mechanical infrastructure.
- Reviewing incident reports, documenting periodic trend summaries, and providing updates and recommended actions to management.
- Draft, update & maintain method statements, standard operating procedures, emergency response procedures, preventive maintenance programs, and all technical documentation pertaining to Data Center Engineering Operations.
Basic Qualifications
- Bachelor Degree in Electrical Engineering, Mechanical Engineering, HVAC, Industrial technology, or Electrotechnical Engineering.
- 7+ years of experience in the field of electrical/mechanical and refrigeration system preventive and corrective maintenance (data center, production site, energy facilities, etc.).
- Experience with critical infrastructure systems (UPS, generators, switchgear, HVAC, chilled water/cooling systems, pump).
- Familiarity with 24/7/365 mission critical environment.
- Good level of English (written and oral).
Preferred Qualifications
- Familiar with BMS and EPMS control systems and data collection/trending.
- Apprentice/trades certified or diploma in an Electrical field.
- Data Center/Manufacturing experience.
- Bachelors/Masters degree in Electrical Engineering, Mechanical Engineering or relevant discipline.
Key Skills/Competency
- Data Center Operations
- Electrical Systems
- Mechanical Systems
- HVAC Maintenance
- Preventive Maintenance
- Troubleshooting
- Root Cause Analysis
- Critical Infrastructure
- Building Management Systems (BMS)
- Emergency Response
How to Get Hired at Amazon Web Services (AWS)
- Research AWS's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume: Customize your resume to highlight experience with critical infrastructure, electrical/mechanical systems, and 24/7 operations, using keywords from the Data Center Engineering Operations Engineer job description.
- Showcase problem-solving: Prepare examples demonstrating root-cause analysis, troubleshooting complex issues, and proactive solutions in data center environments.
- Master the STAR method: Practice answering behavioral questions using the STAR method, focusing on safety, teamwork, ownership, and customer obsession.
- Understand AWS principles: Familiarize yourself with Amazon's Leadership Principles and be ready to share experiences that align with each principle during your Data Center Engineering Operations Engineer interviews.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background