13 days ago

AI Factory Controls and Monitoring Engineer

NVIDIA

On Site
Full Time
$184,000
Santa Clara, CA
Apply

Job Overview

Job TitleAI Factory Controls and Monitoring Engineer
Job TypeFull Time
Offered Salary$184,000
LocationSanta Clara, CA
Map of Santa Clara, CA

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

About NVIDIA

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for 30 years. It’s an outstanding legacy of innovation that’s motivated by extraordinary technology—and outstanding people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join our team and see how you can make a lasting impact on the world.

About the Role

NVIDIA's AI Factories (e.g. data centers) host ground-breaking products across high-performance computing to machine learning applications for autonomous vehicles and healthcare. At the heart of our AI Factory is the ability to engineer mechanical and electrical designs in close coupling to NVIDIA's industry-leading GPU and DSX codesigns. We are seeking an AI Factory Controls and Monitoring Engineer to support control system deployments.

What You’ll Be Doing

  • Collaborate with product owners and technical leads to identify and collect requirements for our next-generation data centers.
  • Support the global design standards for the data center controls and monitoring (DCCM) system, collaborating with internal teams to develop an execution strategy and life cycle management.
  • Responsible for adapting control system reference designs and standards to AI Factory deployments. Key collaborator responsible for control system technical evaluation from site selection due diligence through site turnover to operations including: contractor selection, bid package development, MEP or equivalent experience and control system composition review, RFI response, submittal/as built reviews, and commissioning support.
  • Provide technical support to DC operations controls engineers.
  • Support IT to OT data integration enabling digital twins, agentic AI onboarding, coordinated leak detection and other applications.
  • Support standardization in controls engineering quality approval, process control, product evaluation, vendor proposals, evaluate product reliability, automated testing and software.
  • Collaborate with cross functional teams to make modifications to control settings and alarm thresholds to manage the data center space.

What We Need To See

  • Excellent interpersonal and leadership skills critical for success, depending on building rapport and credibility with multiple stakeholders across the organization.
  • BS in Engineering, CS or equivalent experience.
  • 8+ years of experience with control system design, development and management on industrial or mission critical systems.
  • Working knowledge of mechanical, electrical, life safety, and IT Networking systems associated with critical environments.
  • Understanding of OPC-UA, and Modbus (TCP & RTU) protocols and how to integrate using these protocols.
  • Troubleshooting, problem-solving skills and experience driving root cause analysis to complex projects under pressure.
  • Experience with equipment commissioning, testing, or related activities.
  • Experience with startup and configuration of Programmable Logic Controllers (PLCs) and SCADA workstations.
  • Strong understanding of Sequence of Operations (SOO) for mechanical system control. Ability to create and iterate on SOOs.

Ways To Stand Out From The Crowd

  • Experience with MQTT communication protocol, higher level data strategies, and integration to IT systems.
  • Strong understanding of data center commissioning including Level 1 through Integrated Systems Testing.
  • Strong understanding of document control and change control processes.
  • Working knowledge and experience with Data Center Infrastructure Management (DCIM), EPMS systems, Ignition SCADA software development and deployment, and programming languages: Python, PHP, SQL.
  • Working knowledge of data center power and cooling solutions, including advanced systems such as liquid cooling.

Key skills/competency

  • AI Factory Deployment Engineer
  • Controls Engineering
  • Data Center Operations
  • System Integration
  • SCADA Systems
  • PLC Programming
  • IT/OT Integration
  • Network Protocols
  • Troubleshooting
  • Commissioning

Tags:

AI Factory Deployment Engineer
Controls Engineering
Data Center
NVIDIA
SCADA
PLC
IT/OT Integration
System Design
Commissioning
High-Performance Computing

Share Job:

How to Get Hired at NVIDIA

  • Tailor your resume: Highlight experience in control systems, industrial environments, and specific protocols like OPC-UA and Modbus.
  • Showcase leadership: Emphasize your interpersonal and leadership skills, detailing how you've built rapport with stakeholders.
  • Demonstrate technical expertise: Detail your experience with PLCs, SCADA, commissioning, and IT/OT integration.
  • Prepare for interviews: Be ready to discuss complex problem-solving, root cause analysis, and your understanding of data center infrastructure.
  • Research NVIDIA: Understand their commitment to AI, data centers, and their innovative culture to align your answers.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background