Senior Manager, Engineering - Enterprise AI and Automation
NVIDIA
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
About the Role: Senior Manager, Engineering - Enterprise AI and Automation
As a Senior Engineering Manager for Agentic Systems & Platform Architecture, you will lead the strategy and execution for NVIDIA’s agentic developer platform—deeply understanding how teams across the company build, evaluate, and improve autonomous agents, and turning those evolving patterns into scalable platform capabilities. You will identify gaps and friction, drive rapid proof-of-concepts on emerging agent constructs and ecosystem tools, and operationalize the best approaches into reusable building blocks, integrations, and governance mechanisms that accelerate developer productivity and agent quality. If you’re passionate about staying at the forefront of agent architectures and turning experimentation into real business impact, this role offers a chance to build a platform. This platform helps teams safely ship more autonomous systems at NVIDIA scale.
What You Will Be Doing
- Track and deeply understand evolving agent development patterns across NVIDIA and the broader ecosystem.
- Identify gaps and friction in current agent architectures, and translate insights into a platform strategy that boosts developer velocity and agent quality—backed by evaluations, benchmarking, and feedback loops.
- Assess and integrate open source and third-party tools where they add leverage; drive clear build-vs-use decisions.
- Architect and integrate high-performance data pipelines, RAG systems, vector databases, and GPU-optimized training and inference workflows.
- Lead integration of the AI Data Platform into NVIDIA’s on-prem AI Factory, optimizing GPU-to-storage throughput, data locality, and distributed inference performance.
- Establish and enforce robust Agent Governance policies across the platform, covering model/tool usage, data lineage, and ensuring adherence to compliance and Responsible AI frameworks.
- Design, implement, and maintain a centralized Agent Safety Toolkit, providing developers with pre-vetted components for input/output guardrails and prompt injection defenses.
- Lead and grow a high-performing team along with a multi-functional community to standardize procedures and scale adoption.
What We Need To See
- Bachelor’s degree in CS/Engineering or equivalent experience.
- 10+ overall years in software engineering, including 4+ years managing high-performing teams.
- Strong hands-on experience with evolving agent architectures and open-source libraries; deep expertise in LLM/agent architectures—leading POCs and integrating them into real business use cases with measurable adoption/impact.
- Ability to turn fast-moving, ambiguous problem spaces into clear platform strategy, roadmap, and outcomes.
- Proven track record building multi-team developer platforms (APIs/SDKs, reusable components, reference implementations).
- Experience building evaluation/benchmarking systems for agent workflows (metrics, regression, feedback loops).
- Strong judgment integrating OSS/3P tools; clear build-vs-use decision-making and integration strategy.
- Product approach for safety and governance: controls, audit ability, monitoring, and risk management.
- Strong leadership and executive communication (engineering, product, security, research).
Ways To Stand Out From The Crowd
- Experience implementing enterprise-grade governance for agent systems (controls, audit-ability, monitoring, policy enforcement) in production autonomous workflows.
- Demonstrated wins taking new/open-source agent constructs from POC to production adoption, with clear business impact (cycle time, quality, cost, reliability).
- Built and scaled an agent platform or agent developer experience used by multiple teams (SDKs, templates, reference apps, reusable building blocks).
- Clear point of view and real examples on build-vs-use decisions—when to adopt OSS/3P vs build internal primitives—and how to operationalize the choice.
- Deep experience with agent evaluation at scale (long-horizon tasks, tool correctness, reliability testing, automated regressions, offline/online feedback loops).
Key skills/competency
- Agent Architectures
- LLM Integration
- Platform Strategy
- AI Governance
- Data Pipelines
- GPU Optimization
- Team Leadership
- Open Source Tools
- Benchmarking
- Autonomous Systems
How to Get Hired at NVIDIA
- Research NVIDIA's AI vision: Study their mission, values, recent news on AI, and employee testimonials on LinkedIn and Glassdoor.
- Tailor your resume for AI/Automation: Highlight experience with LLMs, agent architectures, and platform leadership for NVIDIA.
- Showcase agentic system expertise: Prepare specific examples of building and scaling autonomous agents from POC to production.
- Demonstrate platform leadership: Detail your track record in developer platforms, governance, and fostering team growth.
- Master NVIDIA's interview process: Practice technical problem-solving and behavioral questions focused on leadership and innovation in AI.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background