2 days ago

Product Manager - AI Data Center Infrastructure

Hewlett Packard Enterprise

On Site
Full Time
₹0
Bangalore Urban, Karnataka, India

Job Overview

Job TitleProduct Manager - AI Data Center Infrastructure
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary₹0
LocationBangalore Urban, Karnataka, India

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

Product Manager - AI Data Center Infrastructure at Hewlett Packard Enterprise

Who We Are

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Family Definition:

We are seeking a Product Line Manager (PLM) for AI Data Center Infrastructure to define and deliver next-generation data center networking platforms for large-scale GPU clusters. This role is ideal for a visionary, hands-on leader who understands how AI workloads stress networks at scale and can translate that insight into clear product requirements and roadmaps.

The successful candidate will have deep experience with data center switching platforms, high-performance Ethernet fabrics, and GPU/NIC interconnects across NVIDIA and AMD ecosystems. You will drive the architecture and product strategy for scale-up and scale-out AI fabrics, enabling deterministic performance, ultra-low latency, and operational excellence for hyperscale AI training and inference clusters.

This role requires a self-starter and go-getter who can operate independently while collaborating across engineering, operations, and strategic partners.

What You Will Do

  • AI Data Center & Fabric Architecture
    • Define product requirements for AI data center network architectures supporting thousands of GPUs.
    • Develop requirements for low-latency Ethernet fabrics using Juniper QFX platforms and Apstra-based automation.
    • Enable high-bandwidth GPU and NIC interconnects optimized for large-scale distributed training and inference workloads.
  • GPU, NIC & Interconnect Strategy
    • Lead requirements definition for next-generation GPUs, NICs, and interconnect technologies, staying ahead of industry roadmaps.
    • Drive alignment with: NVIDIA: ConnectX (CX7/CX8), NVLink, NVSwitch, AMD: MI300/MI400 platforms, Pollara NICs, Infinity Fabric
    • Ensure interoperability across DAC, AEC, ACC, and optical transceivers between switches and NIC endpoints.
    • Define scale-up paths using PCIe, NVLink, NVSwitch, ensuring GPU-to-GPU symmetry, consistency, and bandwidth determinism.
  • Switching, Routing & Telemetry
    • Specify and optimize L2/L3 architectures, including EVPN-VXLAN, Class-E IPv4, and AI-optimized buffer tuning.
    • Leverage hardware telemetry, streaming sensors, and analytics for proactive performance assurance.
    • Drive automation using Python, Ansible, Apstra, Terraform, and related tools to enforce configuration consistency and compliance.
  • Performance Optimization & Troubleshooting
    • Analyze GPU job performance to identify network hotspots, congestion, packet loss, and microbursts.
    • Tune ECN, RDMA/ROCEv2, PFC, and traffic-engineering policies for AI workloads.
    • Optimize server-to-switch interactions, including: BIOS and firmware alignment, NIC queue and link-training parameters, Cable selection and management (AEC/ACC/optics)
  • Cross-Functional & Ecosystem Collaboration
    • Partner closely with AI platform teams, GPU system architects, data center operations, and strategic vendors (NVIDIA, AMD, Juniper).
    • Lead and participate in root-cause analysis for: Link flaps and training failures, FEC and PCS errors, Thermal or power-related performance degradation
    • Drive lab validation, scale testing, and certification of new optics, NIC firmware, and switch software releases.

What You Need To Bring

  • 5–10+ years of experience in data center networking, AI infrastructure, or HPC environments.
  • Strong hands-on experience with Juniper QFX platforms and JunOS.
  • Deep understanding of GPU architectures: NVIDIA: H100/H200, GB200/GB300, NVLink/NVSwitch, AMD: MI300/MI400, Pollara NICs, Infinity Fabric.
  • Proven expertise in scale-up GPU interconnects and scale-out Ethernet fabrics.
  • Strong knowledge of RDMA/ROCEv2, ECN, PFC, and buffer management.
  • Familiarity with distributed AI workloads, collective operations (NCCL, RCCL).
  • Hands-on troubleshooting experience with high-speed optics, AEC cables, link training, and NIC firmware.
  • Proficiency in automation and scripting (Python, Ansible, Bash, Terraform).

Preferred Qualifications

  • Certification : JNCIE , CCIE, (NCP-AII), (NCA-AIIO), (NCP-AIO), (NCP-AIN)
  • Experience with Apstra or other intent-based networking platforms.
  • Knowledge of 1.6T optics, 200G PAM4 SerDes, and CPO/LPO architectures.
  • Experience supporting liquid-cooled GPU clusters and rack-level power/network design.
  • Understanding of data center operations, observability, and SLAs for AI training and inference clusters.

Additional Skills

Cross Domain Knowledge, Customer Engagement, Design Thinking, Development Fundamentals, DevOps, Go-to-Market Expertise, Partner Management, Product Lifecycle Management, Security-First Mindset, Strategic Pricing, Strategy Creation, User Experience (UX), Value Creation, Vendor Management

What We Can Offer You

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

Key skills/competency

  • AI Infrastructure
  • Data Center Networking
  • GPU Architectures
  • Ethernet Fabrics
  • Juniper QFX
  • RDMA/ROCEv2
  • Network Automation
  • Product Strategy
  • Interconnect Technologies
  • Performance Optimization

Tags:

Product Manager
AI Data Center
product strategy
data center networking
GPU architectures
roadmap definition
performance optimization
vendor management
automation
troubleshooting
requirements definition
ecosystem collaboration
Juniper QFX
JunOS
NVIDIA
AMD
NVLink
NVSwitch
RDMA
ROCEv2
ECN
PFC
EVPN-VXLAN
Python
Ansible
Apstra
Terraform

Share Job:

How to Get Hired at Hewlett Packard Enterprise

  • Research HPE's culture: Study their mission, values, recent news, and employee testimonials on LinkedIn and Glassdoor.
  • Tailor your resume: Highlight your expertise in AI infrastructure, data center networking, and product management, customizing it for Hewlett Packard Enterprise's specific needs.
  • Showcase technical depth: Emphasize hands-on experience with Juniper QFX, GPU architectures, RDMA, and network automation in your application materials and interviews.
  • Prepare for behavioral questions: Demonstrate your ability to operate independently, collaborate cross-functionally, and drive product strategy in a fast-paced environment at HPE.
  • Network effectively: Connect with current and former HPE employees, especially those in product management or engineering, to gain insights and potential referrals.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background