PitchMeAI
IBM

Storage Solutions Architect- AI Infrastructure

IBM · Pune Division, Maharashtra, India

  • On site
  • Full-time
  • $160,000 / year
  • Pune Division, Maharashtra, India

Job highlights

  • Design high-performance storage for AI/ML workloads.
  • Automate infrastructure for AI/HPC environments.
  • Collaborate with partners and internal teams.
  • Educate market on IBM Storage solutions.
  • Drive client success and joint marketing.

About the role

About the Role

IBM Infrastructure is seeking a creative and innovative AI Infrastructure Storage Solutions Architect. This role is crucial in designing and implementing high-performance data environments for large-scale AI training and inference. You will work at the heart of IBM, contributing to a growth-minded culture focused on innovation and career development. Collaboration with various business units, product teams, and technology partners worldwide is key to success.

Mission of the IBM Storage Solutions Team

The mission is to engage with strategic ISV and OEM partners to advance IBM Storage strategy, expand portfolio marketability, align solution roadmaps, and collaborate on Go-To-Market (GTM) initiatives. This involves working closely with IBM's internal teams (product managers, engineers, researchers, marketers, sellers) and external technology partners to realize measurable business growth through joint solutions.

Your Role and Responsibilities

  • Architect, design, build, and operate high-performance storage systems tailored for AI/ML workloads.
  • Bridge the gap between storage technology and AI needs, including GPU clusters and data pipelines.
  • Implement infrastructure automation, performance tuning, and cloud platform solutions.
  • Collaborate with data scientists to ensure scalable, secure, and reliable data flow for complex models.
  • Design, develop, integrate, and test differentiated workload-based solutions using the IBM Storage portfolio and third-party technologies.
  • Contribute to defining storage portfolio requirements based on industry trends and partner strategies.
  • Educate the market, customers, and sales teams on the importance of IBM Storage for business-valued workloads through various formats (blogs, white papers, reference architectures, videos, social media, trade shows).
  • Drive client success stories, references, and joint marketing opportunities.
  • Act as a trusted advisor for strategic storage solutions.
  • Leverage AI-driven tools and automation to solve business problems, support data-informed decisions, and enhance team productivity.
  • Enable cross-selling of jointly developed storage solutions.
  • Lead end-to-end system design for distributed storage platforms for AI/HPC workloads.
  • Maximize IOPS and throughput for multi-node GPU clusters using deep learning frameworks.
  • Build CI/CD and automation pipelines for provisioning and monitoring AI infrastructure (Terraform, Ansible, Kubernetes).
  • Evaluate and select next-generation storage technologies (e.g., NVMe-oF, Ceph) for petabyte-scale data.
  • Integrate AIOps for AI/ML workflows, including CI/CD pipelines, monitoring, and orchestration (Kubernetes).

Key Skills and Competencies

Storage Solutions Architect, AI Infrastructure, High-Performance Computing (HPC), Machine Learning (ML), Deep Learning, Data Pipelines, Infrastructure Automation, Performance Tuning, Cloud Platforms, Kubernetes, Terraform, Ansible, NVMe-oF, Ceph, NVIDIA DGX, GPUs, DPUs, SAN, NAS, Parallel File Systems, NFS, SMB, S3, Linux, Python, Bash, Go, HA/DR.

Skills & topics

  • Storage Solutions Architect
  • AI Infrastructure
  • Storage Architecture
  • High-Performance Computing
  • Machine Learning
  • Deep Learning
  • Data Pipelines
  • Infrastructure Automation
  • Performance Tuning
  • Cloud Platforms

How to get hired

  • Tailor your resume: Highlight experience with AI infrastructure, storage architectures, and relevant technologies like NVIDIA DGX, GPUs, and DPUs.
  • Showcase automation skills: Emphasize proficiency in Python, Bash, or Go, and experience with tools like Terraform, Ansible, and Kubernetes.
  • Demonstrate cloud expertise: Detail experience with hybrid/multi-cloud AI solutions on AWS, Azure, GCP, and virtualization technologies.
  • Prepare for technical questions: Be ready to discuss SAN, NAS, parallel file systems, and performance optimization for AI workloads.
  • Understand IBM's strategy: Research IBM Storage's role in hybrid cloud and AI solutions.

Technical preparation

Master AI hardware: NVIDIA DGX, GPUs, DPUs.,Practice storage protocols: NFS, SMB, S3.,Build automation with Terraform, Ansible, Kubernetes.,Design HA/DR for distributed storage.

Behavioral questions

Describe a complex storage challenge you solved.,How do you collaborate with data scientists?,How do you stay updated on AI trends?,Explain your approach to trusted advisor relationships.

Frequently asked questions

What specific storage technologies are prioritized for AI/ML workloads at IBM?
IBM prioritizes next-generation storage technologies such as NVMe-oF (IBM Flashsystems) and Ceph to support petabyte-scale data requirements for AI/ML workloads. Experience with these and traditional systems like SAN, NAS, and Parallel File Systems is crucial.
How important is automation experience for an AI Infrastructure Storage Solutions Architect at IBM?
Automation is critical. The role requires building CI/CD and automation pipelines for provisioning and monitoring AI infrastructure using tools like Terraform, Ansible, and Kubernetes. Proficiency in scripting languages like Python, Bash, or Go is essential.
What level of experience is typically required for this Storage Solutions Architect role?
Typically, 12-15 years of experience in AI Infrastructure, systems engineering, and/or storage architectures is required. A Master's degree is preferred, but extensive experience can also be a strong qualification.
Does IBM expect candidates to have experience with specific AI hardware like NVIDIA systems?
Yes, strong expertise with NVIDIA DGX/HGX systems, GPUs, and DPUs (e.g., NVIDIA BlueField) is a key requirement for effectively architecting high-performance data environments for AI training and inference.
What is the role of collaboration in this position at IBM?
Collaboration is central. You'll engage with the entire IBM Storage brand (product managers, engineers, marketers, sellers) and technology partners worldwide to align roadmaps, design solutions, and drive GTM initiatives.
How does IBM leverage AI-driven tools in this role?
The role requires leveraging AI-driven tools and automation to solve everyday business problems, support data-informed decision-making, and significantly enhance team productivity. This includes applying these tools for rapid problem-solving.
What are the key responsibilities related to solution design and validation?
Key responsibilities include leading end-to-end system design for distributed storage platforms for AI/HPC, evaluating and selecting next-generation storage technologies, and building CI/CD pipelines for AI/ML workflows.