Cloud Solutions Architect
@ NVIDIA

Santa Clara, CA
$150,000
On Site
Full Time
Posted 18 hours ago

Your Application Journey

Personalized Resume
Apply
Email Hiring Manager
Interview

Email Hiring Manager

XXXXXXXXXX XXXXXXXXXXX XXXXXXXXXX******* @nvidia.com
Recommended after applying

Job Details

About the Cloud Solutions Architect Role

Join NVIDIA as a Cloud Solutions Architect, focusing on large-scale GPU infrastructure and AI Factory deployments. In this role, you'll work with innovative teams to architect and deploy resilient AI compute environments at scale.

What You'll Be Doing

  • Serve as the go-to technical expert on NVIDIA AI Factory solutions.
  • Architect and deploy resilient, telemetry-driven AI compute environments.
  • Collaborate with engineering teams for design wins and challenge resolution.
  • Develop robust tooling for observability, failure recovery, and performance optimization.
  • Advise clients on optimizing cloud environments and scaling high-performance workloads.

What We Need To See

  • 2+ years experience in large-scale cloud infrastructure or GPU cluster management.
  • Degree in Computer Science, Electrical Engineering, Mathematics, Physics or equivalent.
  • Strong understanding of multi-node GPU clusters, high-performance networking, and distributed storage.
  • Familiarity with infrastructure-as-code, automation, and configuration management.
  • Passion for machine learning and continuous learning of new technologies.
  • Excellent interpersonal skills to explain complex topics to non-experts.

Ways To Stand Out

  • Expertise with orchestration tools like Slurm, Kubernetes, or Run:ai.
  • Experience in AI training and inference performance optimization.
  • Proven ability to design telemetry systems and failure recovery mechanisms.
  • Hands-on with cloud-native solutions on AWS, Azure, or Google Cloud.
  • Deep knowledge of high-performance networking technologies including NVIDIA InfiniBand.

Compensation and Benefits

Salary is determined by experience and location. You may be eligible for equity and benefits.

Key skills/competency

  • Cloud Infrastructure
  • GPU Clusters
  • AI Factory
  • Observability
  • Automation
  • Telemetry
  • Networking
  • Configuration Management
  • Orchestration
  • Failure Recovery

How to Get Hired at NVIDIA

🎯 Tips for Getting Hired

  • Customize your resume: Highlight cloud infrastructure and AI expertise.
  • Research NVIDIA: Understand their culture and projects.
  • Prepare technical answers: Review GPU and orchestration systems.
  • Practice behavioral responses: Emphasize teamwork and problem solving.

📝 Interview Preparation Advice

Technical Preparation

Review cloud infrastructure best practices.
Practice automation and configuration management.
Study GPU clustering and high-performance networking.
Familiarize with orchestration tools like Kubernetes.

Behavioral Questions

Describe recent teamwork challenges overcome.
Explain your conflict resolution approach.
Discuss learning from project failures.
Share how you adapt to change.

Frequently Asked Questions