4 days ago

Solutions Architect, Generative AI

NVIDIA

On Site
Full Time
$200,000
Santa Clara, CA

Job Overview

Job Title: Solutions Architect, Generative AI
Job Type: Full Time
Category: Commerce
Experience: 5 Years
Degree: Master
Offered Salary: $200,000
Location: Santa Clara, CA

Job Description

About NVIDIA

NVIDIA is seeking an outstanding AI Engineer or Solutions Architect to join our growing team focused on partner enablement for Generative AI. In this role, you will lead by example, acting as both a strategic technical expert and a hands-on developer. You will directly build proof-of-concept solutions and reference architectures for innovative AI applications, demonstrating the full power of NVIDIA's accelerated Generative AI platforms. By developing these foundational solutions, you will provide partners with the technical blueprints and expert guidance needed to architect and deploy their own transformative applications using NVIDIA's full AI stack, from GPU systems and CUDA to NeMo and Triton.

The Generative AI Partners Enablement SA team is dedicated to applying next-generation technologies to solve customer problems. We act as trusted advisors and technical partners to our ecosystem. As a member of the NPN Generative AI Solution Architecture team, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world by applying accelerated computing and AI to build category-defining systems and production-grade AI solutions at scale.

What You Will Be Doing

  • Serve as the primary technical domain expert for partners across pre- and post-sales, embedding deeply with them to design and deploy Generative AI solutions. Maintain strong relationships with leadership and technical teams to drive adoption and successful utilization of NVIDIA GenAI platforms.
  • Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes, and advising on standard methodologies for scaling solutions to production.
  • Define the scope, success metrics, and evaluation criteria for partner-led customer projects, ensuring they are built on standardized and reproducible GPU-accelerated workflows.
  • Enable strategic partners to launch their own Professional Services and platforms by tailoring NVIDIA agentic AI blueprints for high-impact customer workloads. You will proactively find opportunities to drive deeper adoption and utilization of NVIDIA's Generative AI products.
  • Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.

What We Need To See

  • MSc or PhD in Computer Science, Electrical Engineering, Software Engineering, Machine Learning, or a related field (or equivalent experience).
  • 5+ years of relevant work experience developing and deploying AI models at scale as a software engineer or deep learning engineer.
  • Consistent track record of building enterprise-grade agentic AI systems using open-source models, and a solid foundation in deep learning, with a particular emphasis on generative models.
  • Hands-on experience with LLM and agentic frameworks (NeMo Agent Toolkit, LangChain, Semantic Kernel, Crew.ai, AutoGen) and evaluation and observability platforms. Comfortable building prototypes or proofs of concept.
  • Strong software development skills and proficiency in Python, C++, and deep learning frameworks (PyTorch or TensorFlow).
  • Excellent communication and presentation skills to collaborate effectively with internal executives, partners, and customers.

Ways To Stand Out From The Crowd

  • Demonstrated expertise and hands-on experience with NVIDIA AI platforms.
  • Understanding of different advanced agent architectures and emerging communication protocols (MCP or Google A2A).
  • Excellent practical knowledge of Generative AI and LLM development, including the ability to train GPT and Megatron models.
  • Understanding of MLOps life cycle management and experience with LLMOps workflows.
  • Experience with CUDA programming and with benchmarking and analyzing the performance of foundation models.

Key skills/competency

  • Generative AI
  • Large Language Models (LLM)
  • Deep Learning
  • Python
  • PyTorch
  • TensorFlow
  • CUDA
  • Solution Architecture
  • Partner Enablement
  • MLOps

Tags:

Solutions Architect, Generative AI
Solution design
partner enablement
prototype building
reference architecture
technical advising
deployment
scaling solutions
codification
pre-sales
post-sales
Generative AI
LLM
Deep Learning
Python
PyTorch
TensorFlow
CUDA
NeMo
Triton
LangChain
Semantic Kernel
Crew.ai
AutoGen

How to Get Hired at NVIDIA

  • Research NVIDIA's culture: Study their mission in AI and accelerated computing, values of innovation and excellence, recent news, and employee testimonials on LinkedIn and Glassdoor.
  • Tailor your resume for Generative AI: Highlight experience with LLMs, deep learning frameworks, and solution architecture, matching keywords directly from the Solutions Architect, Generative AI job description.
  • Showcase project impact: Prepare a portfolio or discussion points on building and deploying enterprise-grade AI systems, demonstrating scalable solutions and partner enablement.
  • Master technical fundamentals: Deepen your knowledge in Generative AI, PyTorch/TensorFlow, CUDA, and MLOps workflows relevant to NVIDIA's AI ecosystem.
  • Prepare for behavioral questions: Practice articulating how you solve complex technical problems, lead strategic technical engagements, and collaborate effectively with diverse teams and partners.
