Cloud Machine Learning LLM Serving Staff Engineer at Qualcomm | Jobs near Bengaluru

Cloud Machine Learning LLM Serving Staff Engineer

Qualcomm is seeking ambitious, bright, and innovative engineers with experience in machine learning framework development to join its Cloud Computing team. The role involves developing hardware and software for Machine Learning solutions across the data center, edge, infrastructure, and automotive markets. The position spans the entire product life cycle, from early design to commercial deployment, in a fast-paced, cross-functional environment that requires strong communication, planning, and execution skills.

Key Responsibilities

Analyze software requirements and design feasibility within given constraints, collaborating with architecture and hardware engineers to implement optimal software solutions for Qualcomm's SoCs.
Identify and analyze system-level issues, working closely with software development, integration, and test teams.
Lead high-performing Machine Learning software engineering teams, backed by a proven track record of doing so.
Apply a strong foundation in mathematical modeling, linear algebra, and state-of-the-art ML/AI algorithms.
Improve and optimize key Deep Learning models on Qualcomm AI 100 hardware.
Build deep learning framework extensions for Qualcomm AI 100 in upstream open-source repositories.
Collaborate with internal teams to analyze and optimize training and inference for deep learning workloads.
Develop software tools and build the ecosystem around the AI software stack.
Work with vLLM, Triton, ExecuTorch, Inductor, and TorchDynamo to create abstraction layers for inference accelerators.
Optimize workloads for both scale-up (multi-SoC) and scale-out (multi-card) systems.
Optimize the entire deep learning pipeline, including graph compiler integration.
Apply software engineering best practices throughout the development process.

Desirable Skills and Aptitudes

Deep Learning experience or knowledge in areas such as LLMs, Natural Language Processing, Vision, Audio, and Recommendation systems.
Understanding of PyTorch and TensorFlow software stacks, including their component structures and functions.
Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design.
Ability to work independently, define requirements and scope, and lead development efforts.
Proficiency with open-source development practices.
Strong developer with a research mindset, driven to innovate and solve complex problems.
Knowledge of tiling and scheduling for Machine Learning operators is a plus.
Experience with advanced C++14 features.
Experience in software profiling and optimization techniques.
Hands-on experience with SIMD and/or multi-threaded high-performance code is a plus.
Experience with ML compilers and auto-code generation (using MLIR) is a plus.
Experience running workloads on large-scale heterogeneous clusters is a plus.
Hands-on experience with CUDA and cuDNN is a plus.

Qualifications

Bachelor's, Master's, or PhD degree in Engineering, Machine Learning/AI, Information Systems, Computer Science, or a related field.
8+ years of Software Engineering or related work experience.
8+ years of experience with programming languages such as C++ and Python.

Minimum Qualifications

Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience.
OR Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience.
OR PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
2+ years of work experience with programming languages such as C, C++, Java, Python, etc.

Key skills/competency

Machine Learning
Deep Learning
LLM
Python
C++
Software Engineering
Optimization
Framework Development
Cloud Computing
System Design
