Computer Vision Machine Learning Engineer Video Generation
Apple
Job Overview
Who's the hiring manager?
Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Job Description
Summary
If you are passionate about advancing video generation, building state-of-the-art models that synthesize high-quality and controllable video, and optimizing them for on-device deployment, Apple is the right place for you. As a Computer Vision Machine Learning Engineer Video Generation, you will push the boundaries of video AI.
Description
Join Apple’s Video Engineering org to develop models and infrastructure for video generation and understanding across Apple products. Work on cutting-edge generative techniques including diffusion, transformer-based models, frame interpolation, and temporal modeling, ensuring efficient performance on iPhone, iPad, and Vision Pro. Collaborate with research scientists, framework engineers, and cross-functional teams to design, train, optimize, and deploy scalable video generation systems.
Responsibilities
- Design and develop generative video models for high-fidelity controllable synthesis.
- Build large-scale training, evaluation, and benchmarking infrastructure.
- Investigate model consolidation and shared representation learning.
- Optimize algorithms for runtime, power, memory, and temporal quality on-device.
- Collaborate with product and research teams to integrate video generation technologies into Apple’s camera and video pipelines.
Minimum Qualifications
- M.S. or Ph.D. in Computer Science, Electrical Engineering, or related fields focused on computer vision or machine learning.
- Experience in generative video modeling, video prediction, temporal modeling, or frame interpolation.
- Proficiency in deep learning frameworks (PyTorch, JAX) and programming languages (Python, C++).
- Experience with large-scale training pipelines and deploying models in real-world systems.
- Strong written and verbal communication skills.
Preferred Qualifications
- Publications in top-tier conferences (CVPR, ECCV, ICCV, NeurIPS, ICLR).
- Experience with multi-modal video or text-video generation.
- Familiarity with optimizing generative models for mobile/embedded devices.
- Understanding of temporal consistency, controllable generation, and efficient large-scale infrastructure.
- Passion for building scalable, high-quality systems in cross-functional teams.
Equal Opportunity
Apple is committed to inclusion, diversity, and fair treatment for all applicants, providing reasonable accommodation to those with disabilities.
Key skills/competency
- Computer Vision
- Machine Learning
- Video Generation
- Deep Learning
- Diffusion Models
- Transformer Models
- Temporal Modeling
- Infrastructure
- Optimization
- On-device Deployment
How to Get Hired at Apple
- Customize your resume: Highlight deep learning and video generation skills.
- Research Apple: Study their culture, products, and innovation.
- Showcase projects: Emphasize work on generative models and infrastructure.
- Prepare for technical interviews: Practice deep learning and system design questions.
Frequently Asked Questions
Find answers to common questions about this job opportunity
Explore similar opportunities that match your background