13 days ago

Computer Vision Machine Learning Engineer Video Generation

Apple

On Site
Full Time
$150,000
Beijing, Beijing, China

Job Overview

Job TitleComputer Vision Machine Learning Engineer Video Generation
Job TypeFull Time
CategoryCommerce
Experience5 Years
DegreeMaster
Offered Salary$150,000
LocationBeijing, Beijing, China

Who's the hiring manager?

Sign up to PitchMeAI to discover the hiring manager's details for this job. We will also write them an intro email for you.

Uncover Hiring Manager

Job Description

Summary

If you are passionate about advancing video generation, building state-of-the-art models that synthesize high-quality and controllable video, and optimizing them for on-device deployment, Apple is the right place for you. As a Computer Vision Machine Learning Engineer Video Generation, you will push the boundaries of video AI.

Description

Join Apple’s Video Engineering org to develop models and infrastructure for video generation and understanding across Apple products. Work on cutting-edge generative techniques including diffusion, transformer-based models, frame interpolation, and temporal modeling, ensuring efficient performance on iPhone, iPad, and Vision Pro. Collaborate with research scientists, framework engineers, and cross-functional teams to design, train, optimize, and deploy scalable video generation systems.

Responsibilities

  • Design and develop generative video models for high-fidelity controllable synthesis.
  • Build large-scale training, evaluation, and benchmarking infrastructure.
  • Investigate model consolidation and shared representation learning.
  • Optimize algorithms for runtime, power, memory, and temporal quality on-device.
  • Collaborate with product and research teams to integrate video generation technologies into Apple’s camera and video pipelines.

Minimum Qualifications

  • M.S. or Ph.D. in Computer Science, Electrical Engineering, or related fields focused on computer vision or machine learning.
  • Experience in generative video modeling, video prediction, temporal modeling, or frame interpolation.
  • Proficiency in deep learning frameworks (PyTorch, JAX) and programming languages (Python, C++).
  • Experience with large-scale training pipelines and deploying models in real-world systems.
  • Strong written and verbal communication skills.

Preferred Qualifications

  • Publications in top-tier conferences (CVPR, ECCV, ICCV, NeurIPS, ICLR).
  • Experience with multi-modal video or text-video generation.
  • Familiarity with optimizing generative models for mobile/embedded devices.
  • Understanding of temporal consistency, controllable generation, and efficient large-scale infrastructure.
  • Passion for building scalable, high-quality systems in cross-functional teams.

Equal Opportunity

Apple is committed to inclusion, diversity, and fair treatment for all applicants, providing reasonable accommodation to those with disabilities.

Key skills/competency

  • Computer Vision
  • Machine Learning
  • Video Generation
  • Deep Learning
  • Diffusion Models
  • Transformer Models
  • Temporal Modeling
  • Infrastructure
  • Optimization
  • On-device Deployment

Tags:

Computer Vision Machine Learning Engineer Video Generation
video generation
deep learning
PyTorch
JAX
infrastructure
optimization
temporal modeling
diffusion
transformer
benchmarking
systems

Share Job:

How to Get Hired at Apple

  • Customize your resume: Highlight deep learning and video generation skills.
  • Research Apple: Study their culture, products, and innovation.
  • Showcase projects: Emphasize work on generative models and infrastructure.
  • Prepare for technical interviews: Practice deep learning and system design questions.

Frequently Asked Questions

Find answers to common questions about this job opportunity

Explore similar opportunities that match your background