Machine Learning Engineer, Video Engineering
Apple
Sunnyvale, California, United States
Machine Learning and AI
$147,400 - $272,100
3+ years
Help define the next generation of photo and video experiences at Apple. The Video Engineering group develops key image and video technologies that contribute to experiences across many Apple products and services. As a Machine Learning Engineer, you'll be developing breakthrough ML models that power spatial media features, including 3D photos, immersive video, and scene reconstruction, reaching hundreds of millions of users. As a member of a fast-paced team, you will have the unique and exciting opportunity to shape upcoming products at the frontier of generative AI, computer vision, and 3D understanding, with a strong emphasis on taking ideas from research to production.
This role requires an independent, self-motivated, and creative engineer with deep expertise in machine learning, coupled with a strong understanding of 3D computer vision and computer graphics. Your primary focus will be applying cutting-edge machine learning techniques to complex 3D visual challenges to create customer impact across current and future Apple products.
- Work both independently and collaboratively with team members to research and develop state-of-the-art ML models for 3D content creation, spatial video processing, and scene understanding, presenting and demonstrating your work to cross-functional teams and leadership alike.
- Design and train large-scale generative models on Apple's ML infrastructure, run rigorous experiments to improve quality, and optimize models for on-device deployment. The team's past work includes Spatial Scenes: immersive 3D experiences generated from photos and videos using ML-powered depth and view synthesis.
- Bachelor's degree in Computer Science, Electrical/Computer Engineering, or a related field.
- 3+ years of experience developing and shipping machine learning models, with a focus on computer vision or generative AI.
- Expert-level proficiency in Python and PyTorch or equivalent ML frameworks.
- Strong experience with diffusion models for image synthesis, video generation, 3D generation, or related domains.
- Solid understanding of 3D computer vision including depth estimation, multi-view geometry, 3D reconstruction, novel view synthesis, or neural scene representations.
- Working knowledge of 3D computer graphics concepts such as rendering, camera models, scene representations, and coordinate systems.
- Working knowledge of C++ for performance-critical components.
- Ability to leverage AI coding assistants and LLM-based tools to accelerate development, prototyping, and problem-solving workflows.
- MS or PhD in Computer Science, Machine Learning, Computer Vision, or a related field.
- Publications in top-tier ML or vision venues.
- Experience with video diffusion models, 3D Gaussian Splatting, neural radiance fields, or differentiable rendering.
- Experience with distributed training at scale.
- Familiarity with on-device ML deployment.
- Track record of shipping ML-powered features in consumer products.
- Good written and oral communication and interpersonal skills.