Senior ML Infrastructure Engineer - Training Algorithms, SIML

Apple
Cupertino, United States of America2026-02-25

About the job

Are you passionate about Generative AI? Are you interested in working on groundbreaking generative modeling technologies to enrich billions of people? We are the Intelligence System Experience (ISE) team within Apple’s software organization. The team operates at the intersection of multimodal machine learning and system experiences. Our multidisciplinary ML teams focus on a broad spectrum of areas, including Visual Generative Foundation Models, Multimodal Understanding, Visual Understanding of People, Text, Handwriting, and Scenes, Personalization, Knowledge Extraction, Conversation Analysis, Behavioral Modeling for Proactive Suggestions, and Privacy-Preserving Learning. These innovations form the foundation of the seamless, intelligent experiences our users enjoy every day. We are seeking engineers experienced in building infrastructure for training, adapting and deploying large-scale generative models. In this role, you will be working with closely with a cross functional team of algorithm design and infrastructure engineers to benchmark, prototype and steer algorithmic choices to best fit our training & deployment infrastructure.

Responsibilities

Training optimizations & profiling targeting vision/language pre-training

Researching training recipes for effective scheduling of multimodal training workloads

Experimentation & tooling for post-training ablations including reward modeling, distillation and prompt optimization

Coordinating with post-training algorithm owners for analyzing quality / performance tradeoffs of downstream capabilities

Ablations involving optimization aware fine-tuning

Qualifications

Minimum

Bachelors, Masters, or PhD in Electrical Engineering/Computer Science or a related field (mathematics, physics or computer engineering), with a focus on machine learning; or comparable professional experience

Experienced in training / adapting LLM and Diffusion models

Advanced Fluency in PyTorch

Excellent programming skills and experience contributing software to large projects

Experience with distributed training of large models

Preferred

Strong ML Fundamentals

Experience working with large cross-functional and diverse teams.