Description:
Imagine building the next generation of AI-powered experiences at Apple. We are advancing the state of the art in foundation models, applying them across language, vision, and multimodal understanding to power features used by millions of people worldwide.
As part of the Multimodal Intelligence Team (MINT), which has a track record of delivering innovations from the Apple Foundation Model to real-world applications like Visual Intelligence, you will tackle the practical challenges of scaling, optimizing, and building large models, as well as integrating such models and agents into Apple products. You'll collaborate with world-class engineers and scientists to push the boundaries of foundation models and agentic systems while delivering real-world impact.
Currently pursuing a PhD, or equivalent experience, in Machine Learning, Computer Vision, Natural Language Processing, Data Science, Statistics, or a related area. Experience with large language models or vision-language models and their application in agentic systems. Proficient in Python, with experience in at least one modern deep learning framework (PyTorch, JAX, or TensorFlow).
Demonstrated publication record at relevant conferences (e.g. NeurIPS, ICML, ICLR, CVPR). Experience with foundation models (language, vision-language, or multimodal). Experience with post-training (SFT or RL) to optimize large models for agentic systems. Available for a 6-12 month internship.
| Source: | Company website |
| Date: | 08 Dec 2025 (verified on 13 Dec 2025) |
| Job type: | Internship |
| Sector: | Consumer electronics |
| Duration: | 12 months |
| Language skills: | English |