Pubblicare uno stage
it
Offerta
Lavoro > Stage > Scienza/Ricerca > Stati Uniti > Menlo Park > Offerta 

Research Scientist Intern, FAIR - Text Data Research (PhD)

Meta
Stati Uniti  Menlo Park, Stati Uniti
Stage, Scienza/Ricerca, Inglese, Polacco
13
Visite
0
Candidati
Registrarsi

Descrizione del lavoro:

Meta is seeking Research Interns to join Fundamental AI Research (FAIR) Text Data Research team. The team is committed to building the data foundation for Meta's most advanced Large Language Models and contributes to data curation across all stages of LLM development (pre-training, mid-training, post-training) and all domains (e.g., web, code, agent, multilingual). We tackle the hardest challenges at trillion-scale, including organic data curation, synthetic data generation, agent and interaction data, and frontier paradigms that redefine what's possible. We are seeking candidates with a research focus in domains such as: natural language understanding and generation, language modeling, pre-training and post-training, low-resource NLP, question answering, machine translation, dialogue, cross-lingual and cross-domain transfer learning, and other related domains. Based in Meta Superintelligence Labs (MSL), our interns have an opportunity to directly contribute to Meta's frontier models like Llama, while having the chance to collaborate with researchers and engineers across MSL. Our team at FAIR offers twelve (12) to twenty-four (24) weeks long internships and we have various start dates throughout the year. To learn more about our research, visit https://research.facebook.com.
Advance our understanding of data research, such as how to overcome data walls, how to improve data compute efficiency, and how best to create synthetic data Brainstorm with research mentors, review literature and existing solutions of a challenging real-world research problem Develop novel solutions, implement prototypes, and perform extensive experiments to test the proposed solutions in meaningful benchmarks and metrics, analyze the results and verify the conclusions Draft and polish research reports and/or publications Present research outcomes to internal and/or external audiences Contribute research that can be applied to Meta product development

Requisiti del candidato:

Currently has or is in the process of obtaining a PhD degree in the field of Large Language Models, Natural Language Processing, Machine Learning, Artificial Intelligence, or equivalent Experience in Python, C++, or other related languages Proven track record of achieving significant results as demonstrated by publications at leading conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, COLM) Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment Experience advancing AI techniques in Natural Language Processing, including contributions to open source libraries and frameworks in Language Experience manipulating and analyzing complex, large scale, high-dimensionality data from varying sources Experience in utilizing theoretical and empirical research to solve problems Experience working and communicating cross functionally in a team environment Experience with large-scale data processing using e.g. pyspark Experience with Pytorch Intent to return to degree-program after the completion of the internship/co-op

Provenienza: Web dell'azienda
Pubblicato il: 12 Dic 2025  (verificato il 14 Dic 2025)
Tipo di impiego: Stage
Settore: ICT / Informatica
Lingue: Inglese, Polacco
Registrarsi
124.206 lavori e stage
in 158 Paesi
Registrati