Job Specifications
Persons in these roles are welcome to work remotely from Berkeley, CA.
Compensation dependent on current degree enrollment. Research Intern, Undergrad: $94,320 (Annually) Research Intern, Masters: $106,110 (Annually) Research Intern, PhD: $140,000 (Annually)
Who You Are:
Ai2 is seeking talented and motivated Research Interns to join the FlexOlmo team, working on a series of large language models designed for flexible data use, with a focus on Mixture-of-Experts (MoE), long-context language models (LCLMs), and retrieval. Research Interns will be based in Berkeley, CA.
This internship offers a unique opportunity to contribute to cutting-edge research in natural language processing and machine learning in an exciting, fast-paced research environment. As a Research Intern, you have the opportunity to:
Define and lead a high-impact research project.
Train and release leading models.
Collaborate with and learn from team members across Ai2.
Build open-source software for the research community.
Author scientific papers for publication in a high-profile conference or journal.
Ai2 Research Internship Information:
Duration: 12 weeks
Start date: Flexible
Candidates: A PhD candidate or a master/undergraduate student with a strong research background.
We are currently hiring for start dates in 2026. If you have questions, feel free to reach out to olmo-careers@allenai.org. Please apply by January 31, 2026 at 11:00pm Pacific Time to be considered for our Spring/Summer 2026 internships.
Who We Are:
We design new architectures and training methods that help models use data more effectively—through improved training, inference-time conditioning, and retrieval—broadening the types of data they can leverage and ultimately enhancing performance. We also develop scientific methodologies for evaluating and understanding these systems. Our team produces high-impact research and expertly engineered open-source tools that accelerate NLP research worldwide.
We lead the FlexOlmo project, whose first release in July 2025 focused on a new Mixture-of-Experts architecture. Looking ahead, we plan to pursue creative, groundbreaking research that delivers scientific insights and practical solutions for building architectures and training methods that unlock the use of large and diverse data sources.
Your Next Challenge:
Why FlexOlmo? We are building the foundation for research into the next generation of language models designed for flexible data use.
FlexOlmo is a small, tightly knit team, giving you the unique opportunity to work closely with team members toward one high-impact project.
We encourage open collaboration on intern projects, even with researchers at external institutions. Interns will be based in Berkeley, with opportunities to engage actively with the University of California, Berkeley, and the BAIR lab.
Our pay is competitive, and visa sponsorship is available.
We are committed to open science and support students freely publishing papers, as exemplified by our first release: FlexOlmo: Open Language Models for Flexible Data Use.
To see more about our current, future, and past interns, check out our internship page on our website!
What You’ll Need:
Qualifications:
Pursuing a Ph,D. degree in Computer Science or similar field with research experience in machine learning, natural language processing, language and vision, or related areas.
Outstanding individual contributor (IC) skills, especially with deep learning frameworks (e.g. PyTorch).
An outstanding publication record at AI-related venues, such as NeurIPS, ICLR, ICML, COLM, ACL, EMNLP. We will specifically evaluate the quality of publications in terms of rigor and impact, not the quantity.
Research experience in areas such as large language models, training dynamics, scaling laws, and data curation. Experience with mixture-of-experts, long-context language models, and retrieval is preferred but not required.
Located [or willing to relocate] in Berkley, CA.
Physical Demands and Work Environment:
The physical demands described here are representative of those that must be met by a team member to successfully perform the essential functions of this position. Reasonable accommodations may be made to enable individuals with disabilities to perform the functions.
Must be able to remain in a stationary position for long periods of time.
The ability to communicate information and ideas so others will understand. Must be able to exchange accurate information in these situations.
The ability to observe details at close range.
Can work under deadlines.
A Little More About Ai2:
Ai2 is a Seattle based non-profit AI research institute founded in 2014 by the late Paul Allen. Our mission is building breakthrough AI to solve the world’s biggest problems. We develop foundational AI research and innovation to deliver real-world impact through large-scale open models, data, robotics, conservation, and beyond.
In addition to Ai2’s core mission, we also aim to contribu
About the Company
We are a Seattle-based non-profit AI research institute founded in 2014 by the late Paul Allen. We develop foundational AI research and innovation to deliver real-world impact through large-scale open models, data, robotics, conservation, and beyond.
Know more