Research Internship at Cohere
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. Our work is instrumental to the widespread adoption of AI.
We obsess over what we build. Each of us is responsible for increasing the capabilities of our models and the value they provide to our customers. We work hard and move fast to do what’s best for them.
Cohere is a team of researchers, engineers, designers, and more—each one passionate and world-class in their craft. We believe diverse perspectives are essential to building great products.
Join us on our mission and help shape the future!
Why this role?
As a Research Intern at Cohere, you’ll collaborate with researchers and engineers to design and implement novel ideas, contributing to state-of-the-art models and their deployment into production.
We have internship openings across multiple teams including base model training, retrieval augmented generation, data and evaluation, safety, and finetuning. We also welcome applications in any research area related to large language models (LLMs).
Note: You must be currently pursuing a PhD in Machine Learning, NLP, or a related discipline, and available for a full-time internship lasting 4–6 months.
Responsibilities
- Conduct cutting-edge ML research, building and training large language models.
- Work on projects in areas such as evaluation, multimodal models, and optimization.
- Share research through publications, datasets, and open-source code.
- Contribute to research with real-world applications in Cohere's product development.
You may be a good fit if you:
- Are pursuing a PhD in ML, NLP, AI, or a related field (exceptional non-PhD candidates are also considered).
- Are eligible to work in the country of employment throughout the internship.
- Have experience with large-scale distributed training, data pipelines, or state-of-the-art ML models.
- Are familiar with autoregressive sequence models, such as Transformer-based language models.
- Possess strong communication and problem-solving skills.
- Are proficient in Python, C, C++, Lua, or similar languages.
- Have experience with ML frameworks such as JAX, PyTorch, or TensorFlow.
- Have built systems using machine learning or deep learning techniques.
- Are passionate about applied NLP models and products.
Preferred Qualifications
- Publications in top-tier venues in ML, NLP, AI, CV, optimization, CS, stats, applied math, or data science.
- Ability to tackle analytical problems using quantitative methods.
- Proficiency in handling high-dimensional data from multiple sources.
- Experience applying theoretical and empirical research to practical problems.
If you don't meet every qualification but are excited about the role, we encourage you to apply!
Internship Perks (for Full-Time Employees)
- 🤝 Open, inclusive culture and work environment
- 🧑‍💻 Work with a team at the cutting edge of AI research
- 🍽 Weekly lunch stipend, in-office meals & snacks
- 🦷 Full health and dental benefits, with separate mental health budget
- 🐣 100% parental leave top-up for 6 months (Canada, US, UK)
- 🎨 Personal enrichment benefits (arts, culture, wellness, workspace, etc.)
- 🏙 Remote-flexible setup with offices in Toronto, New York, San Francisco, and London; co-working stipend
- ✈️ 6 weeks of vacation
We value diversity and are committed to creating an inclusive environment. If you need accommodations during the recruitment process, please let us know via our Accommodations Request Form.
Join us at Cohere and be part of something extraordinary.