Microsoft
Applied Scientist Intern: Multimodal Conversational AI
Found: February 16, 2026
This role is based in Cambridge, United Kingdom.
Responsibilities:
- Conduct experiments and develop algorithms to improve live voice conversation experiences with AI agents.
- Collaborate with CMD Labs researchers and engineers to leverage existing assets and datasets.
- Present results internally and prepare work for publication in leading academic AI conferences.
Qualifications:
- Currently enrolled in a PhD program or published MSc candidate in Computer Science or related field.
- Experience with training transformer models or LLMs using text, audio, and/or images.
- Proficient in Python with experience in PyTorch or TensorFlow.
Preferred:
- Research related to multimodal AI, including computer vision and audio modeling.
- Experience in live speech processing and conversational AI.