Released Updesh, a large dataset with ~8M data points covering 13 Indian languages. Led the creation of MEGA (Multilingual Evaluation of Generative AI, 2023), the first large-scale benchmark to evaluate generative LLMs on 16 NLP tasks. Research has been shipped in M365 Copilots, supporting 52 languages.
Research Experience
Works at Microsoft Research India, leading interdisciplinary teams spanning NLP, machine learning, linguistics, HCI, and social science. Actively contributes to the research community through conference organization and reviewing. In 2025, serves as an Area Chair for ACL Rolling Review (ARR), an Area Chair for COLM, and Tutorial Chair for IndoML. Collaborates with product groups within Microsoft and is building a team of Applied Scientists in Bangalore.
Background
Research Interests: Multilingual and multicultural NLP, evaluation, and responsible AI. Focuses on ensuring that language technologies work equitably across diverse languages, cultures, and communities. Recent work involves participatory approaches to evaluation, data collection, and policy to ensure that models reflect the preferences of users from diverse regions and cultures.
Miscellany
Contributes to Microsoft's policy efforts on multilingual product strategy, focusing on inclusivity and language diversity.