Sunayana Sitaram
Scholar

Sunayana Sitaram

Google Scholar ID: PUxwYrkAAAAJ
Microsoft Research India
Multilingual NLPevaluationLLMs and culturemultilingualismLLMs
Citations & Impact
All-time
Citations
2,597
 
H-index
25
 
i10-index
50
 
Publications
20
 
Co-authors
167
list available
Resume (English only)
Academic Achievements
  • Released a large dataset Updesh with ~8M data points covering 13 Indian languages. Led the creation of MEGA (Multilingual Evaluation of Generative AI, 2023), the first large-scale benchmark to evaluate generative LLMs on 16 NLP tasks. Research has been shipped in M365 Copilots, supporting 52 languages.
Research Experience
  • Works at Microsoft Research India, leading interdisciplinary teams spanning NLP, machine learning, linguistics, HCI, and social science. Actively contributes to the research community through conference organization and reviewing. In 2025, serving as an Area Chair for ACL Rolling Review (ARR), an Area Chair for COLM, and Tutorial Chair for IndoML. Collaborates actively with product groups within Microsoft and is building a team of Applied Scientists in Bangalore.
Background
  • Research Interests: Multilingual and multicultural NLP, evaluation, and responsible AI. Focuses on ensuring that language technologies work equitably across diverse languages, cultures, and communities. Recent work involves participatory approaches to evaluation, data collection, and policy to ensure that models reflect the preferences of users from diverse regions and cultures.
Miscellany
  • Contributes to Microsoft's policy efforts on multilingual product strategy, focusing on inclusivity and language diversity.