Scholar

Sunayana Sitaram

Google Scholar ID: PUxwYrkAAAAJ

Microsoft Research India

Multilingual NLPevaluationLLMs and culturemultilingualismLLMs

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

2,597

H-index

i10-index

Publications

Co-authors

167

list available

Contact

Emailsunayana.sitaram@microsoft.com TwitterOpen ↗LinkedInOpen ↗

Publications

15 items

The Geometry of LLM-as-Judge: Why Inter-LLM Consensus Is Not Human Alignment

2026

Cited

DEPART: DEcomposing PARiTy across Multilingual LLMs

2026

Cited

Building Benchmarks from the Ground Up: Community-Centered Evaluation of LLMs in Healthcare Chatbot Settings

2025

Cited

The role of synthetic data in Multilingual, Multi-cultural AI systems: Lessons from Indic Languages

2025

Cited

Fluent but Culturally Distant: Can Regional Training Teach Cultural Understanding?

2025

Cited

A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications

2025

Cited

Uncovering inequalities in new knowledge learning by large language models across different languages

2025

Cited

Exploring Pretraining via Active Forgetting for Improving Cross Lingual Transfer for Decoder Language Models

arXiv.org · 2024

Cited

Resume (English only)

Academic Achievements

Released a large dataset Updesh with ~8M data points covering 13 Indian languages. Led the creation of MEGA (Multilingual Evaluation of Generative AI, 2023), the first large-scale benchmark to evaluate generative LLMs on 16 NLP tasks. Research has been shipped in M365 Copilots, supporting 52 languages.

Research Experience

Works at Microsoft Research India, leading interdisciplinary teams spanning NLP, machine learning, linguistics, HCI, and social science. Actively contributes to the research community through conference organization and reviewing. In 2025, serving as an Area Chair for ACL Rolling Review (ARR), an Area Chair for COLM, and Tutorial Chair for IndoML. Collaborates actively with product groups within Microsoft and is building a team of Applied Scientists in Bangalore.

Background

Research Interests: Multilingual and multicultural NLP, evaluation, and responsible AI. Focuses on ensuring that language technologies work equitably across diverse languages, cultures, and communities. Recent work involves participatory approaches to evaluation, data collection, and policy to ensure that models reflect the preferences of users from diverse regions and cultures.

Miscellany