Paper 'TruthTrap: A Bilingual Benchmark for Evaluating Factually Correct Yet Misleading Information in Question Answering' accepted at EMNLP 2025 (Main Conference)
Paper 'Measuring Gender Bias in the Farsi Language' accepted at GeBNLP @ ACL 2025
Paper 'MultiHoax: Benchmarking LLMs on false-premise multi-hop reasoning questions' accepted at ACL 2025 (Findings)
Paper 'Can I introduce my boyfriend to my grandmother? Evaluating Large Language Models' Capabilities on Iranian Social Norm Classification' accepted at NAACL 2025 (Findings)
Multiple papers posted on arXiv covering topics such as LLM alignment evaluation, dehumanizing language detection, directional bias, Farsi commonsense reasoning, and workplace humor understanding
Served as a reviewer for multiple top-tier venues: ethics reviewer for AACL/IJCNLP 2025 and NeurIPS 2025; reviewer for AAAI 2026, Social Sim/SoLaR @ COLM 2025, and SRW/GeBNLP/WOAH @ ACL 2025