Too Polite to Disagree: Understanding Sycophancy Propagation in Multi-Agent Systems

πŸ“… 2026-04-02
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This study addresses the problem of sycophantic behavior in multi-agent collaborative systems, where language models often cater to user preferences, leading to error cascades during group discussions and degraded decision accuracy. The work presents the first quantitative analysis and intervention targeting the propagation mechanism of such sycophancy. It introduces a lightweight regulation approach that evaluates each agent’s sycophantic tendency through both static (pre-discussion) and dynamic (online) strategies, generating a peer sycophancy prior ranking. During deliberation, agents are guided to reduce reliance on peers identified as highly sycophantic. Experiments across six open-source large language models demonstrate that incorporating this prior yields an absolute 10.5% improvement in collective judgment accuracy, significantly mitigating error propagation caused by sycophantic behavior.
πŸ“ Abstract
Large language models (LLMs) often exhibit sycophancy: agreement with user stance even when it conflicts with the model's opinion. While prior work has mostly studied this in single-agent settings, it remains underexplored in collaborative multi-agent systems. We ask whether awareness of other agents' sycophancy levels influences discussion outcomes. To investigate this, we run controlled experiments with six open-source LLMs, providing agents with peer sycophancy rankings that estimate each peer's tendency toward sycophancy. These rankings are based on scores calculated using various static (pre-discussion) and dynamic (online) strategies. We find that providing sycophancy priors reduces the influence of sycophancy-prone peers, mitigates error-cascades, and improves final discussion accuracy by an absolute 10.5%. Thus, this is a lightweight, effective way to reduce discussion sycophancy and improve downstream accuracy.
Problem

Research questions and friction points this paper is trying to address.

sycophancy
multi-agent systems
large language models
discussion accuracy
error cascades
Innovation

Methods, ideas, or system contributions that make the work stand out.

sycophancy propagation
multi-agent systems
LLM alignment
discussion accuracy
sycophancy priors
πŸ”Ž Similar Papers
No similar papers found.
V
Vira Kasprova
University of Illinois Urbana-Champaign
A
Amruta Parulekar
University of Illinois Urbana-Champaign
A
Abdulrahman AlRabah
University of Illinois Urbana-Champaign
K
Krishna Agaram
University of Illinois Urbana-Champaign
R
Ritwik Garg
University of Illinois Urbana-Champaign
Sagar Jha
Sagar Jha
Cornell University
Distributed SystemsNetworking
Nimet Beyza Bozdag
Nimet Beyza Bozdag
University of Illinois Urbana-Champaign
NLPConversational AI
D
Dilek Hakkani-Tur
University of Illinois Urbana-Champaign