Shared Doubt: Zero-shot Cross-Lingual Confidence Estimation for Language Models

📅 2026-05-29
📈 Citations: 0
Influential: 0
📄 PDF

career value

172K/year
🤖 AI Summary
This work addresses the challenge that existing confidence estimation methods for large language models are predominantly limited to English and struggle to generalize across languages without retraining. The authors discover that multilingual large language models harbor a cross-lingual shared confidence subspace within their intermediate layers. Leveraging this insight, they propose a lightweight linear probe trained solely on monolingual (e.g., English) supervision to enable zero-shot cross-lingual confidence estimation. Remarkably, the method requires neither target-language data nor model retraining, yet achieves strong generalization across diverse unseen languages. It significantly outperforms current state-of-the-art confidence estimation approaches and represents the first successful demonstration of cross-lingual confidence transfer without any supervision in the target languages.
📝 Abstract
Confidence estimation (CE), i.e. quantifying the reliability of a model's prediction, has attracted great interest in the context of large language models (LLMs). However, most studies focus on English, ignoring the multilingual reality of LLM usage, while many CE methods degrade or require retraining across languages. To address this gap, we investigate whether multilingual LLMs encode shared, language-transferable confidence features. We use a lightweight linear probe that predicts answer correctness directly from intermediate representations. Trained monolingually, the probe generalizes zero-shot to unseen, typologically diverse languages without target-language supervision. Learned layer weights and multiple ablations reveal that confidence features concentrate in middle layers across languages, suggesting a shared confidence subspace. While zero-shot cross-lingual performance depends on similarity to the source language, the probe provides a strong baseline without any retraining and compares favorably to other popular confidence estimation methods.
Problem

Research questions and friction points this paper is trying to address.

confidence estimation
cross-lingual
zero-shot
multilingual language models
language transfer
Innovation

Methods, ideas, or system contributions that make the work stand out.

zero-shot cross-lingual
confidence estimation
linear probe
shared confidence subspace
multilingual LLMs
🔎 Similar Papers
No similar papers found.