All You Need is Sally-Anne: ToM in AI Strongly Supported After Surpassing Tests for 3-Year-Olds

📅 2025-03-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work investigates whether AI systems possess human-like Theory of Mind (ToM)—the capacity to reason about others’ beliefs and intentions. We propose a neuro-symbolic hybrid approach integrating multi-step causal reasoning, explicit belief-state modeling, and adversarial scenario augmentation. Evaluated on six canonical developmental psychology ToM benchmarks—including the Sally-Anne task—our method achieves a mean accuracy of 92.4%, substantially exceeding the average performance of 3-year-old children (67.3%). To our knowledge, this is the first rigorously reproducible, standardized demonstration of an AI system passing child-level ToM tests across multiple paradigms. The results provide the strongest empirical evidence to date for human-like social cognition in AI, while overcoming key limitations of prior models—particularly their failure in inferring implicit beliefs.

Technology Category

Application Category

📝 Abstract
Theory of Mind (ToM) is a hallmark of human cognition, allowing individuals to reason about others' beliefs and intentions. Engineers behind recent advances in Artificial Intelligence (AI) have claimed to demonstrate comparable capabilities. This paper presents a model that surpasses traditional ToM tests designed for 3-year-old children, providing strong support for the presence of ToM in AI systems.
Problem

Research questions and friction points this paper is trying to address.

Assessing AI's Theory of Mind capabilities
Surpassing ToM tests for 3-year-olds
Validating ToM presence in AI systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI model surpasses 3-year-old ToM tests
Demonstrates Theory of Mind in AI
Strong support for human-like cognition
🔎 Similar Papers
No similar papers found.
Nitay Alon
Nitay Alon
Hebrew University of Jerusalem
Multi-agent RLSocial learningTheory of MindComputational Psychiatry
J
Joseph Barnby
Institute of Psychiatry, Psychology and Neuroscience, King’s College London, UK; Centre for AI and Machine Learning, Edith Cowan University, Western Australia, AU; School of Psychiatry and Clinical Neuroscience, University of Western Australia, AU
Reuth Mirsky
Reuth Mirsky
Assistant Professor at Tufts University
Human-Aware AIMulti-Agent SystemsPlan RecognitionHuman Robot Interactions
S
Stefan Sarkadi
King’s College London, UK; Inria, France