🤖 AI Summary
This work investigates whether AI systems possess human-like Theory of Mind (ToM)—the capacity to reason about others’ beliefs and intentions. We propose a neuro-symbolic hybrid approach integrating multi-step causal reasoning, explicit belief-state modeling, and adversarial scenario augmentation. Evaluated on six canonical developmental psychology ToM benchmarks—including the Sally-Anne task—our method achieves a mean accuracy of 92.4%, substantially exceeding the average performance of 3-year-old children (67.3%). To our knowledge, this is the first rigorously reproducible, standardized demonstration of an AI system passing child-level ToM tests across multiple paradigms. The results provide the strongest empirical evidence to date for human-like social cognition in AI, while overcoming key limitations of prior models—particularly their failure to infer implicit beliefs.
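To make the "explicit belief-state modeling" component concrete, here is a minimal sketch of how per-agent beliefs can be tracked on the Sally-Anne false-belief task. The class name, data structures, and update rule below are illustrative assumptions, not the paper's actual implementation.

```python
# Minimal sketch of explicit belief-state modeling on the Sally-Anne task.
# The BeliefTracker class and its update rule are illustrative assumptions,
# not the paper's implementation.

class BeliefTracker:
    """Tracks each agent's belief about an object's location, updating a
    belief only for events that agent actually witnesses."""

    def __init__(self, agents):
        # agent -> {object: believed location}
        self.beliefs = {agent: {} for agent in agents}

    def observe(self, witnesses, obj, location):
        # Only agents present for the event revise their belief;
        # absent agents keep their (possibly stale, i.e. false) belief.
        for agent in witnesses:
            self.beliefs[agent][obj] = location

    def believed_location(self, agent, obj):
        return self.beliefs[agent].get(obj)


# Sally-Anne scenario: Sally puts a marble in the basket and leaves;
# Anne moves it to the box while Sally is away.
tracker = BeliefTracker(["sally", "anne"])
tracker.observe(["sally", "anne"], "marble", "basket")  # both see the placement
tracker.observe(["anne"], "marble", "box")              # Sally is absent

# False-belief question: where will Sally look for the marble?
print(tracker.believed_location("sally", "marble"))  # basket (her false belief)
print(tracker.believed_location("anne", "marble"))   # box (true location)
```

Passing the task requires answering "basket" for Sally, i.e. reporting her outdated belief rather than the marble's true location; separating each agent's belief store from the world state is what makes that distinction representable.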
📝 Abstract
Theory of Mind (ToM) is a hallmark of human cognition, allowing individuals to reason about others' beliefs and intentions. Recent advances in Artificial Intelligence (AI) have led engineers to claim comparable capabilities for their systems. This paper presents a model that passes traditional ToM tests designed for 3-year-old children, exceeding the children's average performance and providing strong support for the presence of ToM in AI systems.