Scaling Behavior Cloning Improves Causal Reasoning: An Open Model for Real-Time Video Game Playing

📅 2026-01-08
🏛️ arXiv.org
🤖 AI Summary
This work investigates whether scaling behavior cloning improves causal reasoning in agents that play complex 3D video games in real time. Using more than 8,300 hours of human gameplay, we train models of up to 1.2 billion parameters that run in real time on a consumer GPU, and the best model is competitive with human players across a variety of 3D games. In a controlled toy setting, more training data and greater network depth lead to a more causal policy, and the same improvement holds at scale as model size and training steps increase. To support further research, we release the gameplay dataset, training and inference code, and pretrained checkpoints under an open license.

📝 Abstract
Behavior cloning has seen a resurgence as scaling model and data sizes has demonstrated strong performance. In this work, we introduce an open recipe for training a video game playing foundation model designed for real-time inference on a consumer GPU. We release all data (8300+ hours of high-quality human gameplay), training and inference code, and pretrained checkpoints under an open license. Empirically, we show that our best model achieves performance competitive with human players across a variety of 3D games. We use this recipe to investigate the scaling laws of behavior cloning, with a focus on causal reasoning. In a controlled toy setting, we first demonstrate that increasing training data and network depth leads to the model learning a more causal policy. We then validate these findings at scale, analyzing models up to 1.2 billion parameters. We observe that the causal improvements seen in the toy domain hold true as model size and training steps increase.
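
For readers unfamiliar with the term, behavior cloning treats policy learning as supervised learning: given recorded (observation, action) pairs from a demonstrator, fit a model that predicts the demonstrator's action. The sketch below is a minimal illustration of that idea only, not the paper's recipe. The linear softmax policy, the synthetic demonstrations, and the names `train_bc` and `act` are all assumptions made for this example.

```python
import numpy as np


def softmax(logits):
    # Subtract the row-wise max for numerical stability.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def train_bc(obs, actions, n_actions, lr=0.5, steps=500):
    """Fit a linear softmax policy to demonstrations.

    Minimizes the mean cross-entropy between the policy's action
    distribution and the demonstrated actions with plain gradient descent.
    """
    n, d = obs.shape
    W = np.zeros((d, n_actions))
    onehot = np.eye(n_actions)[actions]
    for _ in range(steps):
        probs = softmax(obs @ W)
        grad = obs.T @ (probs - onehot) / n  # gradient of mean cross-entropy
        W -= lr * grad
    return W


def act(W, obs):
    # Greedy action: the highest-scoring action for each observation.
    return np.argmax(obs @ W, axis=1)


# Hypothetical demonstrations (not the released dataset): the "expert"
# always picks the action whose index matches the largest feature.
rng = np.random.default_rng(0)
demo_obs = rng.normal(size=(2000, 3))
demo_actions = demo_obs.argmax(axis=1)

W = train_bc(demo_obs, demo_actions, n_actions=3)
accuracy = float((act(W, demo_obs) == demo_actions).mean())
print(f"agreement with demonstrations: {accuracy:.3f}")
```

The cloned policy is scored here only by how often it agrees with the demonstrations it was trained on; evaluating causal reasoning, as the paper does, requires a separate protocol that this sketch does not attempt.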
Problem

Research questions and friction points this paper is trying to address.

behavior cloning
causal reasoning
scaling laws
video game playing
foundation model
Innovation

Methods, ideas, or system contributions that make the work stand out.

behavior cloning
scaling laws
causal reasoning
foundation model
real-time inference
Yuguang Yue
Amazon
Bayesian Statistics, Reinforcement Learning
Irakli Salia
Player2, USA
Samuel Hunt
Player2, USA
Chris Green
Player2, USA
Wenzhe Shi
Player2, USA
Jonathan J. Hunt
Player2, USA