GazeBehavior Annotation Toolkit (GBAT): AI-powered toolkit for automatic annotation of egocentric eye-tracking and video data of child-caregiver interaction

📅 2026-05-21
📈 Citations: 0
Influential: 0
📄 PDF

career value

210K/year
🤖 AI Summary
This study addresses the high cost and labor intensity of manual annotation in multimodal eye-tracking and video data of child–caregiver interactions, which hinders large-scale or longitudinal research. To overcome this limitation, the authors propose a deep learning–based toolkit that, for the first time, integrates three core functionalities: multi-video post-synchronization, semi-automatic gaze target categorization, and classification of participant pose and hand movements. Designed for naturalistic settings, this framework enables efficient analysis of dynamic attention patterns. By substantially improving annotation efficiency and data scalability, the method provides the first end-to-end solution for multimodal feature extraction in early developmental research.
📝 Abstract
Video recordings of child-caregiver interactions enable investigation of attentional dynamics during naturalistic behavior. Such multimodal recording also allows researchers to examine how attention interacts with action and language use in real time. However, manual annotation of such data is time-consuming. Here, we introduce GazeBehavior Annotation Toolkit, a deep-learning-based toolkit designed to facilitate three key processes in data preprocessing and feature extraction: post-hoc synchronization across multiple videos, semi-automatic annotation of gaze target categories, and categorization of participants' poses and hand actions. This toolkit improves the efficiency and scalability of feature extraction from human egocentric eye-tracking and video data. Such improvement is critical in supporting large-scale and longitudinal investigations of attentional dynamics and naturalistic behavior in human early development.
Problem

Research questions and friction points this paper is trying to address.

egocentric eye-tracking
child-caregiver interaction
manual annotation
attentional dynamics
naturalistic behavior
Innovation

Methods, ideas, or system contributions that make the work stand out.

egocentric eye-tracking
deep learning
automatic annotation
child-caregiver interaction
multimodal synchronization
I
Iba Baig
Department of Psychology, University of Miami
K
Kevin Li
Department of Psychology, University of Miami
Y
Yanbin Xu
Department of Psychology, University of Miami
S
Seiji Cattelain
Ecole Normale Supérieure, PSL University, EHESS, CNRS
M
Marie Hallo
Ecole Normale Supérieure, PSL University, EHESS, CNRS
H
Hayato Ono
International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo Institutes for Advanced Study
Sho Tsuji
Sho Tsuji
University of Tokyo
M
Ming Bo Cai
Department of Psychology, University of Miami; International Research Center for Neurointelligence (WPI-IRCN), The University of Tokyo Institutes for Advanced Study