An Attention Mechanism for Robust Multimodal Integration in a Global Workspace Architecture

πŸ“… 2026-02-09
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the limitations of existing global workspace architectures, which lack effective attention mechanisms and struggle to balance noise robustness with cross-task generalization in multimodal fusion. Inspired by cognitive neuroscience, we propose the first explicit modality selection attention mechanism tailored for global workspace frameworks. Our approach employs a learnable, top-down attention process to dynamically integrate and select relevant modality-specific information. Evaluated on the Simple Shapes and MM-IMDb 1.0 datasets, the proposed method significantly enhances robustness to input noise while achieving performance on par with state-of-the-art approaches on MM-IMDb 1.0. These results demonstrate its superior capacity for cross-task and cross-modal generalization, highlighting the efficacy of biologically inspired attention in multimodal reasoning systems.

Technology Category

Application Category

πŸ“ Abstract
Global Workspace Theory (GWT), inspired by cognitive neuroscience, posits that flexible cognition could arise via the attentional selection of a relevant subset of modalities within a multimodal integration system. This cognitive framework can inspire novel computational architectures for multimodal integration. Indeed, recent implementations of GWT have explored its multimodal representation capabilities, but the related attention mechanisms remain understudied. Here, we propose and evaluate a top-down attention mechanism to select modalities inside a global workspace. First, we demonstrate that our attention mechanism improves noise robustness of a global workspace system on two multimodal datasets of increasing complexity: Simple Shapes and MM-IMDb 1.0. Second, we highlight various cross-task and cross-modality generalization capabilities that are not shared by multimodal attention models from the literature. Comparing against existing baselines on the MM-IMDb 1.0 benchmark, we find our attention mechanism makes the global workspace competitive with the state of the art.
Problem

Research questions and friction points this paper is trying to address.

multimodal integration
attention mechanism
global workspace
noise robustness
cross-modality generalization
Innovation

Methods, ideas, or system contributions that make the work stand out.

attention mechanism
multimodal integration
global workspace theory
noise robustness
cross-modality generalization
πŸ”Ž Similar Papers
No similar papers found.
R
Roland Bertin-Johannet
Univ. Toulouse, CNRS, CerCo; ANITI, Artificial and Natural Intelligence Toulouse Institute
L
Lara Scipio
Univ. Toulouse, CNRS, CerCo; ANITI, Artificial and Natural Intelligence Toulouse Institute
L
Leopold MaytiΓ©
Univ. Toulouse, CNRS, CerCo; ANITI, Artificial and Natural Intelligence Toulouse Institute
Rufin VanRullen
Rufin VanRullen
Research Director, CNRS, CerCo, ANITI, TMBI, Univ. Toulouse
AINeural NetworksVisual PerceptionAttentionOscillations