Double-Flow GAN model for the reconstruction of perceived faces from brain activities

📅 2023-12-12
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of high-fidelity facial image reconstruction from fMRI data, tackling two key limitations: difficulty in extracting high-level semantic features (e.g., identity, expression, gender) and poor cross-subject generalizability. We propose a dual-stream GAN architecture that enhances discriminator capability and mitigates generator domain bias to improve reconstruction consistency. A vision-feature-based cross-modal pretraining paradigm is introduced to enable effective transfer of conditional generative models. Additionally, a lightweight fMRI-to-neural-response alignment pretraining module is incorporated to significantly boost cross-individual generalization. Our method achieves state-of-the-art performance across multiple quantitative metrics—marking the first demonstration of highly consistent reconstruction of identity, expression, and gender attributes from fMRI, thereby advancing brain-to-image mapping to a new SOTA level.
📝 Abstract
Face plays an important role in humans visual perception, and reconstructing perceived faces from brain activities is challenging because of its difficulty in extracting high-level features and maintaining consistency of multiple face attributes, such as expression, identity, gender, etc. In this study, we proposed a novel reconstruction framework, which we called Double-Flow GAN, that can enhance the capability of discriminator and handle imbalances in images from certain domains that are too easy for generators. We also designed a pretraining process that uses features extracted from images as conditions for making it possible to pretrain the conditional reconstruction model from fMRI in a larger pure image dataset. Moreover, we developed a simple pretrained model for fMRI alignment to alleviate the problem of cross-subject reconstruction due to the variations of brain structure among different subjects. We conducted experiments by using our proposed method and traditional reconstruction models. Results showed that the proposed method is significant at accurately reconstructing multiple face attributes, outperforms the previous reconstruction models, and exhibited state-of-the-art reconstruction abilities.
Problem

Research questions and friction points this paper is trying to address.

Brain Activity
Facial Reconstruction
Inter-individual Variation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Double-Flow GAN
Brain Activity Reconstruction
Individual Adaptation
🔎 Similar Papers
No similar papers found.
Z
Zihao Wang
School of Computer Science and Engineering, Beihang University, Beijing, China
J
Jing Zhao
School of Engineering Medicine, Beihang University, Beijing, China; School of Biological Science and Medical Engineering, Beihang University, Beijing, China
H
Hui Zhang
School of Engineering Medicine, Beihang University, Beijing, China; Key Laboratory of Biomechanics and Mechanobiology, Ministry of Education, Beihang University, Beijing, China; Key Laboratory of Big Data-Based Precision Medicine, Ministry of Industry and Information Technology of the People’s Republic of China, Beihang University, Beijing, China