DeepGB-TB: A Risk-Balanced Cross-Attention Gradient-Boosted Convolutional Network for Rapid, Interpretable Tuberculosis Screening

📅 2025-08-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenge of tuberculosis (TB) screening in resource-limited settings, this paper proposes a lightweight, interpretable multimodal AI method that performs real-time, offline TB risk assessment on mobile devices using only cough audio recordings and basic demographic data. Methodologically, we introduce a cross-modal bidirectional cross-attention (CM-BCA) module to effectively fuse 1D-CNN–extracted audio features with structured demographic features from gradient-boosting trees, and propose a TB-risk-balanced loss (TRBL) to mitigate high-risk false negatives. Evaluated on real-world data from 1,105 participants across seven countries, our model achieves an AUROC of 0.903 and F1-score of 0.851—significantly outperforming existing approaches. Furthermore, the framework provides clinically interpretable outputs (e.g., feature-level risk attribution), ensuring transparency and facilitating deployment in primary healthcare settings.

Technology Category

Application Category

📝 Abstract
Large-scale tuberculosis (TB) screening is limited by the high cost and operational complexity of traditional diagnostics, creating a need for artificial-intelligence solutions. We propose DeepGB-TB, a non-invasive system that instantly assigns TB risk scores using only cough audio and basic demographic data. The model couples a lightweight one-dimensional convolutional neural network for audio processing with a gradient-boosted decision tree for tabular features. Its principal innovation is a Cross-Modal Bidirectional Cross-Attention module (CM-BCA) that iteratively exchanges salient cues between modalities, emulating the way clinicians integrate symptoms and risk factors. To meet the clinical priority of minimizing missed cases, we design a Tuberculosis Risk-Balanced Loss (TRBL) that places stronger penalties on false-negative predictions, thereby reducing high-risk misclassifications. DeepGB-TB is evaluated on a diverse dataset of 1,105 patients collected across seven countries, achieving an AUROC of 0.903 and an F1-score of 0.851, representing a new state of the art. Its computational efficiency enables real-time, offline inference directly on common mobile devices, making it ideal for low-resource settings. Importantly, the system produces clinically validated explanations that promote trust and adoption by frontline health workers. By coupling AI innovation with public-health requirements for speed, affordability, and reliability, DeepGB-TB offers a tool for advancing global TB control.
Problem

Research questions and friction points this paper is trying to address.

Rapid tuberculosis screening using cough audio and demographics
Minimizing missed TB cases with risk-balanced predictions
Interpretable AI for low-resource clinical settings
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cross-Modal Bidirectional Cross-Attention module for multi-modal fusion
Tuberculosis Risk-Balanced Loss to minimize false negatives
Lightweight CNN and gradient-boosted trees for efficient processing
🔎 Similar Papers
No similar papers found.
Z
Zhixiang Lu
School of Artificial Intelligence and Advanced Computing, Xi’an Jiaotong-Liverpool University
Y
Yulong Li
School of Artificial Intelligence and Advanced Computing, Xi’an Jiaotong-Liverpool University
F
Feilong Tang
Mohamed bin Zayed University of Artificial Intelligence
Zhengyong Jiang
Zhengyong Jiang
Xi’an Jiaotong-Liverpool University
Deep LearningReinforcement Learning
C
Chong Li
School of Artificial Intelligence and Advanced Computing, Xi’an Jiaotong-Liverpool University
Mian Zhou
Mian Zhou
Xi'an Jiaotong-Liverpool University
Tenglong Li
Tenglong Li
Institute of Automation, Chinese Academy of Sciences
Hardware Architecture
Jionglong Su
Jionglong Su
Xi'an Jiaotong-Liverpool University
AI Big Data Machine Learning Statistics