🤖 AI Summary
Speech development in children with cochlear implants (CIs) exhibits high interindividual variability, and conventional clinical predictors—such as age at implantation and residual hearing—demonstrate limited prognostic accuracy.
Method: This study proposes a deep transfer learning framework leveraging pre-implantation, multi-center structural and functional brain MRI data. It introduces a novel bilinear attention fusion mechanism to enable adaptive, site-invariant weighting of neuroimaging features and synergistic anatomical–functional modeling.
Contribution/Results: The model significantly outperforms traditional machine learning approaches on multi-center data, achieving 92.39% accuracy, 97.7% AUC, and balanced sensitivity (91.22%) and specificity (93.56%). It represents the first demonstration of a single deep learning model generalizing across geographically and institutionally diverse CI cohorts for global prognostic assessment. This work establishes a robust, generalizable neuroimaging biomarker tool for individualized prediction of spoken language outcomes following CI.
📝 Abstract
Cochlear implants (CI) significantly improve spoken language in children with severe-to-profound sensorineural hearing loss (SNHL), yet outcomes remain more variable than in children with normal hearing. This variability cannot be reliably predicted for individual children using age at implantation or residual hearing. This study aims to compare the accuracy of traditional machine learning (ML) to deep transfer learning (DTL) algorithms to predict post-CI spoken language development of children with bilateral SNHL using a binary classification model of high versus low language improvers. A total of 278 implanted children enrolled from three centers. The accuracy, sensitivity and specificity of prediction models based upon brain neuroanatomic features using traditional ML and DTL learning. DTL prediction models using bilinear attention-based fusion strategy achieved: accuracy of 92.39% (95% CI, 90.70%-94.07%), sensitivity of 91.22% (95% CI, 89.98%-92.47%), specificity of 93.56% (95% CI, 90.91%-96.21%), and area under the curve (AUC) of 0.977 (95% CI, 0.969-0.986). DTL outperformed traditional ML models in all outcome measures. DTL was significantly improved by direct capture of discriminative and task-specific information that are advantages of representation learning enabled by this approach over ML. The results support the feasibility of a single DTL prediction model for language prediction of children served by CI programs worldwide.