π€ AI Summary
A publicly available, high-quality multimodal MRI dataset for treatment-naΓ―ve nasopharyngeal carcinoma (NPC) is currently lacking, hindering the development of intelligent diagnostic, precise tumor segmentation, and staging algorithms.
Method: We constructed the first open multimodal MRI benchmark dataset specifically for NPC, comprising 831 axial T1-weighted, T2-weighted, and contrast-enhanced T1-weighted scans from 277 treatment-naΓ―ve patients. All scans are accompanied by pixel-level tumor segmentations manually delineated by radiologists and structured clinical metadata. Imaging acquisition followed standardized protocols, and annotations underwent rigorous double-blinded quality control, ensuring tight alignment among imaging sequences, lesion annotations, and clinical phenotypes.
Contribution/Results: This dataset fills a critical gap in publicly accessible, multimodal NPC benchmarks. It serves as foundational infrastructure for training deep learning models, validating algorithms across institutions, and developing clinical decision-support systems.
π Abstract
Multi-modality magnetic resonance imaging data with various sequences facilitate the early diagnosis, tumor segmentation, and disease staging in the management of nasopharyngeal carcinoma (NPC). The lack of publicly available, comprehensive datasets limits advancements in diagnosis, treatment planning, and the development of machine learning algorithms for NPC. Addressing this critical need, we introduce the first comprehensive NPC MRI dataset, encompassing MR axial imaging of 277 primary NPC patients. This dataset includes T1-weighted, T2-weighted, and contrast-enhanced T1-weighted sequences, totaling 831 scans. In addition to the corresponding clinical data, manually annotated and labeled segmentations by experienced radiologists offer high-quality data resources from untreated primary NPC.