ChexFract: From General to Specialized -- Enhancing Fracture Description Generation

📅 2025-11-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In medical AI, general-purpose radiology report generation models exhibit severe limitations in detecting and describing rare yet critical pathologies—such as fractures. To address this, we propose the first dedicated vision-language model for fracture diagnosis in chest X-rays: leveraging transfer learning from MAIRA-2 and CheXagent visual encoders, coupled with a fracture-specific textual decoder, enabling end-to-end training. We systematically analyze how fracture type, anatomical location, and patient age affect model performance. Experiments demonstrate statistically significant improvements (p < 0.01) in fracture description accuracy over general-purpose baselines, particularly for subtle and multifocal fractures. To foster reproducibility and advancement, we fully open-source the code, model weights, and a standardized evaluation benchmark—establishing both a reusable tool and a new benchmark for precise automated reporting of rare pathologies.

Technology Category

Application Category

📝 Abstract
Generating accurate and clinically meaningful radiology reports from chest X-ray images remains a significant challenge in medical AI. While recent vision-language models achieve strong results in general radiology report generation, they often fail to adequately describe rare but clinically important pathologies like fractures. This work addresses this gap by developing specialized models for fracture pathology detection and description. We train fracture-specific vision-language models with encoders from MAIRA-2 and CheXagent, demonstrating significant improvements over general-purpose models in generating accurate fracture descriptions. Analysis of model outputs by fracture type, location, and age reveals distinct strengths and limitations of current vision-language model architectures. We publicly release our best-performing fracture-reporting model, facilitating future research in accurate reporting of rare pathologies.
Problem

Research questions and friction points this paper is trying to address.

Enhancing fracture description generation in chest X-rays
Addressing inadequate rare pathology detection in general models
Developing specialized vision-language models for fracture reporting
Innovation

Methods, ideas, or system contributions that make the work stand out.

Specialized vision-language models for fracture detection
Leveraged MAIRA-2 and CheXagent encoder architectures
Enhanced fracture description accuracy over general models
🔎 Similar Papers
2024-06-08Annual Meeting of the Association for Computational LinguisticsCitations: 2
N
Nikolay Nechaev
Artificial Intelligence Research Institute (AIRI), Moscow, Russia
E
Evgeniia Przhezdzetskaia
Artificial Intelligence Research Institute (AIRI), Moscow, Russia
D
Dmitry Umerenkov
Artificial Intelligence Research Institute (AIRI), Moscow, Russia
Dmitry V. Dylov
Dmitry V. Dylov
Associate Professor, Computational Imaging Lab
applied mathematicscomputational imaging