DiagR1: A Vision-Language Model Trained via Reinforcement Learning for Digestive Pathology Diagnosis

📅 2025-07-24
📈 Citations: 0
Influential: 0
🤖 AI Summary
To address factual hallucinations and opaque reasoning in multimodal models for gastrointestinal (GI) pathology image diagnosis, this work constructs a large-scale GI pathology dataset that pairs microscopic descriptions with diagnostic conclusions. It proposes a “prompt argumentation” strategy that incorporates lesion classification and anatomical site information, and a post-training pipeline that follows supervised fine-tuning with Group Relative Policy Optimization (GRPO) to improve reasoning quality and output structure. On real-world pathology report generation, the method reports 18.7% higher clinical relevance, 32.4% better structural completeness, and 41.2% fewer diagnostic errors than state-of-the-art baselines.

📝 Abstract
Multimodal large models have shown great potential in automating pathology image analysis. However, current multimodal models for gastrointestinal pathology are constrained by both data quality and reasoning transparency: pervasive noise and incomplete annotations in public datasets predispose vision-language models to factual hallucinations when generating diagnostic text, while the absence of explicit intermediate reasoning chains makes the outputs difficult to audit and thus less trustworthy in clinical practice. To address these issues, we construct a large-scale gastrointestinal pathology dataset containing both microscopic descriptions and diagnostic conclusions, and propose a prompt argumentation strategy that incorporates lesion classification and anatomical site information. This design guides the model to better capture image-specific features and maintain semantic consistency in generation. Furthermore, we employ a post-training pipeline that combines supervised fine-tuning with Group Relative Policy Optimization (GRPO) to improve reasoning quality and output structure. Experimental results on real-world pathology report generation tasks demonstrate that our approach significantly outperforms state-of-the-art open-source and proprietary baselines in terms of generation quality, structural completeness, and clinical relevance. Specifically, it achieves 18.7% higher clinical relevance, 32.4% better structural completeness, and 41.2% fewer diagnostic errors than state-of-the-art models.
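The abstract says the prompt carries lesion classification and anatomical site information but does not give the template. The following is a minimal sketch of how such a prompt might be assembled. The `CaseContext` class, the `build_prompt` function, the field labels, and the two-section output layout are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CaseContext:
    """Hypothetical metadata attached to one pathology image."""
    lesion_class: str      # illustrative label, e.g. "adenocarcinoma"
    anatomical_site: str   # illustrative label, e.g. "stomach"


def build_prompt(ctx: CaseContext) -> str:
    """Compose a text prompt that carries lesion class and anatomical site.

    The wording and the requested output sections are assumptions;
    the paper's actual template is not given in this summary.
    """
    return (
        f"Anatomical site: {ctx.anatomical_site}\n"
        f"Lesion category: {ctx.lesion_class}\n"
        "Describe the microscopic findings in the image, "
        "then state the diagnostic conclusion.\n"
        "Answer with two sections: 'Microscopic description' and 'Diagnosis'."
    )


prompt = build_prompt(
    CaseContext(lesion_class="adenocarcinoma", anatomical_site="stomach")
)
print(prompt)
```

Putting the site and lesion category ahead of the instruction is one plausible way to condition generation on case-specific context; other orderings or structured formats would serve the same purpose.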
Problem

Research questions and friction points this paper is trying to address.

Improving diagnostic accuracy in gastrointestinal pathology via multimodal models
Reducing factual hallucinations in pathology report generation
Enhancing reasoning transparency and clinical trustworthiness in AI diagnostics
Innovation

Methods, ideas, or system contributions that make the work stand out.

Reinforcement learning for pathology diagnosis
Prompt argumentation with lesion classification
Post-training pipeline with GRPO optimization
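For readers unfamiliar with GRPO, its core step in the commonly used formulation is to score several sampled outputs for the same input and normalize each reward against the group's mean and standard deviation, so no separate value network is needed. The sketch below shows only that normalization. The function name, the `eps` guard, and the example reward values are assumptions, and the reward design used in this paper is not described here.

```python
import statistics


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of sampled outputs.

    Each advantage is (reward - group mean) / (group std + eps).
    Population standard deviation is used here; implementations vary
    on this detail, so treat it as an assumption.
    """
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]


# Hypothetical rewards for four reports sampled for the same image.
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
print(advantages)
```

Outputs that score above the group average receive positive advantages and are reinforced, while below-average outputs are discouraged. If all rewards in a group are equal, every advantage is zero and the group contributes no learning signal.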
Minxi Ouyang
Tsinghua University
cvpathology
Lianghui Zhu
Shenzhen International Graduate School, Tsinghua University, Beijing, China
Yaqing Bao
Greater Bay Area Center for Medical Device Evaluation and Inspection, NMPA, Shenzhen, Guangdong Province, P.R. China
Qiang Huang
Shenzhen Shengqiang Technology Co., Ltd., Shenzhen, Guangdong, China
Jingli Ouyang
Shenzhen International Graduate School, Tsinghua University, Beijing, China
Tian Guan
Shenzhen International Graduate School, Tsinghua University, Beijing, China
Xitong Ling
Tsinghua University
AI4Pathology, Foundation-Model, Vision-Language-Model
Jiawen Li
Shenzhen International Graduate School, Tsinghua University, Beijing, China
Song Duan
Department of Pathology, Chongqing University Affiliated Three Gorges Hospital, Chongqing, China
Wenbin Dai
Shanghai Jiao Tong University
Industrial Edge Computing, Industrial Informatics, Automation Code Generation, Industrial Control Software
Li Zheng
Department of Immunology, College of Basic Medical Sciences, China Medical University, Shenyang, Liaoning Province, P.R. China
Xuemei Zhang
Department of Pathology, Liuzhou People’s Hospital Affiliated to Guangxi Medical University, Liuzhou, Guangxi, China
Yonghong He
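Shenzhen International Graduate School, Tsinghua University
Biomedical engineering, optical imaging, AI image processing, large pathology models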