SkinGPT-R1: Adapter-Only Dual Distillation for Efficient Dermatology Reasoning

📅 2025-11-19
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the opacity and poor verifiability of diagnostic reasoning in dermatology. To this end, we propose an interpretable vision-language reasoning framework grounded in explicit Chain-of-Thought (CoT) prompting. Methodologically, we construct DermCoT—the first dermatology-specific CoT corpus—and design DermEval, a clinically aligned six-dimensional evaluation framework, alongside the benchmark DermBench. We further introduce an Adapter-only dual-distillation architecture that jointly incorporates visual distillation, multi-task supervision, and expert-annotated data for end-to-end training. Experimental results demonstrate that our model achieves 4.031/5 on DermBench—outperforming Vision-R1 by 41%—and consistently surpasses baselines across three major dermatological classification benchmarks. It significantly improves both diagnostic accuracy and the quality of natural-language reasoning explanations, achieving breakthroughs in medical plausibility and reasoning verifiability.

Technology Category

Application Category

📝 Abstract
We present SkinGPT-R1, a dermatology focused vision language model that makes diagnostic chain of thought reasoning explicit, step by step, and verifiable. To support skin specific reasoning, we build DermCoT, a corpus of standardized dermatologic chain of thought narratives that combines 10,000 DermEval filtered training cases with 3,000 dermatologist scored certified cases, and we define DermEval as a physician aligned six dimensional evaluator and DermBench as the corresponding benchmark for dermatologic chain of thought quality. On DermBench, across 14 general, reasoning, and medical vision language models, SkinGPT-R1 achieves an average score of 4.031 out of 5 over the six clinician defined dimensions, ranks 1st among all systems, and improves the average score over Vision-R1 by about 41%. On three dermatology classification benchmarks, SkinGPT-R1 delivers stable accuracy gains over Vision-R1 and remains competitive among strong vision language models. Ablation results further show that DermCoT based chain of thought supervision provides substantial improvements over the base model and that adding dermatology aware visual distillation yields consistent additional gains in both narrative quality and recognition.
Problem

Research questions and friction points this paper is trying to address.

Developing efficient dermatology reasoning with explicit diagnostic chain-of-thought
Creating standardized evaluation benchmarks for dermatologic reasoning quality
Improving diagnostic accuracy through specialized visual-language distillation techniques
Innovation

Methods, ideas, or system contributions that make the work stand out.

Adapter-only dual distillation for efficient reasoning
DermCoT corpus with standardized dermatologic narratives
DermEval six-dimensional physician-aligned evaluation framework
🔎 Similar Papers
No similar papers found.
Y
Yuhao Shen
School of Data Science, The Chinese University of Hong Kong, Shenzhen
J
Jiahe Qian
Institute of Automation, Chinese Academy of Sciences
Z
Zhangtianyi Chen
School of Data Science, The Chinese University of Hong Kong, Shenzhen
Y
Yuanhao He
School of Data Science, The Chinese University of Hong Kong, Shenzhen
Juexiao Zhou
Juexiao Zhou
Assistant Professor, The Chinese University of Hong Kong, Shenzhen
AI for HealthcareEthical AIBioinformaticsPrivacyAGI