Towards Conditioning Clinical Text Generation for User Control

📅 2025-02-24
🤖 AI Summary
Clinical deployment of LLMs remains difficult because the models still hallucinate and produce factual inconsistencies, so human oversight is required. This paper explores automated dataset augmentation in which an LLM acts as a human proxy, producing training samples that condition the generation model for clinician control without increasing cognitive workload. On the BioNLP ACL'24 "Discharge Me!" Shared Task, the approach reaches new state-of-the-art results with simpler methods than prior submissions: a 9% relative improvement without augmented training and up to 34% with dataset augmentation. Preliminary human evaluation supports these results and points to gains in relevance, accuracy, and factual consistency.

📝 Abstract
Deploying natural language generation systems in clinical settings remains challenging despite advances in Large Language Models (LLMs), which continue to exhibit hallucinations and factual inconsistencies, necessitating human oversight. This paper explores automated dataset augmentation using LLMs as human proxies to condition LLMs for clinician control without increasing cognitive workload. On the BioNLP ACL'24 Discharge Me! Shared Task, we achieve new state-of-the-art results with simpler methods than prior submissions through more efficient training, yielding a 9% relative improvement without augmented training and up to 34% with dataset augmentation. Preliminary human evaluation further supports the effectiveness of our approach, highlighting the potential of augmenting clinical text generation for control to enhance relevance, accuracy, and factual consistency.
Problem

Research questions and friction points this paper is trying to address.

Enhance clinical text generation for user control
Reduce hallucinations and factual inconsistencies in LLMs
Improve relevance, accuracy, and factual consistency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses LLMs as human proxies to automate dataset augmentation
Conditions clinical text generation for clinician control
Improves relevance, accuracy, and factual consistency
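
The augmentation idea listed above can be illustrated with a minimal sketch. The abstract does not describe the actual pipeline, so everything here is an assumption made for illustration: the `Example` type, the `[GUIDANCE]` and `[NOTES]` markers, and `keyword_proxy`, a hypothetical stand-in that replaces the LLM proxy with a simple keyword picker so the code runs without any model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Example:
    source: str  # model input, e.g. clinical notes
    target: str  # reference text the model should learn to generate


def keyword_proxy(target: str, max_hints: int = 3) -> str:
    """Hypothetical stand-in for the LLM proxy.

    Derives a short guidance string from the reference text. A real setup
    would query an LLM here; this picks the first few distinct long words.
    """
    hints: List[str] = []
    for raw in target.split():
        word = raw.strip(".,;:").lower()
        if len(word) > 6 and word not in hints:
            hints.append(word)
        if len(hints) == max_hints:
            break
    return ", ".join(hints)


def augment(
    examples: List[Example],
    proxy: Callable[[str], str] = keyword_proxy,
) -> List[Example]:
    """Return the original examples plus conditioned copies.

    Each copy carries proxy-generated guidance in its input, so a model
    trained on the result can learn to follow such guidance at inference.
    """
    augmented = list(examples)
    for ex in examples:
        guidance = proxy(ex.target)
        # Assumed input layout; the markers are illustrative, not from the paper.
        conditioned = f"[GUIDANCE] {guidance}\n[NOTES] {ex.source}"
        augmented.append(Example(source=conditioned, target=ex.target))
    return augmented
```

Under these assumptions, the guidance slot that the proxy fills during training is where a clinician's own instructions would go at inference time.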
Osman Alperen Koraş
Institute for AI in Medicine (IKIM), University Hospital Essen (AöR), Essen, Germany
Rabi Bahnan
Institute for AI in Medicine (IKIM), University Hospital Essen (AöR), Essen, Germany
J. Kleesiek
Institute for AI in Medicine (IKIM), University Hospital Essen (AöR), Essen, Germany; Cancer Research Center Cologne Essen (CCCE), West German Cancer Center Essen, University Hospital Essen (AöR), Essen, Germany; German Cancer Consortium (DKTK, Partner site Essen), Heidelberg, Germany; Department of Physics, TU Dortmund, Dortmund, Germany
Amin Dada
Institute for AI in Medicine (IKIM), University Hospital Essen