ArgHiTZ at ArchEHR-QA 2025: A Two-Step Divide and Conquer Approach to Patient Question Answering for Top Factuality

📅 2025-06-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the factual consistency bottleneck in clinical question answering (QA) for the ArchEHR-QA 2025 shared task, this paper proposes a knowledge-free, two-stage decoupled framework. First, it retrieves salient sentence segments from electronic health records (EHRs) via semantic similarity matching, followed by BERT-based re-ranking for precise candidate selection. Second, it conditions an end-to-end generative model exclusively on the refined retrieved evidence to produce answers. Crucially, this work pioneers the deep integration of a re-ranker into the retrieval-generation pipeline—without external knowledge—thereby substantially improving answer factuality. In official evaluation, the system achieved top performance in factual consistency among 30 participating teams (Rank 1), with an overall score of 0.44 (Rank 8). This study establishes an interpretable, reproducible, and lightweight paradigm for high-assurance clinical QA, empirically validating the critical role of decoupled optimization of retrieval and generation in ensuring medical fact consistency.

Technology Category

Application Category

📝 Abstract
This work presents three different approaches to address the ArchEHR-QA 2025 Shared Task on automated patient question answering. We introduce an end-to-end prompt-based baseline and two two-step methods to divide the task, without utilizing any external knowledge. Both two step approaches first extract essential sentences from the clinical text, by prompt or similarity ranking, and then generate the final answer from these notes. Results indicate that the re-ranker based two-step system performs best, highlighting the importance of selecting the right approach for each subtask. Our best run achieved an overall score of 0.44, ranking 8th out of 30 on the leaderboard, securing the top position in overall factuality.
Problem

Research questions and friction points this paper is trying to address.

Automated patient question answering in clinical texts
Extracting essential sentences without external knowledge
Improving factuality in medical question answering
Innovation

Methods, ideas, or system contributions that make the work stand out.

Two-step divide and conquer approach
Prompt-based essential sentence extraction
Re-ranker for optimal sentence selection
🔎 Similar Papers
No similar papers found.
A
Adrián Cuadrón
HiTZ Center - Ixa, University of the Basque Country UPV/EHU
A
Aimar Sagasti
HiTZ Center - Ixa, University of the Basque Country UPV/EHU
M
Maitane Urruela
HiTZ Center - Ixa, University of the Basque Country UPV/EHU
I
Iker De la Iglesia
HiTZ Center - Ixa, University of the Basque Country UPV/EHU
A
Ane G Domingo-Aldame
HiTZ Center - Ixa, University of the Basque Country UPV/EHU
A
Aitziber Atutxa
HiTZ Center - Ixa, University of the Basque Country UPV/EHU
Josu Goikoetxea
Josu Goikoetxea
Associate Professor in UPV/EHU
Ander Barrena
Ander Barrena
University of the Basque Country
Natural Language ProcessingDeep Learning