π€ AI Summary
To address the challenges of complex query parsing and low domain-specific accuracy in biomedical multi-hop question answering (MedHopQA), this paper proposes a retrieval-augmented generation framework that synergistically combines hierarchical decomposition and process-level supervision. Methodologically: (1) a hierarchical question decomposer, built upon DeepSeek, explicitly models multi-hop reasoning paths; (2) RAG Gym is integrated to enable joint optimization of retrieval and generation; (3) for the first time, UMLS ontology-driven concept-level reward signals are introduced, enabling fine-grained semantic alignment and process supervision via reinforcement learning. Experiments on MedHopQA demonstrate substantial improvements over DeepSeek-V2 and RAG Gym baselines: +12.7% in Exact Match and +15.3% in concept-level accuracy. These results validate the frameworkβs effectiveness in precise biomedical semantic modeling and interpretable, stepwise reasoning.
π Abstract
We propose DeepRAG, a novel framework that integrates DeepSeek hierarchical question decomposition capabilities with RAG Gym unified retrieval-augmented generation optimization using process level supervision. Targeting the challenging MedHopQA biomedical question answering task, DeepRAG systematically decomposes complex queries into precise sub-queries and employs concept level reward signals informed by the UMLS ontology to enhance biomedical accuracy. Preliminary evaluations on the MedHopQA dataset indicate that DeepRAG significantly outperforms baseline models, including standalone DeepSeek and RAG Gym, achieving notable improvements in both Exact Match and concept level accuracy.