Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation

📅 2025-06-05

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Current RAG systems over-rely on retrieved context, rendering them vulnerable to misleading or redundant information while neglecting the large language model’s intrinsic parametric knowledge. To address this, we propose a multi-strategy joint sampling framework that dynamically integrates parametric and contextual knowledge and resolves conflicts between them through strategy optimization, collaborative sampling across multiple distributions, and explicit knowledge capability modeling. Our approach introduces, for the first time in knowledge capability exploration, a learnable joint distribution mechanism. This significantly enhances RAG’s robustness against parametric–contextual conflicts: it achieves a 17.07% absolute accuracy gain on counterfactual reasoning tasks and delivers consistent performance improvements across diverse RAG benchmarks. The framework establishes a novel paradigm for balancing retrieval augmentation with model-internal knowledge, advancing the principled integration of parametric and external knowledge sources.

Technology Category

Application Category

📝 Abstract

Retrieval-augmented generation (RAG) is a mainstream method for improving performance on knowledge-intensive tasks. However,current RAG systems often place too much emphasis on retrieved contexts. This can lead to reliance on inaccurate sources and overlook the model's inherent knowledge, especially when dealing with misleading or excessive information. To resolve this imbalance, we propose Knowledgeable-r1 that using joint sampling and define multi policy distributions in knowledge capability exploration to stimulate large language models'self-integrated utilization of parametric and contextual knowledge. Experiments show that Knowledgeable-r1 significantly enhances robustness and reasoning accuracy in both parameters and contextual conflict tasks and general RAG tasks, especially outperforming baselines by 17.07% in counterfactual scenarios and demonstrating consistent gains across RAG tasks. Our code are available at https://github.com/lcy80366872/ knowledgeable-r1.

Problem

Research questions and friction points this paper is trying to address.

Balancing reliance on retrieved contexts and model's inherent knowledge

Addressing inaccuracies from misleading or excessive information in RAG

Enhancing robustness and reasoning accuracy in knowledge-intensive tasks

Innovation

Methods, ideas, or system contributions that make the work stand out.

Joint sampling for knowledge exploration

Multi-policy distributions in RAG

Enhancing parametric and contextual knowledge integration

🔎 Similar Papers

No similar papers found.

Authors to Follow