Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains

📅 2025-05-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional RAG systems rely on similarity-based retrieval and heuristic re-ranking (e.g., top-k), leading to poor interpretability, weak robustness, and vulnerability to adversarial attacks—especially in high-stakes domains like law and finance. To address these limitations, this paper proposes METEORA, a reasoning-driven, three-stage evidence selection framework. It employs a preference-tuned LLM to generate interpretable selection rationales, applies elbow detection for adaptive evidence pruning, and enhances contextual coverage via neighborhood expansion. A dual-LLM coordination mechanism—comprising a rationale generator and a verifier—unifies the reasoning flow, while integrated toxicity filtering and consistency verification eliminate reliance on top-k heuristics. Evaluated across six benchmark datasets, METEORA reduces evidence usage by 50% and improves accuracy by 33.34%. Under adversarial poisoning, its F1 score rises significantly from 0.10 to 0.44.

Technology Category

Application Category

📝 Abstract
Traditional Retrieval-Augmented Generation (RAG) pipelines rely on similarity-based retrieval and re-ranking, which depend on heuristics such as top-k, and lack explainability, interpretability, and robustness against adversarial content. To address this gap, we propose a novel method METEORA that replaces re-ranking in RAG with a rationale-driven selection approach. METEORA operates in two stages. First, a general-purpose LLM is preference-tuned to generate rationales conditioned on the input query using direct preference optimization. These rationales guide the evidence chunk selection engine, which selects relevant chunks in three stages: pairing individual rationales with corresponding retrieved chunks for local relevance, global selection with elbow detection for adaptive cutoff, and context expansion via neighboring chunks. This process eliminates the need for top-k heuristics. The rationales are also used for consistency check using a Verifier LLM to detect and filter poisoned or misleading content for safe generation. The framework provides explainable and interpretable evidence flow by using rationales consistently across both selection and verification. Our evaluation across six datasets spanning legal, financial, and academic research domains shows that METEORA improves generation accuracy by 33.34% while using approximately 50% fewer chunks than state-of-the-art re-ranking methods. In adversarial settings, METEORA significantly improves the F1 score from 0.10 to 0.44 over the state-of-the-art perplexity-based defense baseline, demonstrating strong resilience to poisoning attacks. Code available at: https://anonymous.4open.science/r/METEORA-DC46/README.md
Problem

Research questions and friction points this paper is trying to address.

Replacing heuristic-based re-ranking with rationale-driven selection in RAG
Improving explainability and robustness against adversarial content
Enhancing generation accuracy while reducing chunk usage
Innovation

Methods, ideas, or system contributions that make the work stand out.

Replaces re-ranking with rationale-driven selection
Uses preference-tuned LLM for rationale generation
Verifies content consistency with Verifier LLM
🔎 Similar Papers
No similar papers found.
Y
Yash Saxena
UMBC, Baltimore, Maryland, USA
A
Anpur Padia
UMBC, Baltimore, Maryland, USA
M
Mandar S Chaudhary
eBay Inc., San Jose, California, USA
Kalpa Gunaratna
Kalpa Gunaratna
Staff Research Scientist, Samsung Research America
Natural Language ProcessingLLMKnowledge GraphsDeep Learning
S
Srinivasan Parthasarathy
Ohio State University, Columbus, Ohio, USA
Manas Gaur
Manas Gaur
University of Maryland Baltimore County
Neurosymbolic AIKnowledge-infused LearningArtificial IntelligenceNLPKnowledge Graphs