🤖 AI Summary
Listwise reranking in information retrieval suffers from limited performance, poor interpretability, and heavy reliance on large-scale annotated data. Method: This paper proposes REARANK-7B, a listwise reasoning reranking agent built on Qwen2.5-7B that combines reinforcement learning with lightweight data augmentation to generalize from only 179 annotated samples. Its core idea is a "reason-then-rerank" paradigm: the model explicitly reasons over list-level relevance relationships and its decision logic before emitting a ranking, improving both accuracy and interpretability. Results: REARANK-7B substantially outperforms baseline models across popular IR benchmarks. It matches GPT-4 on in-domain and out-of-domain tasks, and even surpasses GPT-4 on the reasoning-intensive BRIGHT benchmark, demonstrating efficiency and robustness in reasoning-heavy retrieval scenarios.
📝 Abstract
We present REARANK, a large language model (LLM)-based listwise reasoning reranking agent. REARANK explicitly reasons before reranking, significantly improving both performance and interpretability. Leveraging reinforcement learning and data augmentation, REARANK achieves substantial improvements over baseline models across popular information retrieval benchmarks, notably requiring only 179 annotated samples. Built on top of Qwen2.5-7B, our REARANK-7B demonstrates performance comparable to GPT-4 on both in-domain and out-of-domain benchmarks and even surpasses GPT-4 on the reasoning-intensive BRIGHT benchmark. These results underscore the effectiveness of our approach and highlight how reinforcement learning can enhance LLM reasoning capabilities in reranking.
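To make the listwise "reason-then-rerank" setup concrete, here is a minimal sketch of how such an agent is typically driven: the LLM is prompted with a query and a numbered window of candidate passages, asked to reason first and then output a permutation like `[2] > [1] > [3]`, and the permutation is parsed out of the free-text answer. The function names (`parse_ranking`, `rerank`), the prompt wording, the sliding-window sizes, and the `generate` callable standing in for the LLM are all illustrative assumptions, not the paper's actual implementation.

```python
import re


def parse_ranking(output: str, num_docs: int) -> list[int]:
    """Extract a permutation like '[2] > [1] > [3]' from the model's
    answer, skipping any free-text reasoning that precedes it.

    Returns 0-based indices; candidates the model omits keep their
    original relative order at the end."""
    seen: list[int] = []
    for m in re.finditer(r"\[(\d+)\]", output):
        idx = int(m.group(1)) - 1
        if 0 <= idx < num_docs and idx not in seen:
            seen.append(idx)
    seen += [i for i in range(num_docs) if i not in seen]
    return seen


def rerank(query: str, docs: list[str], generate, window: int = 20,
           stride: int = 10) -> list[str]:
    """Sliding-window listwise rerank: score `window` docs at a time,
    moving from the tail of the list toward the head so that strong
    candidates can bubble up past the window boundary.

    `generate` is any callable mapping a prompt string to the LLM's
    text answer (hypothetical stand-in for a real model call)."""
    docs = list(docs)
    start = max(len(docs) - window, 0)
    while True:
        chunk = docs[start:start + window]
        prompt = (
            f"Query: {query}\n"
            + "\n".join(f"[{i + 1}] {d}" for i, d in enumerate(chunk))
            + "\nReason step by step about relevance, then output a "
              "ranking of the passages like [2] > [1] > [3]."
        )
        order = parse_ranking(generate(prompt), len(chunk))
        docs[start:start + window] = [chunk[i] for i in order]
        if start == 0:
            break
        start = max(start - stride, 0)
    return docs
```

Separating the parse step from the generation step is what makes RL training tractable here: the emitted permutation can be scored against relevance labels (e.g. via NDCG) to form a reward, while the reasoning text before it remains free-form.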