RankEvolve: Automating the Discovery of Retrieval Algorithms via LLM-Driven Evolution

📅 2026-02-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes a large language model (LLM)-driven program evolution framework that automatically discovers improved lexical retrieval functions by representing candidate algorithms as executable code. Departing from traditional methods like BM25, which rely on manual parameter tuning and struggle to identify superior ranking functions, the approach integrates LLMs with evolutionary search. Starting from seed algorithms such as BM25, it iteratively applies mutation and recombination operators guided by performance feedback across multiple datasets. The evolved retrieval functions consistently outperform strong baselines on BEIR, BRIGHT, and TREC Deep Learning 2019/2020 benchmarks, demonstrating not only significant gains in effectiveness but also strong generalization and cross-dataset transfer capabilities.

Technology Category

Application Category

📝 Abstract
Retrieval algorithms like BM25 and query likelihood with Dirichlet smoothing remain strong and efficient first-stage rankers, yet improvements have mostly relied on parameter tuning and human intuition. We investigate whether a large language model, guided by an evaluator and evolutionary search, can automatically discover improved lexical retrieval algorithms. We introduce RankEvolve, a program evolution setup based on AlphaEvolve, in which candidate ranking algorithms are represented as executable code and iteratively mutated, recombined, and selected based on retrieval performance across 12 IR datasets from BEIR and BRIGHT. RankEvolve starts from two seed programs: BM25 and query likelihood with Dirichlet smoothing. The evolved algorithms are novel, effective, and show promising transfer to the full BEIR and BRIGHT benchmarks as well as TREC DL 19 and 20. Our results suggest that evaluator-guided LLM program evolution is a practical path towards automatic discovery of novel ranking algorithms.
Problem

Research questions and friction points this paper is trying to address.

retrieval algorithms
automatic discovery
ranking algorithms
program evolution
information retrieval
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-driven evolution
program synthesis
lexical retrieval
algorithm discovery
evolutionary search
🔎 Similar Papers
No similar papers found.
J
Jinming Nian
Santa Clara University
F
Fangchen Li
Independent Researcher
D
Dae Hoon Park
Walmart Global Tech
Yi Fang
Yi Fang
Computer Science and Engineering, Santa Clara University
Information retrievalArtificial intelligence