ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval

📅 2025-02-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Document retrieval faces challenges including high computational cost, excessive memory consumption, and poor interpretability due to high-dimensional embeddings. To address these, we propose an end-to-end trainable binary tree retrieval architecture that jointly optimizes hierarchical node routing functions to achieve semantic alignment between queries and documents along tree paths. The model implicitly learns multi-granularity semantic groupings, enabling both coarse-grained fast filtering and fine-grained precise matching. By employing path-based similarity measurement, our approach balances retrieval accuracy and efficiency: it preserves full representational capacity while outperforming existing hierarchical methods in accuracy and achieving the lowest latency. Moreover, the explicit tree structure significantly enhances system transparency and decision traceability. Overall, this work establishes a novel paradigm for efficient and interpretable document retrieval.

Technology Category

Application Category

📝 Abstract
Document retrieval is a core component of question-answering systems, as it enables conditioning answer generation on new and large-scale corpora. While effective, the standard practice of encoding documents into high-dimensional embeddings for similarity search entails large memory and compute footprints, and also makes it hard to inspect the inner workings of the system. In this paper, we propose a tree-based method for organizing and representing reference documents at various granular levels, which offers the flexibility to balance cost and utility, and eases the inspection of the corpus content and retrieval operations. Our method, called ReTreever, jointly learns a routing function per internal node of a binary tree such that query and reference documents are assigned to similar tree branches, hence directly optimizing for retrieval performance. Our evaluations show that ReTreever generally preserves full representation accuracy. Its hierarchical structure further provides strong coarse representations and enhances transparency by indirectly learning meaningful semantic groupings. Among hierarchical retrieval methods, ReTreever achieves the best retrieval accuracy at the lowest latency, proving that this family of techniques can be viable in practical applications.
Problem

Research questions and friction points this paper is trying to address.

Efficient document retrieval
Memory and compute optimization
Enhanced system transparency
Innovation

Methods, ideas, or system contributions that make the work stand out.

Tree-based document representation
Hierarchical semantic grouping
Low-latency retrieval accuracy
🔎 Similar Papers
No similar papers found.
S
Shubham Gupta
ServiceNow Research, Mila – Qu´ebec Artificial Intelligence Institute, Universit´e Laval
Z
Zichao Li
ServiceNow Research, McGill University, Mila – Qu´ebec Artificial Intelligence Institute
T
Tianyi Chen
ServiceNow Research, Mila – Qu´ebec Artificial Intelligence Institute
Cem Subakan
Cem Subakan
Assistant Prof. at Laval University, Computer Science Dept. / Mila, Associate Academic Member
Machine LearningLearning AlgorithmsMachine Learning for Speech and Audio
Siva Reddy
Siva Reddy
McGill University, Mila Quebec AI Institute
Natural Language ProcessingComputational LinguisticsDeep LearningSemantics
P
Perouz Taslakian
ServiceNow Research
Valentina Zantedeschi
Valentina Zantedeschi
ServiceNow, Laval University
Machine Learning