STABLE: Efficient Hybrid Nearest Neighbor Search via Magnitude-Uniformity and Cardinality-Robustness

📅 2026-04-02
🤖 AI Summary
Existing hybrid approximate nearest neighbor search (hybrid ANNS) methods overlook heterogeneity in data distribution, in particular discrepancies in similarity magnitude and sensitivity to attribute cardinality. This work proposes STABLE, a framework that addresses both issues jointly. STABLE introduces the enhanced heterogeneous semantic perception (AUTO) metric to measure feature similarity and attribute consistency together, builds the heterogeneous semantic relation graph (HELP) index on top of AUTO, and uses a Dynamic Heterogeneity Routing method for efficient search. In experiments on five feature vector benchmarks with varying attribute cardinalities, the authors report that STABLE outperforms existing methods.
📝 Abstract
Hybrid Approximate Nearest Neighbor Search (Hybrid ANNS) is a foundational search technology for large-scale heterogeneous data and has gained significant attention in both academia and industry. However, current approaches overlook the heterogeneity in data distribution, thus ignoring two major challenges: the Compatibility Barrier for Similarity Magnitude Heterogeneity and the Tolerance Bottleneck to Attribute Cardinality. To overcome these issues, we propose the robuSt heTerogeneity-Aware hyBrid retrievaL framEwork, STABLE, designed for accurate, efficient, and robust hybrid ANNS on datasets with various distributions. Specifically, we introduce an enhAnced heterogeneoUs semanTic perceptiOn (AUTO) metric to achieve a joint measurement of feature similarity and attribute consistency, addressing similarity magnitude heterogeneity and improving robustness to datasets with various attribute cardinalities. Thereafter, we construct our Heterogeneous sEmantic reLation graPh (HELP) index based on AUTO to organize heterogeneous semantic relations. Finally, we employ a novel Dynamic Heterogeneity Routing method to ensure an efficient search. Extensive experiments on five feature vector benchmarks with various attribute cardinalities demonstrate the superior performance of STABLE.
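To make the "similarity magnitude" problem concrete, the sketch below shows a generic brute-force hybrid search that ranks items by one fused score. It is not the paper's AUTO metric, HELP index, or routing method, none of which are specified on this page. The names `fused_distance`, `hybrid_search`, `scale`, and `weight` are hypothetical, and the choice of a normalized Euclidean term plus a weighted attribute-mismatch term is an assumption made purely for illustration.

```python
import math


def euclidean(a, b):
    """Plain Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def attribute_mismatch(a, b):
    """Fraction of attribute positions that differ (normalized Hamming distance)."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def fused_distance(q_vec, q_attr, x_vec, x_attr, scale, weight=1.0):
    """Illustrative fused score (an assumption, not the paper's AUTO metric).

    `scale` is a hypothetical normalizer that brings the feature distance
    into a range comparable with the attribute term, which lies in [0, 1].
    `weight` controls how strongly attribute mismatches are penalized.
    """
    feature_term = euclidean(q_vec, x_vec) / scale
    attribute_term = weight * attribute_mismatch(q_attr, x_attr)
    return feature_term + attribute_term


def hybrid_search(q_vec, q_attr, items, k, scale, weight=1.0):
    """Brute-force top-k over (item_id, vector, attributes) triples."""
    ranked = sorted(
        items,
        key=lambda it: fused_distance(q_vec, q_attr, it[1], it[2], scale, weight),
    )
    return [it[0] for it in ranked[:k]]
```

With a poorly chosen `scale`, one of the two terms dominates the ranking regardless of `weight`. That is one simple way to read the compatibility issue the abstract describes between feature similarity and attribute consistency.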
Problem

Research questions and friction points this paper is trying to address.

Hybrid ANNS
heterogeneity
similarity magnitude
attribute cardinality
nearest neighbor search
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid ANNS
heterogeneity-aware
similarity magnitude
attribute cardinality
semantic relation graph
Qianyun Yang
School of Software, Shandong University, Jinan, 250100, China; National Graduate School for Elite Engineers, Shandong University, Jinan, 250100, China
Zhiwei Chen
School of Software, Shandong University, Jinan, 250100, China
Yupeng Hu
Shandong University
Multimedia Information Retrieval; Data Mining and Knowledge Discovery
Zixu Li
School of Software, Shandong University, Jinan, 250100, China
Zhiheng Fu
School of Software, Shandong University, Jinan, 250100, China
Liqiang Nie
School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, 518000, China