Efficient Algorithms for Adversarially Robust Approximate Nearest Neighbor Search

📅 2026-01-01

🏛️ arXiv.org

📈 Citations: 0

✨ Influential: 0

career value

244K/year

🤖 AI Summary

This work addresses approximate nearest neighbor (ANN) search in adaptive adversarial settings, distinguishing between high-dimensional regimes ($d = \omega(\sqrt{Q})$) and low-dimensional ones ($d = O(\sqrt{Q})$). The paper proposes a novel algorithm that integrates differential privacy, fairness, and locality-sensitive hashing (LSH). Its key contributions include establishing the first theoretical connection between adaptive security and fairness, introducing a concentric-ring LSH structure that overcomes the traditional $\sqrt{n}$ query time barrier, and incorporating robust decision primitives with metric covering techniques. In high dimensions, the method achieves information-theoretically secure and efficient search; in low dimensions, it guarantees— with high probability—“all-or-nothing” correctness for arbitrary queries, substantially outperforming existing approaches.

Technology Category

Application Category

📝 Abstract

We study the Approximate Nearest Neighbor (ANN) problem under a powerful adaptive adversary that controls both the dataset and a sequence of $Q$ queries. Primarily, for the high-dimensional regime of $d = \omega(\sqrt{Q})$, we introduce a sequence of algorithms with progressively stronger guarantees. We first establish a novel connection between adaptive security and \textit{fairness}, leveraging fair ANN search to hide internal randomness from the adversary with information-theoretic guarantees. To achieve data-independent performance, we then reduce the search problem to a robust decision primitive, solved using a differentially private mechanism on a Locality-Sensitive Hashing (LSH) data structure. This approach, however, faces an inherent $\sqrt{n}$ query time barrier. To break the barrier, we propose a novel concentric-annuli LSH construction that synthesizes these fairness and differential privacy techniques. The analysis introduces a new method for robustly releasing timing information from the underlying algorithm instances and, as a corollary, also improves existing results for fair ANN. In addition, for the low-dimensional regime $d = O(\sqrt{Q})$, we propose specialized algorithms that provide a strong ``for-all''guarantee: correctness on \textit{every} possible query with high probability. We introduce novel metric covering constructions that simplify and improve prior approaches for ANN in Hamming and $\ell_p$ spaces.

Problem

Research questions and friction points this paper is trying to address.

Adversarially Robust

Approximate Nearest Neighbor

Adaptive Adversary

High-dimensional Search

Fairness

Innovation

Methods, ideas, or system contributions that make the work stand out.

adversarially robust

fairness

differential privacy