Locally Differentially Private Thresholding Bandits

📅 2025-07-30

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This paper studies the thresholded multi-armed bandit problem under local differential privacy (LDP): identifying arms whose expected reward exceeds a given threshold, subject to fixed budget and confidence constraints. We propose a privacy mechanism based on Bernoulli randomized response and develop a unified algorithmic framework that integrates concentration inequality analysis with information-theoretic lower bound derivation to jointly optimize privacy preservation and decision efficiency. We prove that the proposed algorithm achieves a sample complexity within at most a logarithmic factor of the fundamental information-theoretic lower bound for LDP threshold identification—establishing, for the first time, near-optimal trade-offs among estimation error, privacy loss, and sampling efficiency. Extensive experiments demonstrate its high efficiency and robustness in arm identification under strong LDP guarantees, revealing the intrinsic precision limits of sequential decision-making under privacy constraints.

Technology Category

Application Category

📝 Abstract

This work investigates the impact of ensuring local differential privacy in the thresholding bandit problem. We consider both the fixed budget and fixed confidence settings. We propose methods that utilize private responses, obtained through a Bernoulli-based differentially private mechanism, to identify arms with expected rewards exceeding a predefined threshold. We show that this procedure provides strong privacy guarantees and derive theoretical performance bounds on the proposed algorithms. Additionally, we present general lower bounds that characterize the additional loss incurred by any differentially private mechanism, and show that the presented algorithms match these lower bounds up to poly-logarithmic factors. Our results provide valuable insights into privacy-preserving decision-making frameworks in bandit problems.

Problem

Research questions and friction points this paper is trying to address.

Impact of local differential privacy on thresholding bandits

Methods to identify arms exceeding reward threshold privately

Theoretical bounds on privacy-preserving bandit algorithms

Innovation

Methods, ideas, or system contributions that make the work stand out.

Bernoulli-based differentially private mechanism

Private responses for threshold identification

Matching lower bounds up to poly-logarithmic factors

🔎 Similar Papers

No similar papers found.

Authors to Follow