Communication-Efficient Publication of Sparse Vectors under Differential Privacy

📅 2025-06-25

📈 Citations: 0

✨ Influential: 0

career value

223K/year

🤖 AI Summary

Differentially private release of sparse vectors—such as social adjacency matrices, recommendation interaction matrices, and SNP data—is hindered by traditional randomized response mechanisms, whose communication cost scales as Ω(nN), rendering them infeasible for large-scale settings. Method: We propose the first efficient ε-differentially private algorithm whose communication complexity *decreases* with shrinking privacy budget ε, achieving O(εm) communication cost—surpassing the non-private lower bound Ω(m log n). Our approach integrates a refined randomized response mechanism with sparse-structure-aware encoding and lightweight aggregation. Contribution/Results: Under strict ε-differential privacy, our method simultaneously reduces both communication and computational overhead. Theoretical analysis and extensive experiments confirm that it matches the accuracy of classical baselines while drastically cutting communication—especially beneficial for massive sparse data. This represents the first scheme to break the non-private communication barrier in differentially private sparse vector release.

Technology Category

Application Category

📝 Abstract

In this work, we propose a differentially private algorithm for publishing matrices aggregated from sparse vectors. These matrices include social network adjacency matrices, user-item interaction matrices in recommendation systems, and single nucleotide polymorphisms (SNPs) in DNA data. Traditionally, differential privacy in vector collection relies on randomized response, but this approach incurs high communication costs. Specifically, for a matrix with $N$ users, $n$ columns, and $m$ nonzero elements, conventional methods require $Ω(n imes N)$ communication, making them impractical for large-scale data. Our algorithm significantly reduces this cost to $O(varepsilon m)$, where $varepsilon$ is the privacy budget. Notably, this is even lower than the non-private case, which requires $Ω(m log n)$ communication. Moreover, as the privacy budget decreases, communication cost further reduces, enabling better privacy with improved efficiency. We theoretically prove that our method yields results identical to those of randomized response, and experimental evaluations confirm its effectiveness in terms of accuracy, communication efficiency, and computational complexity.

Problem

Research questions and friction points this paper is trying to address.

Reducing communication cost for private sparse vector publication

Enhancing privacy in social network and DNA data sharing

Achieving efficient private matrix aggregation in large-scale systems

Innovation

Methods, ideas, or system contributions that make the work stand out.

Differentially private sparse vector algorithm

Reduces communication cost to O(εm)

Matches randomized response accuracy

🔎 Similar Papers

No similar papers found.