Correlation Clustering and (De)Sparsification: Graph Sketches Can Match Classical Algorithms

📅 2025-04-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Approximation ratios for correlation clustering in sublinear models—such as semi-streaming and Massively Parallel Computation (MPC)—have long lagged behind those achieved by classical centralized algorithms. Method: This paper introduces the first generic graph *de-sparsification* paradigm, enabling black-box transfer of classical correlation clustering algorithms to sublinear settings. It employs graph sketching and de-sparsification as a preprocessing step, preserving the original approximation guarantee (e.g., 3-approximation) of any classical algorithm without modifying its internal logic, while respecting sublinear space or communication constraints. Contribution/Results: Theoretical analysis shows that the method’s space and communication complexity depend only on the graph’s sparsity, substantially improving upon prior model-specific approaches. Extensive experiments validate the paradigm’s effectiveness and broad applicability across both semi-streaming and MPC models.

Technology Category

Application Category

📝 Abstract
Correlation clustering is a widely-used approach for clustering large data sets based only on pairwise similarity information. In recent years, there has been a steady stream of better and better classical algorithms for approximating this problem. Meanwhile, another line of research has focused on porting the classical advances to various sublinear algorithm models, including semi-streaming, Massively Parallel Computation (MPC), and distributed computing. Yet, these latter works typically rely on ad-hoc approaches that do not necessarily keep up with advances in approximation ratios achieved by classical algorithms. Hence, the motivating question for our work is this: can the gains made by classical algorithms for correlation clustering be ported over to sublinear algorithms in a emph{black-box manner}? We answer this question in the affirmative by introducing the paradigm of graph de-sparsification.
Problem

Research questions and friction points this paper is trying to address.

Port classical correlation clustering to sublinear algorithms
Match classical approximation ratios in sublinear models
Use graph de-sparsification for black-box adaptation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Graph de-sparsification for sublinear algorithms
Black-box porting of classical algorithm gains
Matching classical algorithms with graph sketches
🔎 Similar Papers
No similar papers found.