Covariate Connectivity Combined Clustering for Weighted Networks

📅 2025-11-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional community detection methods rely solely on network topology and thus underperform when node attributes contain community signals; existing covariate-assisted approaches often require pre-specified numbers of clusters, incur high computational costs, and are incompatible with weighted networks. This paper proposes an adaptive spectral clustering framework for weighted networks that jointly models topological connectivity and node covariates. A data-driven mechanism automatically balances their respective contributions, while a spectral gap–based heuristic estimates the number of communities without prior specification or MCMC sampling. The method integrates refined spectral clustering, a joint similarity metric, and adaptive parameter tuning, substantially improving accuracy and robustness. Extensive simulations demonstrate superior performance over state-of-the-art baselines. Empirical evaluation on a real-world airport accessibility network confirms its scalability, interpretability, and practical utility.

Technology Category

Application Category

📝 Abstract
Community detection is a central task in network analysis, with applications in social, biological, and technological systems. Traditional algorithms rely primarily on network topology, which can fail when community signals are partly encoded in node-specific attributes. Existing covariate-assisted methods often assume the number of clusters is known, involve computationally intensive inference, or are not designed for weighted networks. We propose $ ext{C}^4$: Covariate Connectivity Combined Clustering, an adaptive spectral clustering algorithm that integrates network connectivity and node-level covariates into a unified similarity representation. $ ext{C}^4$ balances the two sources of information through a data-driven tuning parameter, estimates the number of communities via an eigengap heuristic, and avoids reliance on costly sampling-based procedures. Simulation studies show that $ ext{C}^4$ achieves higher accuracy and robustness than competing approaches across diverse scenarios. Application to an airport reachability network demonstrates the method's scalability, interpretability, and practical utility for real-world weighted networks.
Problem

Research questions and friction points this paper is trying to address.

Detects communities in weighted networks using node attributes and topology
Overcomes limitations of methods requiring known cluster counts or intensive computation
Provides automated community estimation without sampling-based inference procedures
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates network connectivity and node-level covariates
Balances information sources with data-driven tuning parameter
Estimates community count via eigengap heuristic method
🔎 Similar Papers
No similar papers found.
Z
Zeyu Hu
Department of Statistics, University of Connecticut, Storrs, CT 06269
Wenrui Li
Wenrui Li
Assistant Professor, University of Connecticut
StatisticsNetwork scienceBiostatistics
J
Jun Yan
Department of Statistics, University of Connecticut, Storrs, CT 06269
P
Panpan Zhang
Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203