FedWSQ: Efficient Federated Learning with Weight Standardization and Distribution-Aware Non-Uniform Quantization

📅 2025-06-30
📈 Citations: 0
Influential: 0
📄 PDF

career value

175K/year
🤖 AI Summary
Federated learning faces two critical challenges: model bias induced by statistical heterogeneity (non-IID data) and excessive communication overhead. To address these, we propose FedWSQ—a novel framework integrating Weight Standardization (WS) and Distribution-Aware Non-Uniform Quantization (DANUQ). WS mitigates bias by normalizing local model updates across clients, while DANUQ dynamically constructs non-uniform quantization intervals based on local gradient/weight distributions, preserving information fidelity even at ultra-low bitwidths (e.g., 1–2 bits). Evaluated on CIFAR-10/100 and Tiny-ImageNet under extreme non-IID settings and stringent bandwidth constraints, FedWSQ achieves 3.2–7.8% higher accuracy than state-of-the-art baselines and reduces communication volume to only 12–18% of the original. To the best of our knowledge, this is the first approach to jointly optimize for both high robustness against data heterogeneity and ultra-efficient transmission.

Technology Category

Application Category

📝 Abstract
Federated learning (FL) often suffers from performance degradation due to key challenges such as data heterogeneity and communication constraints. To address these limitations, we present a novel FL framework called FedWSQ, which integrates weight standardization (WS) and the proposed distribution-aware non-uniform quantization (DANUQ). WS enhances FL performance by filtering out biased components in local updates during training, thereby improving the robustness of the model against data heterogeneity and unstable client participation. In addition, DANUQ minimizes quantization errors by leveraging the statistical properties of local model updates. As a result, FedWSQ significantly reduces communication overhead while maintaining superior model accuracy. Extensive experiments on FL benchmark datasets demonstrate that FedWSQ consistently outperforms existing FL methods across various challenging FL settings, including extreme data heterogeneity and ultra-low-bit communication scenarios.
Problem

Research questions and friction points this paper is trying to address.

Addresses performance degradation in federated learning from data heterogeneity
Reduces communication overhead with efficient weight standardization and quantization
Improves model accuracy in extreme data heterogeneity and low-bit scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Weight standardization filters biased local updates
Distribution-aware non-uniform quantization minimizes errors
Reduces communication overhead while maintaining accuracy