A solvable model for unsupervised federated learning

📅 2026-06-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the performance limitations of collaborative learning among participants (students) in unsupervised federated learning under heterogeneous data distributions. The authors propose a teacher–multi-student generative framework and, drawing upon equilibrium disordered systems theory and Bayesian inference, theoretically demonstrate for the first time that inter-student interactions can systematically enhance learning performance: students with high noise require fewer samples, while those with low noise achieve higher overlap with the true signal. They further derive the optimal Bayesian conditions for recovering the teacher’s signal and establish a formal mapping between the proposed framework and restricted Boltzmann machines. Numerical experiments quantitatively validate the influence of interaction strength, sample complexity, and noise levels on overall learning performance.
📝 Abstract
We introduce a theoretical framework for analyzing federated learning in a generative setting through a teacher-multiple interacting students scenario, in which each student receives a distinct realization of the data, either through a different noise corruption or by accessing a different subset, possibly of varying size. Using theoretical tools in equilibrium disordered system, we analytically show that interactions among students systematically enhance learning performance: highly noisy students require fewer samples to recover the underlying pattern, while low-noise students achieve a larger overlap with the ground-truth signal. We derive the optimal Bayesian conditions for teacher recovery as functions of the sample complexity, noise level, and interaction strength, and validate these predictions through numerical simulations. The resulting dynamics can be mapped onto equilibrium sampling in a Restricted Boltzmann Machine with a structured hidden layer, providing a principled theoretical understanding of how interactions improve distributed generative modeling.
Problem

Research questions and friction points this paper is trying to address.

unsupervised federated learning
teacher-student model
distributed generative modeling
sample complexity
noise corruption
Innovation

Methods, ideas, or system contributions that make the work stand out.

unsupervised federated learning
teacher-student model
disordered systems
Restricted Boltzmann Machine
Bayesian inference
Giovanni Catania
Giovanni Catania
Universidad Complutense Madrid
statistical physics
Aurélien Decelle
Aurélien Decelle
Research, Universidad Politécnica de Madrid
statistical physicsmachine learningBayesian inferenceartificial intelligence
G
Gianluca Manzan
Inria Saclay - Tau team, Bât 660 Université Paris-Saclay, Orsay Cedex 91405; LISN, Tau team, Bât 660 Université Paris-Saclay, Orsay Cedex 91405; Departamento de Física Teórica, Universidad Complutense de Madrid, 28040 Madrid, Spain.; Department of Mathematics, University of Bologna, Piazza di Porta San Donato 5, 40126, Bologna (BO), Italy.
B
Beatriz Seoane
Departamento de Física Teórica & IPARCOS, Universidad Complutense de Madrid, 28040 Madrid, Spain.; GISC - Grupo Interdisciplinar de Sistemas Complejos, 28040 Madrid, Spain.
Daniele Tantari
Daniele Tantari
Università di Bologna
Statistical InferenceStatistical mechanicsQuantitative FinanceComplex NetworksSpin Glasses