Model Fusion via Neuron Transplantation

📅 2025-02-07
🏛️ ECML/PKDD
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the high memory and inference overheads in ensemble learning, as well as the difficulty of cross-model knowledge fusion, this paper proposes a neuron transplantation mechanism—the first to adapt biological neuron migration principles to deep learning. It enables fine-grained, unidirectional, and interpretable parameter-level knowledge injection between heterogeneous pre-trained models, without joint training or data sharing. The method comprises semantic alignment of attention heads and FFN layers, gradient-guided neuron localization, and local weight reparameterization. Evaluated on eight downstream tasks, it achieves an average accuracy improvement of 2.3%, incurs fusion costs less than 10% of full-model fine-tuning, and preserves the source model’s performance with zero degradation.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Improves neural network prediction performance
Reduces memory and inference time
Enables joint pruning and training
Innovation

Methods, ideas, or system contributions that make the work stand out.

Neuron Transplantation model fusion
Joint pruning and training
Reduced memory and fine-tuning
M
Muhammed Öz
Scientific Computing Center (SCC), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
N
Nicholas Kiefer
Scientific Computing Center (SCC), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
Charlotte Debus
Charlotte Debus
Junior Group Leader, Karlsruhe Institute of Technology (KIT)
Artificial Intelligence and Machine LearningImage AnalysisTime Series AnalysisAnomaly Detection
J
Jasmin Hörter
Scientific Computing Center (SCC), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
Achim Streit
Achim Streit
Director of Scientific Computing Center (SCC), Professor for Computer Science, Karlsruhe
computational science and engineeringdistributed systemsgrid computingdata management
M
Markus Goetz
Scientific Computing Center (SCC), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz Platz 1, 76344 Eggenstein-Leopoldshafen, Germany