Model Fusion via Neuron Transplantation

📅 2025-02-07

🏛️ ECML/PKDD

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

To address the high memory and inference overheads in ensemble learning, as well as the difficulty of cross-model knowledge fusion, this paper proposes a neuron transplantation mechanism—the first to adapt biological neuron migration principles to deep learning. It enables fine-grained, unidirectional, and interpretable parameter-level knowledge injection between heterogeneous pre-trained models, without joint training or data sharing. The method comprises semantic alignment of attention heads and FFN layers, gradient-guided neuron localization, and local weight reparameterization. Evaluated on eight downstream tasks, it achieves an average accuracy improvement of 2.3%, incurs fusion costs less than 10% of full-model fine-tuning, and preserves the source model’s performance with zero degradation.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Improves neural network prediction performance

Reduces memory and inference time

Enables joint pruning and training

Innovation

Methods, ideas, or system contributions that make the work stand out.

Neuron Transplantation model fusion

Joint pruning and training

Reduced memory and fine-tuning

🔎 Similar Papers

Foldable SuperNets: Scalable Merging of Transformers with Different Initializations and Tasks

2024-10-02arXiv.orgCitations: 2

Authors to Follow