Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins

📅 2025-01-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the high energy consumption and computational demands of large-scale neural network training, this work proposes a photonic–electronic hybrid neural network architecture. It leverages the intrinsic, massive nonlinear optical response of femtosecond pulse propagation in multimode fiber to implement a low-power physical computing layer. Crucially, we introduce the first differentiable digital twin model based on multimode fiber nonlinearity, enabling end-to-end joint training of the photonic and electronic layers while exhibiting strong robustness to experimental drift. Experimental evaluation on image classification achieves state-of-the-art accuracy at the time of publication; simulation and hardware measurements show excellent agreement. Moreover, the architecture significantly reduces computational energy consumption and hardware dependency. By tightly integrating physical dynamics with learnable models, this work establishes a scalable, physics-informed paradigm for green AI.

Technology Category

Application Category

📝 Abstract
The ability to train ever-larger neural networks brings artificial intelligence to the forefront of scientific and technical discoveries. However, their exponentially increasing size creates a proportionally greater demand for energy and computational hardware. Incorporating complex physical events in networks as fixed, efficient computation modules can address this demand by decreasing the complexity of trainable layers. Here, we utilize ultrashort pulse propagation in multimode fibers, which perform large-scale nonlinear transformations, for this purpose. Training the hybrid architecture is achieved through a neural model that differentiably approximates the optical system. The training algorithm updates the neural simulator and backpropagates the error signal over this proxy to optimize layers preceding the optical one. Our experimental results achieve state-of-the-art image classification accuracies and simulation fidelity. Moreover, the framework demonstrates exceptional resilience to experimental drifts. By integrating low-energy physical systems into neural networks, this approach enables scalable, energy-efficient AI models with significantly reduced computational demands.
Problem

Research questions and friction points this paper is trying to address.

Energy Efficiency
Neural Networks
Computational Demand
Innovation

Methods, ideas, or system contributions that make the work stand out.

Physics-based Neural Networks
Optical Fiber Computations
Energy-efficient AI
🔎 Similar Papers
No similar papers found.
I
Ilker Oguz
EPFL, Institute of Electrical and Micro Engineering, 1015 Lausanne, Switzerland
L
Louis J. E. Suter
EPFL, Institute of Electrical and Micro Engineering, 1015 Lausanne, Switzerland
J
Jih-Liang Hsieh
EPFL, Institute of Electrical and Micro Engineering, 1015 Lausanne, Switzerland
Mustafa Yildirim
Mustafa Yildirim
EPFL, Institute of Electrical and Micro Engineering, 1015 Lausanne, Switzerland
N
Niyazi Ulaş Dinç
EPFL, Institute of Electrical and Micro Engineering, 1015 Lausanne, Switzerland
Christophe Moser
Christophe Moser
Professor of Microengineering, EPFL
volume holographic gratingsmultimode fiber imagingmultimode fiber laserVolumetric printing
Demetri Psaltis
Demetri Psaltis
Ecole Polytechnique Federale de Lausanne
Optics