LANTERN: A Machine Learning Framework for Lipid Nanoparticle Transfection Efficiency Prediction

📅 2025-07-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Low screening efficiency of ionizable lipids for RNA therapeutics and poor accuracy/generalizability of existing LNP transfection prediction models—due to low-quality experimental data and weak molecular feature representation—hinder rational design of RNA delivery systems. To address these challenges, we propose LANTERN, a machine learning framework that integrates chemically informative Morgan fingerprints with expert-curated molecular descriptors and employs a lightweight multilayer perceptron, thereby balancing predictive performance and model interpretability. In cross-dataset evaluation, LANTERN achieves R² = 0.8161 and Pearson correlation r = 0.9053, substantially outperforming the state-of-the-art model AGILE (R² = 0.2655). Moreover, LANTERN demonstrates superior stability and extrapolation capability across diverse lipid chemical spaces. This work provides a robust, interpretable, and generalizable predictive tool to accelerate the development of high-efficiency RNA delivery formulations.

Technology Category

Application Category

📝 Abstract
The discovery of new ionizable lipids for efficient lipid nanoparticle (LNP)-mediated RNA delivery remains a critical bottleneck for RNA-based therapeutics development. Recent advances have highlighted the potential of machine learning (ML) to predict transfection efficiency from molecular structure, enabling high-throughput virtual screening and accelerating lead identification. However, existing approaches are hindered by inadequate data quality, ineffective feature representations, low predictive accuracy, and poor generalizability. Here, we present LANTERN (Lipid nANoparticle Transfection Efficiency pRedictioN), a robust ML framework for predicting transfection efficiency based on ionizable lipid representation. We benchmarked a diverse set of ML models against AGILE, a previously published model developed for transfection prediction. Our results show that combining simpler models with chemically informative features, particularly count-based Morgan fingerprints, outperforms more complex models that rely on internally learned embeddings, such as AGILE. We also show that a multi-layer perceptron trained on a combination of Morgan fingerprints and Expert descriptors achieved the highest performance ($ ext{R}^2$ = 0.8161, r = 0.9053), significantly exceeding AGILE ($ ext{R}^2$ = 0.2655, r = 0.5488). We show that the models in LANTERN consistently have strong performance across multiple evaluation metrics. Thus, LANTERN offers a robust benchmarking framework for LNP transfection prediction and serves as a valuable tool for accelerating lipid-based RNA delivery systems design.
Problem

Research questions and friction points this paper is trying to address.

Predicting LNP transfection efficiency for RNA delivery
Overcoming data quality and model accuracy limitations
Accelerating ionizable lipid discovery for RNA therapeutics
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines simpler models with chemically informative features
Uses Morgan fingerprints and Expert descriptors
Outperforms complex models like AGILE
🔎 Similar Papers
No similar papers found.