GLACIER: A Multimodal Student-Teacher Foundation Model for Molecular Property Prediction

πŸ“… 2026-06-09
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the limitations of existing molecular property prediction models, which often suffer from high computational costs, poor scalability, and a reliance on single-modality inputs, thereby hindering effective integration of complementary information from molecular graphs, SMILES strings, and physicochemical descriptors. To overcome these challenges, we propose the first lightweight multimodal teacher-student framework that jointly pretrains student encoders across all three modalities. Our approach incorporates a Finsler geometry-aware fusion module and a contrastive learning-based knowledge distillation strategy to efficiently transfer knowledge from large teacher models such as MiniMol and MolFormer. The resulting model achieves significant improvements in prediction accuracy and generalization while maintaining a compact architecture, demonstrating a favorable trade-off between performance and efficiency across multiple challenging molecular property prediction tasks. The code is publicly available.
πŸ“ Abstract
Deep learning models facilitate the discovery of molecules with tailored properties among billions of candidate compounds. However, the computational burden to develop and deploy state-of-the-art models continuously increases, limiting their scalability. Most large-scale models are unimodal in nature and overlook the potential to leverage complementary molecular data modalities. To address these shortcomings, this paper introduces the Graph-Language Alignment for Chemical Inference and Exploration using Representations (GLACIER) model, a student-teacher framework that integrates molecular graphs, SMILES strings, and physicochemical descriptors to learn rich molecular embeddings. Our framework consists of three stages: (1) we pretrain three student encoders on 100,000 drug-like molecules: a message-passing neural network for molecular graphs, a transformer-based encoder for SMILES strings, and a multilayer perceptron for physicochemical descriptors, (2) we fuse these student modalities using a novel Finsler geometry-aware module, and (3) distill complementary knowledge from large teacher models, including MiniMol and MolFormer, into a single lightweight model via contrastive learning. We demonstrate that GLACIER is a robust framework that delivers high predictive performance and computational efficiency in complex molecular property prediction tasks. Our code is publicly available at https://github.com/eemokey/glacier.
Problem

Research questions and friction points this paper is trying to address.

molecular property prediction
multimodal learning
computational scalability
deep learning
molecular representation
Innovation

Methods, ideas, or system contributions that make the work stand out.

multimodal fusion
student-teacher framework
Finsler geometry-aware module
molecular representation learning
knowledge distillation
πŸ”Ž Similar Papers
No similar papers found.
πŸ’Ό Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
E
Emily Nguyen
Department of Computer Science, University of Southern California, Los Angeles, California, USA
Y
Yongchan Hong
Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, USA
H
Harsh Toshniwal
Department of Computer Science, University of Southern California, Los Angeles, California, USA
Yan Liu
Yan Liu
Professor, Director of Machine Learning Center@USC
ML for Time SeriesExplainable AIPhysics-informed AIAI for health careAI for Sustainability
A
Andreas Luttens
Department of Medical Biochemistry and Biophysics, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden