GastroDL-Fusion: A Dual-Modal Deep Learning Framework Integrating Protein-Ligand Complexes and Gene Sequences for Gastrointestinal Disease Drug Discovery

📅 2025-11-07
📈 Citations: 0
Influential: 0
📄 PDF

career value

210K/year
🤖 AI Summary
Accurate prediction of protein–ligand binding affinity remains challenging in gastrointestinal disease (e.g., gastric ulcers, Crohn’s disease, ulcerative colitis) drug and vaccine development. Method: We propose the first bimodal deep learning framework integrating both protein–ligand structural data and disease-associated gene sequences. It employs Graph Isomorphism Networks (GIN) to encode molecular graph structures and leverages pre-trained protein language models (ProtBERT/ESM) to represent pathogenic gene sequences, augmented by a cross-modal interaction module for joint structural–sequential modeling. Contribution/Results: Evaluated on a gastrointestinal disease target dataset, our framework achieves a mean absolute error (MAE) of 1.12 and root-mean-square error (RMSE) of 1.75—substantially outperforming unimodal baselines (e.g., CNN, BiLSTM). This advance establishes a mechanism-informed paradigm for precision drug discovery.

Technology Category

Application Category

📝 Abstract
Accurate prediction of protein-ligand binding affinity plays a pivotal role in accelerating the discovery of novel drugs and vaccines, particularly for gastrointestinal (GI) diseases such as gastric ulcers, Crohn's disease, and ulcerative colitis. Traditional computational models often rely on structural information alone and thus fail to capture the genetic determinants that influence disease mechanisms and therapeutic responses. To address this gap, we propose GastroDL-Fusion, a dual-modal deep learning framework that integrates protein-ligand complex data with disease-associated gene sequence information for drug and vaccine development. In our approach, protein-ligand complexes are represented as molecular graphs and modeled using a Graph Isomorphism Network (GIN), while gene sequences are encoded into biologically meaningful embeddings via a pre-trained Transformer (ProtBERT/ESM). These complementary modalities are fused through a multi-layer perceptron to enable robust cross-modal interaction learning. We evaluate the model on benchmark datasets of GI disease-related targets, demonstrating that GastroDL-Fusion significantly improves predictive performance over conventional methods. Specifically, the model achieves a mean absolute error (MAE) of 1.12 and a root mean square error (RMSE) of 1.75, outperforming CNN, BiLSTM, GIN, and Transformer-only baselines. These results confirm that incorporating both structural and genetic features yields more accurate predictions of binding affinities, providing a reliable computational tool for accelerating the design of targeted therapies and vaccines in the context of gastrointestinal diseases.
Problem

Research questions and friction points this paper is trying to address.

Predicts protein-ligand binding affinity for gastrointestinal disease drug discovery
Integrates protein-ligand complexes with gene sequence information
Overcomes limitations of structure-only models by capturing genetic determinants
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates protein-ligand complexes with gene sequences
Uses Graph Isomorphism Network for molecular graphs
Fuses modalities via multi-layer perceptron for interactions
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
Ziyang Gao
Ziyang Gao
Southeast University
NLP,LLM
A
Annie Cheung
University of Michigan, Ann Arbor, MI, USA
Y
Yihao Ou
Georgia Institute of Technology, Atlanta, GA, USA