Recent Developments in GNNs for Drug Discovery

📅 2025-06-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing surveys on graph neural networks (GNNs) in computational drug discovery lack systematic, structured taxonomies—particularly across input representations and downstream task paradigms—hindering comparative analysis and methodological advancement. Method: This work presents a comprehensive review of recent GNN advances for molecular generation, property prediction, and drug–drug interaction prediction. We introduce a novel dual-axis classification framework: (i) by input representation type (e.g., atom/bond-level, subgraph-level, 3D geometric), and (ii) by downstream task paradigm (e.g., single- vs. multi-task, supervised vs. self-supervised). Benchmark evaluations are unified across QM9, MoleculeNet, DrugBank, and other standard datasets. Contribution/Results: We establish an end-to-end methodology framework spanning representation learning, model design, and evaluation. Key insights include fundamental capability limits (e.g., modeling long-range dependencies, generalizing across 3D conformers) and persistent challenges (e.g., limited interpretability). The review delivers a structured, knowledge-graph–inspired synthesis—serving as a reproducible, cross-disciplinary guide for algorithmic innovation and translational application.

Technology Category

Application Category

📝 Abstract
In this paper, we review recent developments and the role of Graph Neural Networks (GNNs) in computational drug discovery, including molecule generation, molecular property prediction, and drug-drug interaction prediction. By summarizing the most recent developments in this area, we underscore the capabilities of GNNs to comprehend intricate molecular patterns, while exploring both their current and prospective applications. We initiate our discussion by examining various molecular representations, followed by detailed discussions and categorization of existing GNN models based on their input types and downstream application tasks. We also collect a list of commonly used benchmark datasets for a variety of applications. We conclude the paper with brief discussions and summarize common trends in this important research area.
Problem

Research questions and friction points this paper is trying to address.

Review GNNs' role in computational drug discovery
Explore GNNs for molecule and property prediction
Summarize GNN models and benchmark datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

GNNs analyze complex molecular patterns
GNNs classify models by input types
GNNs utilize benchmark datasets widely
Zhengyu Fang
Zhengyu Fang
Case Western Reserve University
Machine learningDeep LearningGen AITime-SeriesAI for Science
Xiaoge Zhang
Xiaoge Zhang
The Hong Kong Polytechnic University
Artificial IntelligenceRisk and ReliabilityData ScienceUncertainty Quantification
A
Anyin Zhao
Department of Computer and Data Sciences, Case Western Reserve University
X
Xiao Li
Department of Computer and Data Sciences, Case Western Reserve University; Department of Biochemistry, Case Western Reserve University; Center for RNA Science and Therapeutics, Case Western Reserve University; Department of Biomedical Engineering, Case Western Reserve University
Huiyuan Chen
Huiyuan Chen
Amazon
Machine LearningDeep LearningRecommender Systems
J
Jing Li
Department of Computer and Data Sciences, Case Western Reserve University