BSL: A Unified and Generalizable Multitask Learning Platform for Virtual Drug Discovery from Design to Synthesis

📅 2025-08-02
📈 Citations: 0
Influential: 0
📄 PDF

career value

195K/year
🤖 AI Summary
Current virtual drug discovery platforms suffer from two major limitations: (1) incomplete task coverage and fragmented workflows, and (2) weak generalization—particularly for out-of-distribution (OOD) molecules. To address these challenges, we introduce BSL, an open-source deep learning platform that unifies seven core tasks—including molecular generation, property prediction, and activity assessment—within a single end-to-end multi-task learning framework. BSL innovatively integrates generative models with graph neural networks, adopts a modular and extensible architecture, and incorporates OOD-aware representation learning and evaluation mechanisms to significantly enhance cross-molecular-space generalization. Empirically, BSL achieves state-of-the-art performance across multiple benchmarks. Furthermore, it successfully identified three novel, experimentally validated active compounds targeting the GluN1/GluN3A subunits of the NMDA receptor—demonstrating both algorithmic advancement and practical utility in de novo drug discovery.

Technology Category

Application Category

📝 Abstract
Drug discovery is of great social significance in safeguarding human health, prolonging life, and addressing the challenges of major diseases. In recent years, artificial intelligence has demonstrated remarkable advantages in key tasks across bioinformatics and pharmacology, owing to its efficient data processing and data representation capabilities. However, most existing computational platforms cover only a subset of core tasks, leading to fragmented workflows and low efficiency. In addition, they often lack algorithmic innovation and show poor generalization to out-of-distribution (OOD) data, which greatly hinders the progress of drug discovery. To address these limitations, we propose Baishenglai (BSL), a deep learning-enhanced, open-access platform designed for virtual drug discovery. BSL integrates seven core tasks within a unified and modular framework, incorporating advanced technologies such as generative models and graph neural networks. In addition to achieving state-of-the-art (SOTA) performance on multiple benchmark datasets, the platform emphasizes evaluation mechanisms that focus on generalization to OOD molecular structures. Comparative experiments with existing platforms and baseline methods demonstrate that BSL provides a comprehensive, scalable, and effective solution for virtual drug discovery, offering both algorithmic innovation and high-precision prediction for real-world pharmaceutical research. In addition, BSL demonstrated its practical utility by discovering novel modulators of the GluN1/GluN3A NMDA receptor, successfully identifying three compounds with clear bioactivity in in-vitro electrophysiological assays. These results highlight BSL as a promising and comprehensive platform for accelerating biomedical research and drug discovery. The platform is accessible at https://www.baishenglai.net.
Problem

Research questions and friction points this paper is trying to address.

Unified platform for multitask drug discovery workflows
Improving generalization to out-of-distribution molecular data
Integrating seven core tasks with advanced AI technologies
Innovation

Methods, ideas, or system contributions that make the work stand out.

Unified multitask learning platform for drug discovery
Generative models and graph neural networks integration
Focus on generalization to out-of-distribution molecular structures
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
AI Data Engineer--LLMs / Agentic Systems
Pfizer
The annual base salary for this position ranges from $106,000.00 to $176,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 15.0% of the base salary and eligibility to participate in our share based long term incentive program. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
United States - Massachusetts - Cambridge
K
Kun Li
School of Computer Science, Wuhan University, Wuhan 430037, Hubei, China
Z
Zhennan Wu
School of Computer Science, Wuhan University, Wuhan 430037, Hubei, China
Y
Yida Xiong
School of Computer Science, Wuhan University, Wuhan 430037, Hubei, China
Hongzhi Zhang
Hongzhi Zhang
Professor of Computer Science and Technology, Harbin Institute of Technology
Deep LearningArtificial IntelligenceComputer Vision
L
Longtao Hu
School of Computer Science, Wuhan University, Wuhan 430037, Hubei, China
Z
Zhonglie Liu
School of Computer Science, Wuhan University, Wuhan 430037, Hubei, China
J
Junqi Zeng
School of Computer Science, Wuhan University, Wuhan 430037, Hubei, China
Wenjie Wu
Wenjie Wu
Shanghai Jiao Tong University
Machine LearningQuantum ComputingLLM
M
Mukun Chen
School of Computer Science, Wuhan University, Wuhan 430037, Hubei, China
J
Jiameng Chen
School of Computer Science, Wuhan University, Wuhan 430037, Hubei, China
Wenbin Hu
Wenbin Hu
School of Computer, Wuhan University
Artificial IntelligentIntelligent Optimization and SimulationIntelligent Transportation ScienceComplex System and Social N