Empowering Chemical Structures with Biological Insights for Scalable Phenotypic Virtual Screening

πŸ“… 2026-03-16
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This study addresses a critical gap in drug discovery: structural screening lacks biological context, while phenotypic screening, though biologically relevant, is costly and difficult to scale. To bridge this divide, the authors propose DECODE, a novel framework that embeds biological semantics directly into chemical representations. Leveraging limited paired transcriptomic and morphological data as supervisory signals, DECODE constructs a scalable biological fingerprint encoder that operates without requiring biological assay data during inference. The method enables high-accuracy zero-shot phenotypic prediction without any biological inputs at test time and effectively mitigates experimental noise. In zero-shot mechanism-of-action prediction tasks, DECODE achieves over a 20% performance improvement, and in external validation, it demonstrates a six-fold increase in hit rates for novel anticancer compounds.

Technology Category

Application Category

πŸ“ Abstract
Motivation: The scalable identification of bioactive compounds is essential for contemporary drug discovery. This process faces a key trade-off: structural screening offers scalability but lacks biological context, whereas high-content phenotypic profiling provides deep biological insights but is resource-intensive. The primary challenge is to extract robust biological signals from noisy data and encode them into representations that do not require biological data at inference. Results: This study presents DECODE (DEcomposing Cellular Observations of Drug Effects), a framework that bridges this gap by empowering chemical representations with intrinsic biological semantics to enable structure-based in silico biological profiling. DECODE leverages limited paired transcriptomic and morphological data as supervisory signals during training, enabling the extraction of a measurement-invariant biological fingerprint from chemical structures and explicit filtering of experimental noise. Our evaluations demonstrate that DECODE retrieves functionally similar drugs in zero-shot settings with over 20% relative improvement over chemical baselines in mechanism-of-action (MOA) prediction. Furthermore, the framework achieves a 6-fold increase in hit rates for novel anti-cancer agents during external validation. Availability and implementation: The codes and datasets of DECODE are available at https://github.com/lian-xiao/DECODE.
Problem

Research questions and friction points this paper is trying to address.

phenotypic screening
chemical representation
biological semantics
scalable drug discovery
bioactive compounds
Innovation

Methods, ideas, or system contributions that make the work stand out.

biological fingerprint
phenotypic virtual screening
zero-shot drug retrieval
structure-based biological profiling
DECODE
πŸ”Ž Similar Papers
No similar papers found.
X
Xiaoqing Lian
State Key Laboratory of Chemo and Biosensing, College of Computer Science and Electronic Engineering; The Ministry of Education Key Laboratory of Fusion Computing of Supercomputing and Artificial Intelligence, Hunan University, 410082, Changsha, China
P
Pengsen Ma
State Key Laboratory of Chemo and Biosensing, College of Computer Science and Electronic Engineering; The Ministry of Education Key Laboratory of Fusion Computing of Supercomputing and Artificial Intelligence, Hunan University, 410082, Changsha, China
T
Tengfeng Ma
State Key Laboratory of Chemo and Biosensing, College of Computer Science and Electronic Engineering; The Ministry of Education Key Laboratory of Fusion Computing of Supercomputing and Artificial Intelligence, Hunan University, 410082, Changsha, China
Z
Zhonghao Ren
State Key Laboratory of Chemo and Biosensing, College of Computer Science and Electronic Engineering; The Ministry of Education Key Laboratory of Fusion Computing of Supercomputing and Artificial Intelligence, Hunan University, 410082, Changsha, China
X
Xibao Cai
State Key Laboratory of Chemo and Biosensing, College of Computer Science and Electronic Engineering; The Ministry of Education Key Laboratory of Fusion Computing of Supercomputing and Artificial Intelligence, Hunan University, 410082, Changsha, China
Z
Zhixiang Cheng
State Key Laboratory of Chemo and Biosensing, College of Computer Science and Electronic Engineering; The Ministry of Education Key Laboratory of Fusion Computing of Supercomputing and Artificial Intelligence, Hunan University, 410082, Changsha, China
Bosheng Song
Bosheng Song
Associate Professor, Hunan University
Membrane ComputingFormal Languages
H
He Wang
School of Artificial Intelligence and Computer Science, Jiangnan University, 214122, Wuxi, China
Xiang Pan
Xiang Pan
Jiangnan University
医学人ε·₯智能;AIθ―η‰©ε‘ηŽ°δΈŽεΌ€ε‘
Y
Yangyang Chen
Department of Computer Science, University of Tsukuba, Tsukuba, Ibaraki, Japan
S
Sisi Yuan
School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Kowloon, 999077, Hong Kong SAR, China
Chen Lin
Chen Lin
Professor of Radiology and Imaging Science, IU School of Medicine
RadiologyMedical ImagingMRI