From Literature to Hypotheses: An AI Co-Scientist System for Biomarker-Guided Drug Combination Hypothesis Generation

📅 2026-02-28
📈 Citations: 0
Influential: 0
📄 PDF

career value

203K/year
🤖 AI Summary
This work addresses the challenge of efficiently translating complex biomarker mechanisms—burdened by the exponential growth of biomedical literature and databases—into testable drug combination hypotheses. To this end, we propose CoDHy, a human–AI collaborative research system that integrates structured databases and unstructured literature to construct a task-oriented knowledge graph. CoDHy introduces a novel framework combining knowledge graph embeddings with agent-based reasoning to enable traceable, intervenable, and transparent generation, validation, and ranking of drug combination hypotheses. Through an interactive interface and an end-to-end workflow, CoDHy effectively supports researcher-driven exploratory hypothesis generation and decision-making in translational oncology, with its feasibility and practical utility demonstrated in real-world scenarios.

Technology Category

Application Category

📝 Abstract
The rapid growth of biomedical literature and curated databases has made it increasingly difficult for researchers to systematically connect biomarker mechanisms to actionable drug combination hypotheses. We present AI Co-Scientist (CoDHy), an interactive, human-in-the-loop system for biomarker-guided drug combination hypothesis generation in cancer research. CoDHy integrates structured biomedical databases and unstructured literature evidence into a task-specific knowledge graph, which serves as the basis for graph-based reasoning and hypothesis construction. The system combines knowledge graph embeddings with agent-based reasoning to generate, validate, and rank candidate drug combinations, while explicitly grounding each hypothesis in retrievable evidence. Through a web-based interface, users can configure the scientific context, inspect intermediate results, and iteratively refine hypotheses, enabling transparent and researcher-steerable exploration rather than automated decision-making. We demonstrate CoDHy as a system for exploratory hypothesis generation and decision support in translational oncology, highlighting its design, interaction workflow, and practical use cases.
Problem

Research questions and friction points this paper is trying to address.

biomarker
drug combination
hypothesis generation
biomedical literature
knowledge graph
Innovation

Methods, ideas, or system contributions that make the work stand out.

knowledge graph
agent-based reasoning
biomarker-guided drug combination
human-in-the-loop AI
hypothesis generation
💼 Related Jobs
AI Data Engineer--LLMs / Agentic Systems
Pfizer
The annual base salary for this position ranges from $106,000.00 to $176,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 15.0% of the base salary and eligibility to participate in our share based long term incentive program. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
United States - Massachusetts - Cambridge
R
Raneen Younis
Peter L. Reichertz Institute for Medical Informatics (PLRI), Hannover Medical School; Lower Saxony Center for Artificial Intelligence and Causal Methods in Medicine (CAIMed)
S
Suvinava Basak
Peter L. Reichertz Institute for Medical Informatics (PLRI), Hannover Medical School; Lower Saxony Center for Artificial Intelligence and Causal Methods in Medicine (CAIMed)
L
Lukas Chavez
Sanford Burnham Prebys (SBP) Medical Discovery Institute, San Diego
Zahra Ahmadi
Zahra Ahmadi
Junior Group Leader, PLRI Medical Informatics Institute, Medical School of Hannover
Human-centered AIMultimodal LearningData MiningMachine Learning