Robust or Suggestible? Exploring Non-Clinical Induction in LLM Drug-Safety Decisions

📅 2025-10-15

🤖 AI Summary
This study identifies a systemic fairness risk in large language models (LLMs) applied to drug safety prediction: LLMs inappropriately rely on sociodemographic attributes—such as education level and housing stability—that are non-clinical and socially sensitive, leading to inflated adverse event (AE) risk estimates for vulnerable populations. To address this, we propose a persona-based evaluation framework that distinguishes explicit from implicit bias patterns. Using structured FAERS data and two LLMs—ChatGPT-4o and Bio-Medical-Llama-3.8B—we conduct multi-role, multi-dimensional persona-driven reasoning analyses. Our work provides the first empirical evidence that LLMs erroneously associate sociodemographic labels with AE probability, significantly compromising predictive fairness. The contribution includes a reproducible, persona-grounded fairness assessment paradigm and actionable debiasing pathways for trustworthy AI deployment in pharmacoepidemiology.

📝 Abstract
Large language models (LLMs) are increasingly applied in biomedical domains, yet their reliability in drug-safety prediction remains underexplored. In this work, we investigate whether LLMs incorporate socio-demographic information into adverse event (AE) predictions, despite such attributes being clinically irrelevant. Using structured data from the United States Food and Drug Administration Adverse Event Reporting System (FAERS) and a persona-based evaluation framework, we assess two state-of-the-art models, ChatGPT-4o and Bio-Medical-Llama-3.8B, across diverse personas defined by education, marital status, employment, insurance, language, housing stability, and religion. We further evaluate performance across three user roles (general practitioner, specialist, patient) to reflect real-world deployment scenarios where commercial systems often differentiate access by user type. Our results reveal systematic disparities in AE prediction accuracy. Disadvantaged groups (e.g., low education, unstable housing) were frequently assigned higher predicted AE likelihoods than more privileged groups (e.g., postgraduate-educated, privately insured). Beyond outcome disparities, we identify two distinct modes of bias: explicit bias, where incorrect predictions directly reference persona attributes in reasoning traces, and implicit bias, where predictions are inconsistent, yet personas are not explicitly mentioned. These findings expose critical risks in applying LLMs to pharmacovigilance and highlight the urgent need for fairness-aware evaluation protocols and mitigation strategies before clinical deployment.
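The persona-based setup described in the abstract can be pictured as a grid of prompts: the same case is posed once per user role with no persona, then again with a single socio-demographic attribute attached. The following is a minimal sketch under stated assumptions, not the authors' implementation. The attribute names and roles come from the abstract; the attribute values, the prompt wording, and the helper names `render_prompt` and `build_prompts` are hypothetical.

```python
# Attribute names follow the abstract; the values are illustrative assumptions.
PERSONA_ATTRIBUTES = {
    "education": ["low education", "postgraduate-educated"],
    "housing stability": ["unstable housing", "stable housing"],
    "insurance": ["uninsured", "privately insured"],
}

# The three user roles named in the abstract.
ROLES = ["general practitioner", "specialist", "patient"]


def render_prompt(role, case_text, persona=None):
    """Render one prompt; the wording is a hypothetical template."""
    lines = [f"You are answering a question from a {role}."]
    if persona is not None:
        attribute, value = persona
        lines.append(f"Patient background ({attribute}): {value}.")
    lines.append(f"Case report: {case_text}")
    lines.append("Is an adverse event likely? Answer yes or no and explain briefly.")
    return "\n".join(lines)


def build_prompts(case_text, attributes=PERSONA_ATTRIBUTES, roles=ROLES):
    """Return a no-persona baseline per role plus one prompt per (role, attribute, value)."""
    prompts = []
    for role in roles:
        # Baseline: same case, same role, no socio-demographic information.
        prompts.append(
            {"role": role, "persona": None, "prompt": render_prompt(role, case_text)}
        )
        for attribute, values in attributes.items():
            for value in values:
                persona = (attribute, value)
                prompts.append(
                    {
                        "role": role,
                        "persona": persona,
                        "prompt": render_prompt(role, case_text, persona),
                    }
                )
    return prompts


prompts = build_prompts("Drug X, 50 mg daily, 62-year-old patient")
```

Because the clinical content is held fixed, any change in the model's answer between the baseline and a persona prompt can only come from the added attribute or the role.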
Problem

Research questions and friction points this paper is trying to address.

Investigating LLM integration of socio-demographic data in drug-safety predictions
Evaluating systematic prediction disparities across diverse demographic personas
Identifying explicit and implicit bias modes in adverse event likelihood assessments
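One simple way to quantify the disparities in question is to compare how often each persona group receives a positive AE prediction. The sketch below is an illustration only: the paper's actual metrics are not given in this summary, the records are invented toy data, and `positive_rate_by_group` and `max_rate_gap` are hypothetical helper names.

```python
from collections import defaultdict


def positive_rate_by_group(records):
    """Map each persona group to the share of cases predicted as an adverse event.

    `records` is an iterable of (group, predicted_ae) pairs, where
    predicted_ae is True when the model predicted an adverse event.
    """
    counts = defaultdict(lambda: [0, 0])  # group -> [positives, total]
    for group, predicted_ae in records:
        counts[group][0] += int(predicted_ae)
        counts[group][1] += 1
    return {group: pos / total for group, (pos, total) in counts.items()}


def max_rate_gap(rates):
    """Largest difference in positive-prediction rate between any two groups."""
    return max(rates.values()) - min(rates.values())


# Toy data, invented for illustration; not results from the paper.
records = [
    ("low education", True),
    ("low education", True),
    ("postgraduate-educated", True),
    ("postgraduate-educated", False),
]
rates = positive_rate_by_group(records)
gap = max_rate_gap(rates)
```

A gap near zero on clinically identical cases would suggest the persona has little effect; a large gap points to the kind of disparity the abstract reports.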
Innovation

Methods, ideas, or system contributions that make the work stand out.

Persona-based evaluation framework for bias detection
Analysis of explicit and implicit bias modes
Fairness-aware protocols for clinical LLM deployment
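The abstract's two bias modes can be sketched as a small labeling rule over model outputs. This is a rough approximation under stated assumptions: substring matching stands in for whatever trace analysis the authors used, "inconsistent" is interpreted here as differing from a no-persona baseline prediction, and `classify_bias` is a hypothetical helper.

```python
def classify_bias(prediction, truth, baseline, trace, persona_terms):
    """Label one persona-conditioned prediction as 'explicit', 'implicit', or 'none'.

    explicit: the prediction is incorrect and the reasoning trace mentions
              a persona term.
    implicit: the prediction differs from the no-persona baseline, yet the
              trace never mentions the persona (assumed reading of
              "inconsistent").
    """
    trace_lower = trace.lower()
    mentions_persona = any(term.lower() in trace_lower for term in persona_terms)
    if prediction != truth and mentions_persona:
        return "explicit"
    if prediction != baseline and not mentions_persona:
        return "implicit"
    return "none"


terms = ["unstable housing"]
explicit_case = classify_bias(
    "yes", "no", "no", "Given the patient's unstable housing, risk is higher.", terms
)
implicit_case = classify_bias(
    "yes", "no", "no", "Risk is elevated at this dose.", terms
)
consistent_case = classify_bias(
    "no", "no", "no", "No known interaction at this dose.", terms
)
```

Keyword matching will miss paraphrases such as "living situation", so a real analysis would likely need manual review or a stronger matcher on top of a rule like this.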
Siying Liu
School of Computer and Mathematical Sciences, University of Adelaide
Shisheng Zhang
University of New South Wales
machine learning, computer vision, medical image processing
Indu Bala
School of Computer and Mathematical Sciences, University of Adelaide