Repurposing the scientific literature with vision-language models

📅 2025-02-26
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
General-purpose vision-language models (VLMs) lack domain-specific expertise in neurosurgery, limiting their clinical and academic utility. Method: We introduce NeuroPubs—the first neurosurgery-specific multimodal database—curated from peer-reviewed journals and used to train a domain-customized VLM. Our approach integrates multimodal alignment, curriculum learning, and domain-adaptive fine-tuning to transform scholarly literature into an AI-accessible, multimodal scientific knowledge base. Contribution/Results: The model enables three high-value applications: (1) publication-grade figure caption generation (70% output rated production-ready), (2) ABNS-style medical question synthesis (54% passed expert authenticity evaluation; 89,587 questions generated), and (3) clinical diagnostic support (CNS-Obsidian demonstrated non-inferiority to GPT-4o, *p* = 0.1154). All outputs underwent rigorous validation via expert review and clinical blind testing, establishing a reproducible paradigm for discipline-specific VLM development.

Technology Category

Application Category

📝 Abstract
Research in AI for Science often focuses on using AI technologies to augment components of the scientific process, or in some cases, the entire scientific method; how about AI for scientific publications? Peer-reviewed journals are foundational repositories of specialized knowledge, written in discipline-specific language that differs from general Internet content used to train most large language models (LLMs) and vision-language models (VLMs). We hypothesized that by combining a family of scientific journals with generative AI models, we could invent novel tools for scientific communication, education, and clinical care. We converted 23,000 articles from Neurosurgery Publications into a multimodal database - NeuroPubs - of 134 million words and 78,000 image-caption pairs to develop six datasets for building AI models. We showed that the content of NeuroPubs uniquely represents neurosurgery-specific clinical contexts compared with broader datasets and PubMed. For publishing, we employed generalist VLMs to automatically generate graphical abstracts from articles. Editorial board members rated 70% of these as ready for publication without further edits. For education, we generated 89,587 test questions in the style of the ABNS written board exam, which trainee and faculty neurosurgeons found indistinguishable from genuine examples 54% of the time. We used these questions alongside a curriculum learning process to track knowledge acquisition while training our 34 billion-parameter VLM (CNS-Obsidian). In a blinded, randomized controlled trial, we demonstrated the non-inferiority of CNS-Obsidian to GPT-4o (p = 0.1154) as a diagnostic copilot for a neurosurgical service. Our findings lay a novel foundation for AI with Science and establish a framework to elevate scientific communication using state-of-the-art generative artificial intelligence while maintaining rigorous quality standards.
Problem

Research questions and friction points this paper is trying to address.

AI for scientific publications enhancement
Multimodal database for neurosurgery content
Generative AI for scientific communication tools
Innovation

Methods, ideas, or system contributions that make the work stand out.

Vision-language models repurposed
Multimodal database for AI training
Generated graphical abstracts automatically
🔎 Similar Papers
No similar papers found.
Anton Alyakin
Anton Alyakin
medical student at washington univesity
llmsneurosurgerynetworkscausality
J
Jaden Stryker
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
D
D. A. Alber
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA; New York University Grossman School of Medicine, New York, NY 10016, USA
Karl L. Sangwon
Karl L. Sangwon
Medical Student at NYU Grossman School of Medicine
NeurosurgeryApplied Math
Brandon Duderstadt
Brandon Duderstadt
Calcifer Computing
computational semiotics
A
Akshay V. Save
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
D
D. Kurland
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
S
Spencer Frome
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA; New York University Grossman School of Medicine, New York, NY 10016, USA
S
Shrutika Singh
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
J
Jeff Zhang
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA; Department of Population Health, NYU Langone Health, New York, NY 10016, USA; Division of Applied AI Technologies, NYU Langone Health, New York, NY 10016, USA
E
Eunice Yang
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA; Columbia University Vagelos College of Physicians and Surgeons, New York, NY, USA
K
Ki Yun Park
Department of Neurological Surgery, Washington University in Saint Louis, Saint Louis, MO 63110, USA; Washington University School of Medicine, Saint Louis, MO 63110, USA
C
C. Orillac
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
A
Aly A. Valliani
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
S
Sean Neifert
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
A
Albert Liu
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
A
Aneek Patel
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
C
Christopher Livia
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
D
Darryl Lau
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
I
Ilya Laufer
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
P
P. Rozman
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
E
E. Hidalgo
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA
H
Howard A Riina
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA; Department of Radiology, NYU Langone Health, New York, NY 10016, USA
R
Rui Feng
Department of Neurosurgery, Mount Sinai Health System, New York, NY 10019
T
T. Hollon
Department of Neurosurgery, University of Michigan, Ann Arbor, MI 48109
Y
Yin Aphinyanaphongs
Department of Population Health, NYU Langone Health, New York, NY 10016, USA; Division of Applied AI Technologies, NYU Langone Health, New York, NY 10016, USA
J
J. Golfinos
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA; Department of Otolaryngology - Head and Neck Surgery, NYU Langone Health, New York, NY 10016, USA
L
Laura Snyder
Department of Neurosurgery, Barrow Neurological Institute, Phoenix, AZ 85013, USA
Eric Leuthardt
Eric Leuthardt
Department of Neurosurgery, Washington University in Saint Louis, Saint Louis, MO 63110, USA
D
Douglas Kondziolka
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA; Department of Radiation Oncology, NYU Langone Health, New York, NY 10016, USA
E
E. Oermann
Department of Neurological Surgery, NYU Langone Health, New York, 10016, USA; Department of Radiology, NYU Langone Health, New York, NY 10016, USA; Center for Data Science, New York University, New York, NY 10011, USA; Neuroscience Institute, NYU Langone Health, New York, NY 10016, USA