SurvPFN: Towards Foundation Models for Survival Predictions

📅 2026-06-03
📈 Citations: 0
Influential: 0
📄 PDF

career value

173K/year
🤖 AI Summary
Existing tabular foundation models struggle to effectively model censored data in survival analysis, often leading to biased predictions. This work proposes SurvPFN—the first successful extension of Prior-data Fitted Networks (PFNs) to survival analysis—by pretraining on millions of synthetic survival tasks and framing the problem as continuous-time distribution regression that explicitly accounts for censoring. SurvPFN requires no task-specific architecture, feature engineering, or dataset-specific fine-tuning. It leverages Weibull-distributed event times, a non-informative censoring mechanism, and a novel censored negative log-likelihood loss. Evaluated on real-world datasets from SurvSet, SurvPFN matches or surpasses the performance of established classical and deep learning baselines in survival prediction.
📝 Abstract
Tabular foundation models (TFMs) have made rapid progress in standard classification and regression, but time-to-event survival prediction tasks have remained largely untouched. Unlike in standard regression tasks, survival prediction models must account for censored data. Standard TFMs cannot handle natively censored data, leading to biased and inaccurate predictions, making them unsuitable for real-world applications. To overcome this fundamental limitation, we propose \texttt{SurvPFN}, a prior-data fitted network (PFN), for survival prediction tasks. We pretrain \texttt{SurvPFN} on millions of synthetic survival prediction tasks to learn survival via distributional regression that accounts for censored data. \texttt{SurvPFN} works by (1) generating data with Weibull event times and a non-informative censoring mechanism; (2) integrating a censored event indicator; and (3) minimizing a censored negative log-likelihood. On SurvSet, a collection of real-world survival tasks, \texttt{SurvPFN} is highly competitive with classical and deep survival baselines without per-dataset fitting, a survival-specific architecture, or feature engineering. We show that survival can be treated as a continuous-time distributional regression problem with censored loss, unlocking the power of PFNs for time-to-event predictions.
Problem

Research questions and friction points this paper is trying to address.

survival prediction
censored data
tabular foundation models
time-to-event prediction
distributional regression
Innovation

Methods, ideas, or system contributions that make the work stand out.

Survival Prediction
Censored Data
Prior-Fitted Networks
Distributional Regression
Foundation Models
🔎 Similar Papers
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
S
Samuel Böhm
Institute of Epidemiology and Prevention, Medical Center – University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
Lennart Purucker
Lennart Purucker
PhD Student
Tabular DataBenchmarkingAutoMLMachine LearningFoundation Models
Frank Hutter
Frank Hutter
Prior Labs; ELLIS Institute Tübingen; University of Freiburg
Tabular DataFoundation ModelsAutoMLMeta-LearningDeep Learning
P
Pascal Schlosser
Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, US; CIBSS - Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg, Germany