New Benchmarking Shows Limited Generalization Power of TCR Antigenic Epitope Prediction Models

📅 2026-06-03
📈 Citations: 0
Influential: 0
📄 PDF

career value

194K/year
🤖 AI Summary
Current T cell receptor (TCR) epitope prediction models suffer from limited generalizability and a lack of rigorously defined unseen benchmark datasets for unbiased evaluation. This work addresses these gaps by constructing, for the first time, two complementary and strictly partitioned unseen benchmark datasets, thereby establishing a standardized evaluation framework for TCR–antigen binding prediction. By integrating immune repertoire analysis, computational immunology, and machine learning evaluation methodologies, the study systematically uncovers the performance bottlenecks of state-of-the-art models in real-world scenarios. The resulting benchmarks and foundational framework provide a reliable basis for the development and assessment of next-generation prediction algorithms.
📝 Abstract
Accurate computational prediction of T cell receptor (TCR) antigen specificity would transform the study of T cell biology and enable scalable immune engineering, yet existing models lack sufficient sensitivity and specificity for broad applications. A major limitation is the absence of rigorously defined, unseen benchmark datasets that allow unbiased evaluation of model performance and generalizability. Here, we describe two complementary classes of datasets that meet this criterion and argue that they provide both a robust framework for model assessment and a foundation for next-generation TCR-antigen prediction algorithm development.
Problem

Research questions and friction points this paper is trying to address.

TCR
antigen specificity
benchmarking
generalization
epitope prediction
Innovation

Methods, ideas, or system contributions that make the work stand out.

TCR-antigen prediction
benchmark dataset
model generalization
immune engineering
computational immunology
Yiming Liao
Yiming Liao
Meta
Machine LearningRecommender SystemData Mining
Y
Yiheng Li
Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
N
Ning Jiang
Department of Bioengineering, University of Pennsylvania, Philadelphia, PA 19104, USA; Institute for Immunology & Immune Health, University of Pennsylvania, Philadelphia, PA 19104, USA; Institute for RNA Innovation, University of Pennsylvania, Philadelphia, PA 19104, USA; Abramson Cancer Center, University of Pennsylvania, Philadelphia, PA 19104, USA; Center for Precision Engineering for Health, University of Pennsylvania, Philadelphia, PA 19104, USA; Center for Cellular Immunotherapies, University of Penns
Bo Li
Bo Li
Associate Professor, University of Pennsylvania
Cancer genomicsCancer heterogeneityComputational Cancer immunology
Keke Chen
Keke Chen
Department of Computer Science and Electrical Engineering, University of Maryland Baltimore County
PrivacySecurityData MiningData Intensive ComputingCloud Computing