🤖 AI Summary
In survival analysis, discriminative measures such as the C-index and time-dependent AUC rely on scoring rules (e.g., Cox linear predictors), but their estimators suffer from bias when the censoring model is misspecified. This paper proposes a model-robust, doubly robust, and efficient nonparametric estimator for discrimination metrics under linear scoring rules, which does not require correct specification of the censoring mechanism. Our key contribution is the first formal definition of a nonparametric coefficient vector under no-censoring assumptions, integrated with influence function theory, data-adaptive modeling (e.g., machine learning), and efficient semiparametric estimation. Simulation studies demonstrate substantially reduced bias and smaller variance compared to standard approaches. In a real-world glioblastoma dataset, our estimator exhibits improved stability, clinical interpretability, and superior performance over prevailing methods.
📝 Abstract
Discrimination measures such as the concordance index and the cumulative-dynamic time-dependent area under the ROC-curve (AUC) are widely used in the medical literature for evaluating the predictive accuracy of a scoring rule which relates a set of prognostic markers to the risk of experiencing a particular event. Often the scoring rule being evaluated in terms of discriminatory ability is the linear predictor of a survival regression model such as the Cox proportional hazards model. This has the undesirable feature that the scoring rule depends on the censoring distribution when the model is misspecified. In this work we focus on linear scoring rules where the coefficient vector is a nonparametric estimand defined in the setting where there is no censoring. We propose so-called debiased estimators of the aforementioned discrimination measures for this class of scoring rules. The proposed estimators make efficient use of the data and minimize bias by allowing for the use of data-adaptive methods for model fitting. Moreover, the estimators do not rely on correct specification of the censoring model to produce consistent estimation. We compare the estimators to existing methods in a simulation study, and we illustrate the method by an application to a brain cancer study.