Exhausting the type I error level in event-driven group-sequential designs with a closed testing procedure for progression-free and overall survival

📅 2025-12-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In oncology clinical trials, overall survival (OS) and progression-free survival (PFS) are frequently designated as co-primary endpoints; however, conventional Bonferroni correction ignores their correlation, resulting in substantial loss of statistical power. This paper proposes an event-driven sequential closed testing procedure: leveraging the asymptotic joint distribution of log-rank statistics, it models the cross-time-point covariance structure between PFS and OS, and dynamically adjusts significance thresholds at interim and final analyses to fully exhaust the type I error rate. The method rigorously controls the family-wise error rate (FWER) and demonstrates robust performance under moderate-to-large sample sizes. Compared with Bonferroni correction, it improves OS testing power by approximately 67%, reduces required OS events by ~5%, substantially enhances both disjunctive and conjunctive power, and increases the probability of early trial termination.

Technology Category

Application Category

📝 Abstract
In oncological clinical trials, overall survival (OS) is the gold-standard endpoint, but long follow-up and treatment switching can delay or dilute detectable effects. Progression-free survival (PFS) often provides earlier evidence and is therefore frequently used together with OS as multiple primary endpoints. Since in certain scenarios trial success may be defined if one of the two hypotheses involved can be rejected, a correction for multiple testing may be deemed necessary. Because PFS and OS are generally highly dependent, their test statistics are typically correlated. Ignoring this dependency (e.g. via a simple Bonferroni correction) is not power optimal. We develop a group-sequential testing procedure for the multiple primary endpoints PFS and OS that fully exhausts the family-wise error rate (FWER) by exploiting their dependence. Specifically, we characterize the joint asymptotic distribution of log-rank statistics across endpoints and multiple event-driven analysis cutoffs. Furthermore, we show that we can consistently estimate the covariance structure. Embedding these results in a closed testing procedure, we can recalculate critical values of the test statistics in order to spend the available type I error optimally. An important extension to the current literature is that we allow for both interim and final analysis to be event-driven. Simulations based on illness-death multi-state models empirically confirm FWER control for moderate to large sample sizes. Compared with a simple Bonferroni correction, the proposed methods recover roughly two thirds of the power loss for OS, increase disjunctive and conjunctive power, and enable meaningful early stopping. In planning, these gains translate into about 5% fewer OS events required to reach the targeted power. We also discuss practical issues in the implementation of such designs and possible extensions of the introduced method.
Problem

Research questions and friction points this paper is trying to address.

Develops group-sequential testing for PFS and OS endpoints
Exploits dependency to exhaust family-wise error rate optimally
Enables event-driven interim and final analyses to improve power
Innovation

Methods, ideas, or system contributions that make the work stand out.

Closed testing procedure exploiting endpoint dependence
Event-driven group-sequential design with joint asymptotic distribution
Covariance estimation for optimal type I error spending
🔎 Similar Papers
No similar papers found.
Moritz Fabian Danzer
Moritz Fabian Danzer
Research Associate, Institute of Biostatistics and Clinical Research, University of Münster
BiostatisticsSurvival AnalysisAdaptive Designs
Kaspar Rufibach
Kaspar Rufibach
Merck KGaA
BiostatisticsClinical trial design
J
Jan Beyersmann
Institute of Statistics, University of Ulm, Germany
R
René Schmidt
Institute of Biostatistics and Clinical Research, University of Münster, Germany