Online Prediction of Stochastic Sequences with High Probability Regret Bounds

📅 2026-02-18
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the problem of universal online prediction for stochastic sequences, aiming to establish asymptotically vanishing regret bounds that hold with high probability over a finite time horizon. By integrating martingale inequalities, information-theoretic tools, and generalization bound techniques, the work presents the first high-probability regret upper bound for stochastic processes over countable alphabets. The result guarantees a convergence rate of $O(\sqrt{T^{-1}\log(1/\delta)})$ with probability at least $1-\delta$, formally matching the best-known expected regret bounds. Moreover, the analysis demonstrates that, without additional assumptions, the dependence on the failure probability $\delta$ is information-theoretically optimal, thereby establishing a fundamental lower bound on the achievable exponent in the confidence term.

Technology Category

Application Category

📝 Abstract
We revisit the classical problem of universal prediction of stochastic sequences with a finite time horizon $T$ known to the learner. The question we investigate is whether it is possible to derive vanishing regret bounds that hold with high probability, complementing existing bounds from the literature that hold in expectation. We propose such high-probability bounds which have a very similar form as the prior expectation bounds. For the case of universal prediction of a stochastic process over a countable alphabet, our bound states a convergence rate of $\mathcal{O}(T^{-1/2} δ^{-1/2})$ with probability as least $1-δ$ compared to prior known in-expectation bounds of the order $\mathcal{O}(T^{-1/2})$. We also propose an impossibility result which proves that it is not possible to improve the exponent of $δ$ in a bound of the same form without making additional assumptions.
Problem

Research questions and friction points this paper is trying to address.

online prediction
stochastic sequences
high probability regret bounds
universal prediction
finite time horizon
Innovation

Methods, ideas, or system contributions that make the work stand out.

high-probability regret bounds
universal prediction
stochastic sequences
convergence rate
impossibility result
🔎 Similar Papers
No similar papers found.
M
Matthias Frey
Department of Electrical and Electronic Engineering, The University of Melbourne
J
Jonathan H. Manton
Department of Electrical and Electronic Engineering, The University of Melbourne
Jingge Zhu
Jingge Zhu
University of Melbourne
Information TheoryCommunication SystemsStatistical Learning Theory