Swap Regret and Correlated Equilibria Beyond Normal-Form Games

📅 2025-02-27

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

Existing notions of regret fail to uniformly characterize learning algorithms’ resilience against strategic manipulation in generalized games, including Bayesian and extensive-form games. Method: We introduce *profile swap regret*, a novel regret measure grounded in polyhedral game modeling, integrating online convex optimization and approximation algorithm design. Contribution/Results: We establish, for the first time, a necessary and sufficient condition linking sublinear profile swap regret to non-manipulability. We prove that the induced equilibrium set is strictly contained in the set of correlated equilibria—surpassing the limitations of normal-form game analysis. Furthermore, we devise the first efficient algorithm achieving an $O(sqrt{T})$ regret bound and rigorously characterize the boundary of its convergent equilibrium structure, thereby resolving an open problem posed by Mansour et al. (2022).

Technology Category

Application Category

📝 Abstract

Swap regret is a notion that has proven itself to be central to the study of general-sum normal-form games, with swap-regret minimization leading to convergence to the set of correlated equilibria and guaranteeing non-manipulability against a self-interested opponent. However, the situation for more general classes of games -- such as Bayesian games and extensive-form games -- is less clear-cut, with multiple candidate definitions for swap-regret but no known efficiently minimizable variant of swap regret that implies analogous non-manipulability guarantees. In this paper, we present a new variant of swap regret for polytope games that we call ``profile swap regret'', with the property that obtaining sublinear profile swap regret is both necessary and sufficient for any learning algorithm to be non-manipulable by an opponent (resolving an open problem of Mansour et al., 2022). Although we show profile swap regret is NP-hard to compute given a transcript of play, we show it is nonetheless possible to design efficient learning algorithms that guarantee at most $O(sqrt{T})$ profile swap regret. Finally, we explore the correlated equilibrium notion induced by low-profile-swap-regret play, and demonstrate a gap between the set of outcomes that can be implemented by this learning process and the set of outcomes that can be implemented by a third-party mediator (in contrast to the situation in normal-form games).

Problem

Research questions and friction points this paper is trying to address.

Defines profile swap regret for polytope games

Ensures non-manipulability in learning algorithms

Explores correlated equilibrium in non-normal-form games

Innovation

Methods, ideas, or system contributions that make the work stand out.

Introduces profile swap regret

Guarantees O(sqrt{T}) regret

Explores correlated equilibrium gap

🔎 Similar Papers

No similar papers found.

Authors to Follow