Online Price Competition under Generalized Linear Demands

📅 2025-11-13

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This paper studies decentralized online price competition among multiple sellers under generalized linear (single-index) demand: each seller observes only its own demand and competitors’ prices, receives binary or real-valued demand feedback, and operates without coordinated exploration. We propose PML-GLUCB, a novel algorithm integrating penalized maximum likelihood estimation with a generalized upper-confidence-bound pricing rule—achieving the first fully distributed learning guarantee for nonlinear single-index demand models. To accommodate the multi-agent competitive structure, we refine the elliptical potential lemma. Theoretically, PML-GLUCB attains an $O(N^2 sqrt{T} log T)$ regret bound against a dynamic benchmark, matching the optimal rate up to logarithmic factors in the linear-demand setting. This significantly extends the applicability and practicality of existing approaches to broader, more realistic demand specifications.

Technology Category

Application Category

📝 Abstract

We study sequential price competition among $N$ sellers, each influenced by the pricing decisions of their rivals. Specifically, the demand function for each seller $i$ follows the single index model $lambda_i(mathbf{p}) = mu_i(langle oldsymbol{ heta}_{i,0}, mathbf{p} angle)$, with known increasing link $mu_i$ and unknown parameter $oldsymbol{ heta}_{i,0}$, where the vector $mathbf{p}$ denotes the vector of prices offered by all the sellers simultaneously at a given instant. Each seller observes only their own realized demand -- unobservable to competitors -- and the prices set by rivals. Our framework generalizes existing approaches that focus solely on linear demand models. We propose a novel decentralized policy, PML-GLUCB, that combines penalized MLE with an upper-confidence pricing rule, removing the need for coordinated exploration phases across sellers -- which is integral to previous linear models -- and accommodating both binary and real-valued demand observations. Relative to a dynamic benchmark policy, each seller achieves $O(N^{2}sqrt{T}log(T))$ regret, which essentially matches the optimal rate known in the linear setting. A significant technical contribution of our work is the development of a variant of the elliptical potential lemma -- typically applied in single-agent systems -- adapted to our competitive multi-agent environment.

Problem

Research questions and friction points this paper is trying to address.

Studying sequential price competition among multiple sellers with interdependent demands

Developing decentralized pricing policies without coordinated exploration phases

Generalizing linear demand models to accommodate broader demand functions

Innovation

Methods, ideas, or system contributions that make the work stand out.

Decentralized policy combining penalized MLE

Upper-confidence pricing without coordinated exploration

Elliptical potential lemma for competitive multi-agent systems

🔎 Similar Papers

No similar papers found.

Authors to Follow