Poisson-MNL Bandit: Nearly Optimal Dynamic Joint Assortment and Pricing with Decision-Dependent Customer Arrivals

📅 2026-02-18
📈 Citations: 0
Influential: 0
🤖 AI Summary
This study addresses a key limitation of existing dynamic joint assortment and pricing methods: they typically assume a fixed customer arrival rate and ignore how assortment and pricing decisions affect that rate, which can lead to suboptimal revenue. To overcome this, the paper introduces the Poisson-MNL model, which couples a contextual multinomial logit (MNL) choice model with a Poisson arrival process whose rate depends on the offered assortment and prices. Working in a bandit framework, the authors propose an efficient UCB-type algorithm, PMNL. Theoretical analysis establishes a non-asymptotic regret bound of order √(T log T) together with a matching lower bound up to a log T factor, so the algorithm is nearly minimax optimal. Simulations show that the proposed approach outperforms benchmark methods that assume a fixed arrival rate.

📝 Abstract
We study dynamic joint assortment and pricing where a seller updates decisions at regular accounting/operating intervals to maximize the cumulative per-period revenue over a horizon $T$. In many settings, assortment and prices affect not only what an arriving customer buys but also how many customers arrive within the period, whereas classical multinomial logit (MNL) models treat arrivals as fixed, potentially leading to suboptimal decisions. We propose a Poisson-MNL model that couples a contextual MNL choice model with a Poisson arrival model whose rate depends on the offered assortment and prices. Building on this model, we develop an efficient algorithm, PMNL, based on the idea of upper confidence bound (UCB). We establish its (near) optimality by proving a non-asymptotic regret bound of order $\sqrt{T\log{T}}$ and a matching lower bound (up to $\log T$). Simulation studies underscore the importance of accounting for the dependency of arrival rates on assortment and pricing: PMNL effectively learns customer choice and arrival models and provides joint assortment-pricing decisions that outperform others that assume fixed arrival rates.
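To illustrate why decision-dependent arrivals matter, the following is a minimal Python sketch, not the paper's PMNL algorithm. It assumes known parameters, a linear utility `quality - beta * price`, and a log-linear Poisson rate that falls with the average offered price. These functional forms, and every name in the code (`arrival_rate`, `price_sensitivity`, and so on), are hypothetical choices made for the example and are not taken from the paper. The sketch relies only on the fact that, with Poisson arrivals and independent customer choices, expected per-period revenue equals the arrival rate times the expected revenue per customer.

```python
import math


def mnl_choice_probs(utilities):
    """MNL purchase probabilities; the no-purchase option has utility 0."""
    weights = [math.exp(u) for u in utilities]
    denom = 1.0 + sum(weights)
    return [w / denom for w in weights]


def arrival_rate(prices, base_log_rate, price_sensitivity):
    """Hypothetical log-linear Poisson rate (an assumption, not the paper's form)."""
    avg_price = sum(prices) / len(prices)
    return math.exp(base_log_rate - price_sensitivity * avg_price)


def expected_period_revenue(prices, qualities, beta, base_log_rate, price_sensitivity):
    """Arrival rate times expected revenue per arriving customer."""
    utilities = [q - beta * p for q, p in zip(qualities, prices)]
    probs = mnl_choice_probs(utilities)
    per_customer = sum(p * pr for p, pr in zip(prices, probs))
    return arrival_rate(prices, base_log_rate, price_sensitivity) * per_customer


def best_price(price_grid, price_sensitivity):
    """Grid search over the price of a single offered product."""
    return max(
        price_grid,
        key=lambda p: expected_period_revenue(
            [p], qualities=[1.0], beta=1.0,
            base_log_rate=3.0, price_sensitivity=price_sensitivity,
        ),
    )


price_grid = [k / 10 for k in range(1, 31)]

# price_sensitivity = 0 reproduces the classical fixed-arrival assumption.
best_fixed = best_price(price_grid, price_sensitivity=0.0)
# A positive sensitivity means higher prices also deter arrivals.
best_dependent = best_price(price_grid, price_sensitivity=0.5)

print("best price, fixed arrivals:", best_fixed)
print("best price, price-dependent arrivals:", best_dependent)
```

Under these assumptions the revenue-maximizing price is lower once arrivals respond to price, which is the kind of gap a fixed-arrival model would miss. In the paper's setting the parameters are unknown and must be learned online, which this sketch does not attempt.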
Problem

Research questions and friction points this paper is trying to address.

dynamic assortment and pricing
customer arrivals
multinomial logit model
decision-dependent arrivals
revenue maximization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Poisson-MNL
dynamic assortment and pricing
decision-dependent arrivals
upper confidence bound
regret bound
Junhui Cai
Department of Information Technology, Analytics, and Operations, University of Notre Dame
Ran Chen
Department of Statistics and Data Science, Washington University in St. Louis
Qitao Huang
Department of Mathematics, Tsinghua University
Linda Zhao
University of Pennsylvania
Post-selection inference, Empirical Bayes, Network analysis, Equity network, Education in data science
Wu Zhu
Department of Finance, Tsinghua University