DiffVolume: Diffusion Models for Volume Generation in Limit Order Books

📅 2025-08-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses the challenge of jointly modeling strong temporal dependencies and dynamic liquidity couplings in high-dimensional limit-order-book (LOB) volume snapshot generation. We propose DiffVolume, a conditional diffusion model that leverages historical volume sequences and explicit liquidity features—such as bid-ask spread and order-book depth—as conditioning inputs to jointly capture temporal evolution and cross-level liquidity dependencies, enabling controllable counterfactual generation. Compared to existing approaches, DiffVolume achieves significant improvements in statistical distribution fidelity, counterfactual plausibility, and downstream liquidity prediction performance. Empirical evaluation across multiple market datasets shows an average reduction of 12.7%–23.4% in prediction error. By unifying temporal dynamics and multi-level liquidity interactions within a generative framework, DiffVolume establishes a novel paradigm for market microstructure modeling and synthetic LOB data generation.

Technology Category

Application Category

📝 Abstract
Modeling limit order books (LOBs) dynamics is a fundamental problem in market microstructure research. In particular, generating high-dimensional volume snapshots with strong temporal and liquidity-dependent patterns remains a challenging task, despite recent work exploring the application of Generative Adversarial Networks to LOBs. In this work, we propose a conditional extbf{Diff}usion model for the generation of future LOB extbf{Volume} snapshots ( extbf{DiffVolume}). We evaluate our model across three axes: (1) extit{Realism}, where we show that DiffVolume, conditioned on past volume history and time of day, better reproduces statistical properties such as marginal distribution, spatial correlation, and autocorrelation decay; (2) extit{Counterfactual generation}, allowing for controllable generation under hypothetical liquidity scenarios by additionally conditioning on a target future liquidity profile; and (3) extit{Downstream prediction}, where we show that the synthetic counterfactual data from our model improves the performance of future liquidity forecasting models. Together, these results suggest that DiffVolume provides a powerful and flexible framework for realistic and controllable LOB volume generation.
Problem

Research questions and friction points this paper is trying to address.

Modeling high-dimensional LOB volume snapshots with temporal patterns
Generating realistic LOB data with statistical properties
Enabling controllable counterfactual generation for liquidity scenarios
Innovation

Methods, ideas, or system contributions that make the work stand out.

Conditional diffusion model for LOB volume generation
Controllable generation under liquidity scenarios
Improves liquidity forecasting with synthetic data
🔎 Similar Papers
No similar papers found.