Is Sliding Window All You Need? An Open Framework for Long-Sequence Recommendation

πŸ“… 2026-04-14
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF

career value

207K/year
πŸ€– AI Summary
This work addresses the challenge of deploying long-sequence recommendation models under memory and latency constraints in resource-limited settings. The authors propose an end-to-end open-source framework that leverages a sliding window mechanism for efficient training and introduces a k-shift embedding layer, enabling support for million-scale vocabularies on commodity GPUs with negligible accuracy degradation. For the first time, they conduct a systematic, runtime-aware ablation study of windowing strategies, thoroughly evaluating the trade-offs between recommendation accuracy and computational overhead. Experiments on datasets such as RetailRocket demonstrate substantial performance gainsβ€”up to +6.04% in MRR and +6.34% in Recall@10β€”with only approximately 4Γ— increased training cost, allowing stable deployment on university-scale GPU clusters.

Technology Category

Application Category

πŸ“ Abstract
Long interaction histories are central to modern recommender systems, yet training with long sequences is often dismissed as impractical under realistic memory and latency budgets. This work demonstrates that it is not only practical but also effective-at academic scale. We release a complete, end-to-end framework that implements industrial-style long-sequence training with sliding windows, including all data processing, training, and evaluation scripts. Beyond reproducing prior gains, we contribute two capabilities missing from earlier reports: (i) a runtime-aware ablation study that quantifies the accuracy-compute frontier across windowing regimes and strides, and (ii) a novel k-shift embedding layer that enables million-scale vocabularies on commodity GPUs with negligible accuracy loss. Our implementation trains reliably on modest university clusters while delivering competitive retrieval quality (e.g., up to +6.04% MRR and +6.34% Recall@10 on Retailrocket) with $\sim 4 \times $ training-time overheads. By packaging a robust pipeline, reporting training time costs, and introducing an embedding mechanism tailored for low-resource settings, we transform long-sequence training from a closed, industrial technique into a practical, open, and extensible methodology for the community.
Problem

Research questions and friction points this paper is trying to address.

long-sequence recommendation
sliding window
training efficiency
memory constraints
recommendation systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

sliding window
long-sequence recommendation
k-shift embedding
runtime-aware ablation
open framework
πŸ”Ž Similar Papers
2024-02-02ACM Transactions on Recommender SystemsCitations: 1