Research on E-Commerce Long-Tail Product Recommendation Mechanism Based on Large-Scale Language Models

📅 2025-05-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address severe data sparsity and cold-start challenges in long-tail item recommendation within e-commerce scenarios, this paper proposes an LLM-driven framework integrating semantic understanding and user intent modeling. We introduce a Semantic Visor to derive fine-grained semantic representations from item textual descriptions and design an attention-driven user intent encoder to capture long-tail interest patterns. Furthermore, we propose a tri-path fusion ranking mechanism—combining semantic, collaborative, and generative signals—that dynamically weights and integrates heterogeneous information sources. Evaluated on real-world e-commerce datasets, our method achieves significant improvements: +12% in recall, +9% in hit rate, and +15% in user coverage, substantially enhancing long-tail item exposure and conversion. The framework establishes a scalable, semantics-enhanced paradigm for recommender systems operating under extreme sparsity.

Technology Category

Application Category

📝 Abstract
As e-commerce platforms expand their product catalogs, accurately recommending long-tail items becomes increasingly important for enhancing both user experience and platform revenue. A key challenge is the long-tail problem, where extreme data sparsity and cold-start issues limit the performance of traditional recommendation methods. To address this, we propose a novel long-tail product recommendation mechanism that integrates product text descriptions and user behavior sequences using a large-scale language model (LLM). First, we introduce a semantic visor, which leverages a pre-trained LLM to convert multimodal textual content such as product titles, descriptions, and user reviews into meaningful embeddings. These embeddings help represent item-level semantics effectively. We then employ an attention-based user intent encoder that captures users' latent interests, especially toward long-tail items, by modeling collaborative behavior patterns. These components feed into a hybrid ranking model that fuses semantic similarity scores, collaborative filtering outputs, and LLM-generated recommendation candidates. Extensive experiments on a real-world e-commerce dataset show that our method outperforms baseline models in recall (+12%), hit rate (+9%), and user coverage (+15%). These improvements lead to better exposure and purchase rates for long-tail products. Our work highlights the potential of LLMs in interpreting product content and user intent, offering a promising direction for future e-commerce recommendation systems.
Problem

Research questions and friction points this paper is trying to address.

Addressing long-tail product recommendation challenges in e-commerce
Overcoming data sparsity and cold-start issues with LLMs
Improving recall, hit rate, and user coverage for long-tail items
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM converts product text to embeddings
Attention-based encoder captures user intent
Hybrid ranking model combines multiple scores
🔎 Similar Papers
No similar papers found.
Q
Qingyi Lu
Department of Computer Science, Brown University, Providence, USA
H
Haotian Lyu
Viterbi School of Engineering, University of Southern California, Los Angeles, USA
J
Jiayun Zheng
College of Engineering, University of Michigan Ann Arbor, Ann Arbor, USA
Y
Yang Wang
Department of Information and Communication Engineering, Nagoya University, Nagoya, Japan
L
Li Zhang
Amazon, New York, USA
Chengrui Zhou
Chengrui Zhou
Columbia University