Beyond Matching: Category-Guided Latent Intent Reasoning for Generative Retrieval in E-Commerce

📅 2026-06-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the semantic gap between short user queries and product semantic identifiers (SIDs) in e-commerce search, along with stringent latency constraints. To this end, the authors propose CaLIR, a framework that performs coarse-to-fine implicit intent reasoning guided by category hierarchies, learning continuous latent states to align user shopping intent with SIDs without relying on explicit chain-of-thought reasoning that incurs high latency. CaLIR innovatively integrates query-aware intent path modeling, dynamic prefix Trie-constrained decoding, and hierarchical semantic reasoning, achieving substantial retrieval gains without compromising efficiency. Experiments demonstrate that CaLIR strikes a superior balance between retrieval effectiveness and inference efficiency across multilingual e-commerce datasets, while exhibiting strong transferability and robustness across diverse category taxonomies and generative backbone models.
📝 Abstract
Generative retrieval offers a new paradigm for e-commerce search by mapping user queries directly to product Semantic Identifiers (SIDs). However, e-commerce queries are often short, noisy, attribute-heavy, and associated with multiple category-consistent products, creating a substantial representation gap between natural-language shopping intent and artificially constructed item SIDs. Explicit Chain-of-Thought (CoT) reasoning can help bridge this gap, but its extra generation cost is difficult to reconcile with the low-latency requirements of online e-commerce systems. To address this challenge, we propose CaLIR (Category-guided Latent Intent Reasoning), a category-guided latent intent reasoning framework for e-commerce generative retrieval. Rather than generating explicit textual rationales, CaLIR learns continuous latent intent states before SID decoding and uses product category hierarchies as a natural scaffold for coarse-to-fine intent reasoning. Specifically, we introduce hierarchical semantic reasoning to align latent states with category-level shopping intent, and query-wise reasoning enhancement to model diverse intent paths under multi-positive queries. CaLIR further combines a query-specific dynamic prefix trie, assembled from pre-indexed category-level tries, with reasoning-aware constrained decoding. Experiments on multilingual e-commerce search datasets show that CaLIR achieves a better balance between retrieval effectiveness and inference efficiency than existing methods, while also demonstrating transferability and robustness across induced hierarchies and different generative backbones.
Problem

Research questions and friction points this paper is trying to address.

generative retrieval
e-commerce search
semantic identifier
intent reasoning
representation gap
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generative Retrieval
Latent Intent Reasoning
Category Hierarchy
Constrained Decoding
E-commerce Search
🔎 Similar Papers
No similar papers found.
F
Fuwei Zhang
Institute of Artificial Intelligence, Beihang University, China and Meituan, China
Xiaoyu Liu
Xiaoyu Liu
University of Science and Technology of China
Model CompressionCVNLP
Jiajie Jin
Jiajie Jin
Renmin University of China
Information RetrievalLarge Language Models
J
Jiale Mao
Institute of Artificial Intelligence, Beihang University, China and Meituan, China
W
Wei Chen
Institute of Artificial Intelligence, Beihang University, China
D
Dongbo Xi
Meituan, China
Y
Yifan Yang
Meituan, China
Peng Yan
Peng Yan
Research Assistant of ZHAW, PhD student of UZH
Deep LearningTransfer LearningIntelligent Algorithm
Z
Zichao Hao
Beijing Information Science and Technology University, China
Z
Zhao Zhang
SKLCCSE, School of Computer Science and Engineering, Beihang University, China
F
Fuzhen Zhuang
Institute of Artificial Intelligence, Beihang University, China