QARM V2: Quantitative Alignment Multi-Modal Recommendation for Reasoning User Sequence Modeling

📅 2026-02-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional recommender systems relying on ID embeddings suffer from low information density, knowledge isolation, and limited generalization. Conversely, the semantic representations of large language models (LLMs) often misalign with recommendation objectives and cannot be optimized end-to-end. To address these limitations, this work proposes QARM V2, a novel framework that achieves, for the first time, quantitative alignment between LLM-derived semantic representations and recommendation-specific business goals while enabling end-to-end joint training. QARM V2 integrates LLM embeddings, a multimodal alignment mechanism, user sequence modeling, and GSU/ESU architectures, augmented with a learnable quantitative alignment module. This design substantially enhances information density and model generalization, yielding more accurate personalized recommendations in industrial-scale scenarios.

Technology Category

Application Category

📝 Abstract
With the evolution of large language models (LLMs), there is growing interest in leveraging their rich semantic understanding to enhance industrial recommendation systems (RecSys). Traditional RecSys relies on ID-based embeddings for user sequence modeling in the General Search Unit (GSU) and Exact Search Unit (ESU) paradigm, which suffers from low information density, knowledge isolation, and weak generalization ability. While LLMs offer complementary strengths with dense semantic representations and strong generalization, directly applying LLM embeddings to RecSys faces critical challenges: representation unmatch with business objectives and representation unlearning end-to-end with downstream tasks. In this paper, we present QARM V2, a unified framework that bridges LLM semantic understanding with RecSys business requirements for user sequence modeling.
Problem

Research questions and friction points this paper is trying to address.

recommendation systems
user sequence modeling
large language models
semantic representation
representation alignment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Quantitative Alignment
Multi-Modal Recommendation
LLM-based RecSys
User Sequence Modeling
Semantic Representation
🔎 Similar Papers
No similar papers found.
T
Tian Xia
Kuaishou Technology, Beijing, China
J
Jiaqi Zhang
Kuaishou Technology, Beijing, China
Y
Yueyang Liu
Kuaishou Technology, Beijing, China
Hongjian Dou
Hongjian Dou
Alibaba
Recommender System
T
Tingya Yin
Kuaishou Technology, Beijing, China
Jiangxia Cao
Jiangxia Cao
Kuaishou Tech
RecSysLow-Resource Large Model
X
Xulei Liang
Kuaishou Technology, Beijing, China
T
Tianlu Xie
Kuaishou Technology, Beijing, China
Lihao Liu
Lihao Liu
Amazon
LLM-based AgentHealthcare AI
X
Xiang Chen
Kuaishou Technology, Beijing, China
S
Shen Wang
Kuaishou Technology, Beijing, China
C
Changxin Lao
Kuaishou Technology, Beijing, China
H
Haixiang Gan
Kuaishou Technology, Beijing, China
J
Jinkai Yu
Kuaishou Technology, Beijing, China
Keting Cen
Keting Cen
Unknown affiliation
L
Lu Hao
Kuaishou Technology, Beijing, China
Xu Zhang
Xu Zhang
University of Science and Technology of China
Clinical NLPMedical Imaging
Q
Qiqiang Zhong
Kuaishou Technology, Beijing, China
Z
Zhongbo Sun
Kuaishou Technology, Beijing, China
Yiyu Wang
Yiyu Wang
Alibaba International
LLMAgentComputer VisionImage Captioning
Shuang Yang
Shuang Yang
East China University of Science & Technology
solar cellssemiconductor devicessolid-state chemistry
M
Mingxin Wen
Kuaishou Technology, Beijing, China
X
Xiangyu Wu
Kuaishou Technology, Beijing, China
Shaoguo Liu
Shaoguo Liu
Alibaba Corporation
Maching LearningComputer Vision
T
Tingting Gao
Kuaishou Technology, Beijing, China