Cross-Scenario Unified Modeling of User Interests at Billion Scale

📅 2025-10-16

📈 Citations: 0

✨ Influential: 0

career value

163K/year

🤖 AI Summary

Traditional recommender systems optimize for single scenarios, neglecting cross-scenario behavioral synergy and struggling to integrate large language models (LLMs) at billion-scale, leading to fragmented user interest modeling. To address this, we propose RED-Rec—a novel framework that pioneers deep LLM integration into industrial-scale recommendation systems. It introduces a dual-tower LLM architecture coupled with a scenario-aware dense hybrid query mechanism, enabling hierarchical fusion of multi-scenario behavioral sequences (e.g., search, feed, content discovery) and fine-grained interest modeling. An efficient online serving engine supports low-latency real-time inference. Extensive A/B tests across hundreds of millions of users demonstrate significant improvements in core recommendation and advertising metrics. Furthermore, we release RED-MMU—a million-scale, multi-scenario sequential dataset—to foster collaborative advancement in both academia and industry.

Technology Category

Application Category

📝 Abstract

User interests on content platforms are inherently diverse, manifesting through complex behavioral patterns across heterogeneous scenarios such as search, feed browsing, and content discovery. Traditional recommendation systems typically prioritize business metric optimization within isolated specific scenarios, neglecting cross-scenario behavioral signals and struggling to integrate advanced techniques like LLMs at billion-scale deployments, which finally limits their ability to capture holistic user interests across platform touchpoints. We propose RED-Rec, an LLM-enhanced hierarchical Recommender Engine for Diversified scenarios, tailored for industry-level content recommendation systems. RED-Rec unifies user interest representations across multiple behavioral contexts by aggregating and synthesizing actions from varied scenarios, resulting in comprehensive item and user modeling. At its core, a two-tower LLM-powered framework enables nuanced, multifaceted representations with deployment efficiency, and a scenario-aware dense mixing and querying policy effectively fuses diverse behavioral signals to capture cross-scenario user intent patterns and express fine-grained, context-specific intents during serving. We validate RED-Rec through online A/B testing on hundreds of millions of users in RedNote through online A/B testing, showing substantial performance gains in both content recommendation and advertisement targeting tasks. We further introduce a million-scale sequential recommendation dataset, RED-MMU, for comprehensive offline training and evaluation. Our work advances unified user modeling, unlocking deeper personalization and fostering more meaningful user engagement in large-scale UGC platforms.

Problem

Research questions and friction points this paper is trying to address.

Modeling diverse user interests across multiple scenarios

Integrating LLMs efficiently in billion-scale recommendation systems

Unifying behavioral signals from heterogeneous platform interactions

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-enhanced hierarchical recommender for diverse scenarios

Two-tower LLM framework enables efficient multifaceted representations

Scenario-aware policy fuses behavioral signals across contexts

🔎 Similar Papers

No similar papers found.