AutoCas: Autoregressive Cascade Predictor in Social Networks via Large Language Models

📅 2025-02-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing the challenges of data sparsity and dynamic heterogeneity in popularity prediction of information cascades within social networks, this paper proposes the first large language model (LLM) transfer framework that models cascade diffusion as an autoregressive sequence. Methodologically, it introduces (1) a structure-aware cascade graph tokenization mechanism that explicitly encodes both topological and temporal diffusion context, and (2) a cascade-specific prompt learning paradigm enabling effective LLM adaptation under few-shot settings—without full-parameter fine-tuning. Evaluated on multiple real-world datasets, the approach significantly outperforms state-of-the-art methods in cascade popularity prediction. It exhibits strong generalization across diverse platforms and domains while inheriting the scalable parameter efficiency inherent to LLMs. The framework thus bridges structural modeling of diffusion processes with the expressive power of foundation models, offering a principled and practical solution for dynamic cascade forecasting.

Technology Category

Application Category

📝 Abstract
Popularity prediction in information cascades plays a crucial role in social computing, with broad applications in viral marketing, misinformation control, and content recommendation. However, information propagation mechanisms, user behavior, and temporal activity patterns exhibit significant diversity, necessitating a foundational model capable of adapting to such variations. At the same time, the amount of available cascade data remains relatively limited compared to the vast datasets used for training large language models (LLMs). Recent studies have demonstrated the feasibility of leveraging LLMs for time-series prediction by exploiting commonalities across different time-series domains. Building on this insight, we introduce the Autoregressive Information Cascade Predictor (AutoCas), an LLM-enhanced model designed specifically for cascade popularity prediction. Unlike natural language sequences, cascade data is characterized by complex local topologies, diffusion contexts, and evolving dynamics, requiring specialized adaptations for effective LLM integration. To address these challenges, we first tokenize cascade data to align it with sequence modeling principles. Next, we reformulate cascade diffusion as an autoregressive modeling task to fully harness the architectural strengths of LLMs. Beyond conventional approaches, we further introduce prompt learning to enhance the synergy between LLMs and cascade prediction. Extensive experiments demonstrate that AutoCas significantly outperforms baseline models in cascade popularity prediction while exhibiting scaling behavior inherited from LLMs. Code is available at this repository: https://anonymous.4open.science/r/AutoCas-85C6
Problem

Research questions and friction points this paper is trying to address.

Predicts popularity in social network cascades.
Adapts LLMs for complex cascade dynamics.
Enhances LLM integration via prompt learning.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Utilizes LLMs for cascade prediction
Tokenizes cascade data for modeling
Introduces prompt learning for synergy
🔎 Similar Papers
No similar papers found.
Yuhao Zheng
Yuhao Zheng
University of Science and Technology of China
Chenghua Gong
Chenghua Gong
University of Science and Technology of China
Graph MiningLarge Language ModelSocial Computing
R
Rui Sun
Hefei University of Technology, Hefei, China
J
Juyuan Zhang
University of Science and Technology of China, Hefei, China
L
Liming Pan
University of Science and Technology of China, Hefei, China
L
Linyuan Lv
University of Science and Technology of China, Hefei, China