Multi-Agent Synergy-Driven Iterative Visual Narrative Synthesis

📅 2025-07-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing automated media presentation generation methods often suffer from narrative discontinuity and suboptimal visual layout, failing to meet professional quality standards. To address these issues, we propose RCPS—a reflective, multi-agent framework integrating deep structured narrative planning, adaptive layout generation, and an iterative optimization loop. Furthermore, we introduce PREVAL, a preference-based evaluation system that jointly optimizes content consistency, coherence, and visual design across multiple dimensions. Experimental results demonstrate that RCPS-generated presentations significantly outperform baseline approaches across all quantitative metrics, achieving overall quality comparable to human experts. Crucially, PREVAL’s assessments exhibit strong agreement with human judgments (Spearman’s ρ > 0.92), validating its effectiveness as a reliable, automated quality evaluator.

Technology Category

Application Category

📝 Abstract
Automated generation of high-quality media presentations is challenging, requiring robust content extraction, narrative planning, visual design, and overall quality optimization. Existing methods often produce presentations with logical inconsistencies and suboptimal layouts, thereby struggling to meet professional standards. To address these challenges, we introduce RCPS (Reflective Coherent Presentation Synthesis), a novel framework integrating three key components: (1) Deep Structured Narrative Planning; (2) Adaptive Layout Generation; (3) an Iterative Optimization Loop. Additionally, we propose PREVAL, a preference-based evaluation framework employing rationale-enhanced multi-dimensional models to assess presentation quality across Content, Coherence, and Design. Experimental results demonstrate that RCPS significantly outperforms baseline methods across all quality dimensions, producing presentations that closely approximate human expert standards. PREVAL shows strong correlation with human judgments, validating it as a reliable automated tool for assessing presentation quality.
Problem

Research questions and friction points this paper is trying to address.

Automated high-quality media presentation generation challenges
Logical inconsistencies and suboptimal layouts in presentations
Lack of reliable automated presentation quality assessment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep Structured Narrative Planning
Adaptive Layout Generation
Iterative Optimization Loop
🔎 Similar Papers
No similar papers found.
W
Wang Xi
Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China
Q
Quan Shi
Changzhou University
Tian Yu
Tian Yu
Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS)
Y
Yujie Peng
Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China
J
Jiayi Sun
Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China
M
Mengxing Ren
Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China
Z
Zenghui Ding
Hefei Institutes of Physical Science, Chinese Academy of Sciences
N
Ningguang Yao
Hefei Institutes of Physical Science, Chinese Academy of Sciences; University of Science and Technology of China