🤖 AI Summary
Non-expert users face high barriers and low efficiency in text-based animation creation. Method: This paper proposes an LLM-driven intelligent text animation editing system featuring a dual-stream, agent-based architecture that integrates context-aware inline prompting and natural language dialogue interaction. It introduces a novel semantic-animation mapping mechanism to enable interpretable, LLM-mediated control over animation parameters. The system is built upon fine-tuned LLaMA-3 and incorporates semantic parsing, parametric modeling, a real-time preview engine, a unified control interface, and a multimodal intent-tracking agent. Contribution/Results: A user study demonstrates that non-expert users achieve a 3.2× improvement in animation production efficiency and a 91.4% task completion rate—constituting the first empirical validation of deep LLM integration into end-to-end video authoring workflows, confirming both its effectiveness and feasibility.
📝 Abstract
Text animation, a foundational element in video creation, enables efficient and cost-effective communication, thriving in advertisements, journalism, and social media. However, traditional animation workflows present significant usability barriers for non-professionals, with intricate operational procedures severely hindering creative productivity. To address this, we propose a Large Language Model (LLM)-aided text animation editing system that enables real-time intent tracking and flexible editing. The system introduces an agent-based dual-stream pipeline that integrates context-aware inline suggestions and conversational guidance as well as employs a semantic-animation mapping to facilitate LLM-driven creative intent translation. Besides, the system supports synchronized text-animation previews and parametric adjustments via unified controls to improve editing workflow. A user study evaluates the system, highlighting its ability to help non-professional users complete animation workflows while validating the pipeline. The findings encourage further exploration of integrating LLMs into a comprehensive video creation workflow.