DIAMOND: An LLM-Driven Agent for Context-Aware Baseball Highlight Summarization

📅 2025-06-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Traditional baseball highlight detection methods—such as Win Probability Added (WPA) ranking or vision-based event detection—struggle to jointly capture strategic depth and narrative coherence, while manual annotation remains costly and unscalable. This paper proposes an LLM-driven, context-aware summarization agent that introduces a modular architecture coupling interpretable advanced statistics (WPA, Leverage Index, Win Expectancy) with natural language inference to jointly model importance and storytelling. The method integrates multi-dimensional sports context, optimized prompt engineering, and an interpretable decision-making mechanism. Evaluated on five KBO games, it achieves an F1-score of 84.8%, outperforming the WPA baseline by 41.9 percentage points and significantly surpassing existing statistical and commercial systems.

Technology Category

Application Category

📝 Abstract
Traditional approaches -- such as Win Probability Added (WPA)-based ranking or computer vision-driven event detection -- can identify scoring plays but often miss strategic depth, momentum shifts, and storyline progression. Manual curation remains the gold standard but is resource-intensive and not scalable. We introduce DIAMOND, an LLM-driven agent for context-aware baseball highlight summarization that integrates structured sports analytics with natural language reasoning. DIAMOND leverages sabermetric features -- Win Expectancy, WPA, and Leverage Index -- to quantify play importance, while an LLM module enhances selection based on contextual narrative value. This hybrid approach ensures both quantitative rigor and qualitative richness, surpassing the limitations of purely statistical or vision-based systems. Evaluated on five diverse Korean Baseball Organization League games, DIAMOND improves F1-score from 42.9% (WPA-only) to 84.8%, outperforming both commercial and statistical baselines. Though limited in scale, our results highlight the potential of modular, interpretable agent-based frameworks for event-level summarization in sports and beyond.
Problem

Research questions and friction points this paper is trying to address.

Identifying strategic depth and momentum shifts in baseball highlights
Overcoming resource-intensive manual curation for scalable summarization
Integrating analytics with narrative context for richer highlight selection
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-driven agent for context-aware summarization
Integrates sports analytics with natural language reasoning
Hybrid approach combining quantitative and qualitative analysis
🔎 Similar Papers
No similar papers found.