SciPostLayoutTree: A Dataset for Structural Analysis of Scientific Posters

📅 2025-11-23

📈 Citations: 0

✨ Influential: 0

career value

181K/year

🤖 AI Summary

Scientific poster layout analysis has long been hindered by the scarcity of annotated datasets and dedicated models. To address this, we propose the first structured analysis framework for scientific posters. We introduce SciPostLayoutTree, a large-scale dataset comprising 8,000 posters meticulously annotated with reading order and hierarchical parent–child relationships. We further design Layout Tree Decoder, a Transformer-based model that jointly encodes visual features, bounding box coordinates, and semantic class labels to capture both spatial and semantic dependencies; it employs beam search to optimize sequential decoding of tree-structured layouts. Experiments demonstrate that our method significantly outperforms existing baselines in predicting complex spatial relationships, establishing a robust benchmark for poster content understanding. All components—including the dataset, model, and code—are publicly released.

Technology Category

Application Category

📝 Abstract

Scientific posters play a vital role in academic communication by presenting ideas through visual summaries. Analyzing reading order and parent-child relations of posters is essential for building structure-aware interfaces that facilitate clear and accurate understanding of research content. Despite their prevalence in academic communication, posters remain underexplored in structural analysis research, which has primarily focused on papers. To address this gap, we constructed SciPostLayoutTree, a dataset of approximately 8,000 posters annotated with reading order and parent-child relations. Compared to an existing structural analysis dataset, SciPostLayoutTree contains more instances of spatially challenging relations, including upward, horizontal, and long-distance relations. As a solution to these challenges, we develop Layout Tree Decoder, which incorporates visual features as well as bounding box features including position and category information. The model also uses beam search to predict relations while capturing sequence-level plausibility. Experimental results demonstrate that our model improves the prediction accuracy for spatially challenging relations and establishes a solid baseline for poster structure analysis. The dataset is publicly available at https://huggingface.co/datasets/omron-sinicx/scipostlayouttree. The code is also publicly available at https://github.com/omron-sinicx/scipostlayouttree.

Problem

Research questions and friction points this paper is trying to address.

Analyzes reading order and parent-child relations in scientific posters

Addresses underexplored structural analysis of posters compared to papers

Improves prediction accuracy for spatially challenging layout relations

Innovation

Methods, ideas, or system contributions that make the work stand out.

Dataset with 8,000 annotated scientific posters

Layout Tree Decoder using visual and bounding box features

Beam search for predicting plausible reading orders

🔎 Similar Papers

No similar papers found.