Carousel: A High-Resolution Dataset for Multi-Target Automatic Image Cropping

📅 2025-11-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the need for generating multiple distinct aesthetic cropping regions from multi-subject images in social media, this paper initiates the first systematic study of multi-objective automatic cropping. We propose a segmentation-based preprocessing framework that integrates existing single-objective cropping models with an adaptive tiling strategy, optimized and evaluated using human-annotated multi-region aesthetic scores. We construct and publicly release MOCrop—the first high-quality multi-cropping dataset—comprising 277 high-resolution images with dense, expert-annotated aesthetic scores across multiple candidate cropping regions. Extensive experiments reveal significant performance bottlenecks of state-of-the-art single-cropping models in multi-objective scenarios, establishing MOCrop as a benchmarking platform for multi-cropping research. Our work introduces a novel evaluation paradigm grounded in region-level aesthetic scoring and provides reproducible data, code, and baselines to advance the field.

Technology Category

Application Category

📝 Abstract
Automatic image cropping is a method for maximizing the human-perceived quality of cropped regions in photographs. Although several works have proposed techniques for producing singular crops, little work has addressed the problem of producing multiple, distinct crops with aesthetic appeal. In this paper, we motivate the problem with a discussion on modern social media applications, introduce a dataset of 277 relevant images and human labels, and evaluate the efficacy of several single-crop models with an image partitioning algorithm as a pre-processing step. The dataset is available at https://github.com/RafeLoya/carousel.
Problem

Research questions and friction points this paper is trying to address.

Producing multiple distinct aesthetic crops from images
Addressing limitations of single-crop automatic cropping techniques
Developing dataset and method for multi-target image cropping
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-target automatic image cropping technique
Dataset with 277 images and human labels
Image partitioning algorithm as preprocessing step
R
Rafe Loya
Baylor University
A
Andrew Hamara
Baylor University
B
Benjamin Estell
Baylor University
B
Benjamin Kilpatrick
Baylor University
Andrew C. Freeman
Andrew C. Freeman
Baylor University
Multimediavideo streamingcomputer visionneuromorphic systems