🤖 AI Summary
To address the need for generating multiple distinct aesthetic cropping regions from multi-subject images in social media, this paper initiates the first systematic study of multi-objective automatic cropping. We propose a segmentation-based preprocessing framework that integrates existing single-objective cropping models with an adaptive tiling strategy, optimized and evaluated using human-annotated multi-region aesthetic scores. We construct and publicly release MOCrop—the first high-quality multi-cropping dataset—comprising 277 high-resolution images with dense, expert-annotated aesthetic scores across multiple candidate cropping regions. Extensive experiments reveal significant performance bottlenecks of state-of-the-art single-cropping models in multi-objective scenarios, establishing MOCrop as a benchmarking platform for multi-cropping research. Our work introduces a novel evaluation paradigm grounded in region-level aesthetic scoring and provides reproducible data, code, and baselines to advance the field.
📝 Abstract
Automatic image cropping is a method for maximizing the human-perceived quality of cropped regions in photographs. Although several works have proposed techniques for producing singular crops, little work has addressed the problem of producing multiple, distinct crops with aesthetic appeal. In this paper, we motivate the problem with a discussion on modern social media applications, introduce a dataset of 277 relevant images and human labels, and evaluate the efficacy of several single-crop models with an image partitioning algorithm as a pre-processing step. The dataset is available at https://github.com/RafeLoya/carousel.