AesRec: A Dataset for Aesthetics-Aligned Clothing Outfit Recommendation

📅 2026-02-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limitation of existing fashion recommendation methods, which often lack explicit aesthetic modeling and struggle to balance personalization with aesthetic guidance. To bridge this gap, the authors introduce AesRec, a novel dataset featuring a multidimensional aesthetic annotation framework grounded in professional fashion standards, encompassing both individual garments and full outfits. Leveraging vision-language models, the framework enables large-scale, scalable aesthetic scoring, complemented by human validation to ensure alignment between human and machine judgments. Experimental results demonstrate that integrating this quantified aesthetic representation into recommendation models significantly enhances system performance in aligning with both user preferences and aesthetic principles.

Technology Category

Application Category

📝 Abstract
Clothing recommendation extends beyond merely generating personalized outfits; it serves as a crucial medium for aesthetic guidance. However, existing methods predominantly rely on user-item-outfit interaction behaviors while overlooking explicit representations of clothing aesthetics. To bridge this gap, we present the AesRec benchmark dataset featuring systematic quantitative aesthetic annotations, thereby enabling the development of aesthetics-aligned recommendation systems. Grounded in professional apparel quality standards and fashion aesthetic principles, we define a multidimensional set of indicators. At the item level, six dimensions are independently assessed: silhouette, chromaticity, materiality, craftsmanship, wearability, and item-level impression. Transitioning to the outfit level, the evaluation retains the first five core attributes while introducing stylistic synergy, visual harmony, and outfit-level impression as distinct metrics to capture the collective aesthetic impact. Given the increasing human-like proficiency of Vision-Language Models in multimodal understanding and interaction, we leverage them for large-scale aesthetic scoring. We conduct rigorous human-machine consistency validation on a fashion dataset, confirming the reliability of the generated ratings. Experimental results based on AesRec further demonstrate that integrating quantified aesthetic information into clothing recommendation models can provide aesthetic guidance for users while fulfilling their personalized requirements.
Problem

Research questions and friction points this paper is trying to address.

clothing recommendation
aesthetics
outfit recommendation
fashion aesthetics
aesthetic alignment
Innovation

Methods, ideas, or system contributions that make the work stand out.

aesthetic-aligned recommendation
multidimensional aesthetic annotation
vision-language models
outfit recommendation
human-machine consistency
🔎 Similar Papers
No similar papers found.
W
Wenxin Ye
Wuhan University of Technology
Lin Li
Lin Li
School of Mathematics and Statistics, Chongqing Technology and Business University
Nonlinear AnalysisPartial Differential EquationsVariational MethodsCritical Point TheoryPDEs
M
Ming Li
York University
Y
Yang Shen
Wuhan University of Technology
K
Kanghong Wang
Wuhan University of Technology
J
Jimmy Xiangji Huang
York University