COutfitGAN: Learning to Synthesize Compatible Outfits Supervised by Silhouette Masks and Fashion Styles

📅 2025-02-12
🏛️ IEEE transactions on multimedia
📈 Citations: 23
Influential: 0
📄 PDF
🤖 AI Summary
This paper introduces a novel task—fashion outfit generation from an arbitrary number of given clothing items—aiming to synthesize visually realistic and stylistically harmonious complementary garments. Methodologically, it proposes the first generative framework for completing outfits from partial inputs, featuring a pyramid-style extractor to model multi-granularity fashion features, and a dual-discriminator joint optimization scheme: a U-Net-based discriminator assesses image realism, while a relational discriminator models cross-item compatibility. Additionally, contour mask supervision is incorporated to enhance fine-grained structural consistency. Evaluated on a large-scale dataset comprising 200K outfits and 800K individual items, the method achieves significant improvements over state-of-the-art approaches across quantitative metrics—including image fidelity, outfit compatibility, and visual similarity—demonstrating both effectiveness and generalizability.

Technology Category

Application Category

📝 Abstract
How to recommend outfits has gained considerable attention in both academia and industry in recent years. Many studies have been carried out regarding fashion compatibility learning, to determine whether the fashion items in an outfit are compatible or not. These methods mainly focus on evaluating the compatibility of existing outfits and rarely consider applying such knowledge to ‘design’ new fashion items. We propose the new task of generating complementary and compatible fashion items based on an arbitrary number of given fashion items. In particular, given some fashion items that can make up an outfit, the aim of this paper is to synthesize photo-realistic images of other, complementary, fashion items that are compatible with the given ones. To achieve this, we propose an outfit generation framework, referred to as COutfitGAN, which includes a pyramid style extractor, an outfit generator, a UNet-based real/fake discriminator, and a collocation discriminator. To train and evaluate this framework, we collected a large-scale fashion outfit dataset with over 200 K outfits and 800 K fashion items from the Internet. Extensive experiments show that COutfitGAN outperforms other baselines in terms of similarity, authenticity, and compatibility measurements.
Problem

Research questions and friction points this paper is trying to address.

Generate compatible fashion items
Design new complementary outfits
Photo-realistic synthesis using COutfitGAN
Innovation

Methods, ideas, or system contributions that make the work stand out.

Pyramid style extractor used
UNet-based discriminator implemented
Large-scale dataset collected
🔎 Similar Papers
No similar papers found.
D
Dongliang Zhou
Department of Computer Science, Harbin Institute of Technology, Shenzhen, 518055 China
Haijun Zhang
Haijun Zhang
Professor, IEEE Fellow, University of Science and Technology Beijing
6GAI enabled Wireless CommunicationsResource AllocationMobility Management
Q
Qun Li
Department of Computer Science, Harbin Institute of Technology, Shenzhen, 518055 China
J
Jianghong Ma
Department of Computer Science, Harbin Institute of Technology, Shenzhen, 518055 China
Xiaofei Xu
Xiaofei Xu
School of Information Technology, Murdoch University
Reinforcement LearningLLMStorage SystemsRecommender Systems