Fine-Grained Customized Fashion Design with Image-into-Prompt benchmark and dataset from LMM

📅 2025-09-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenge that end users’ lack of fashion expertise leads to ambiguous text prompts and hinders fine-grained garment customization, this paper proposes a conversational image-to-prompt design framework based on large multimodal models (LMMs). Our method introduces the BUG workflow and establishes FashionEdit—the first dataset explicitly aligned with real-world fashion design processes—enabling end-to-end automation from sketch/reference-image interaction, semantically precise prompt generation, to controllable image editing. By integrating multi-granularity control mechanisms with iterative dialog-based feedback, our approach significantly improves alignment between generated outputs and user intent. Experiments on FashionEdit demonstrate superior performance over baselines across three key dimensions: generation similarity, user satisfaction, and design quality. To foster reproducibility and industrial adoption, we publicly release both the source code and the FashionEdit dataset.

Technology Category

Application Category

📝 Abstract
Generative AI evolves the execution of complex workflows in industry, where the large multimodal model empowers fashion design in the garment industry. Current generation AI models magically transform brainstorming into fancy designs easily, but the fine-grained customization still suffers from text uncertainty without professional background knowledge from end-users. Thus, we propose the Better Understanding Generation (BUG) workflow with LMM to automatically create and fine-grain customize the cloth designs from chat with image-into-prompt. Our framework unleashes users' creative potential beyond words and also lowers the barriers of clothing design/editing without further human involvement. To prove the effectiveness of our model, we propose a new FashionEdit dataset that simulates the real-world clothing design workflow, evaluated from generation similarity, user satisfaction, and quality. The code and dataset: https://github.com/detectiveli/FashionEdit.
Problem

Research questions and friction points this paper is trying to address.

Addresses fine-grained customization in AI fashion design
Overcomes text uncertainty without professional user knowledge
Automates clothing design customization using image-to-prompt workflow
Innovation

Methods, ideas, or system contributions that make the work stand out.

LMM-based workflow for fine-grained customization
Image-into-prompt technique for design generation
Automated fashion design without human involvement
🔎 Similar Papers
No similar papers found.
H
Hui Li
The Hong Kong Polytechnic University, China
Y
Yi You
The Hong Kong Polytechnic University, China
Q
Qiqi Chen
The Hong Kong Polytechnic University, China
B
Bingfeng Zhang
China University of Petroleum (East China), China
George Q. Huang
George Q. Huang
The Hong Kong Polytechnic University
Smart ManufacturingIndustrial EngineeringIndustry 4.0Digital TwinPhysical Internet