Stable diffusion models reveal a persisting human and AI gap in visual creativity

📅 2025-11-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates the performance gap in visual creativity between humans and generative AI (Stable Diffusion), and how human guidance modulates AI output quality. Method: We conducted a controlled experiment with three human groups—professional artists, non-artist adults—and two AI prompting conditions (“human-inspired” vs. “self-guided”)—evaluated by 255 human raters and GPT-4o across multiple creativity dimensions. Contribution/Results: Visual creativity exhibits a significant hierarchical gradient: artists > non-artists > human-inspired AI > self-guided AI. Human guidance substantially enhances AI-generated outputs, elevating them to near non-artist human levels. Critically, this work provides the first empirical evidence that human perceptual granularity and contextual sensitivity remain irreplaceable in visual ideation, while exposing fundamental limitations in large language models’ cross-modal transfer capability—particularly in grounding linguistic prompts in visual semantics. These findings establish both theoretical foundations and practical frameworks for human-AI co-creative systems.

Technology Category

Application Category

📝 Abstract
While recent research suggests Large Language Models match human creative performance in divergent thinking tasks, visual creativity remains underexplored. This study compared image generation in human participants (Visual Artists and Non Artists) and using an image generation AI model (two prompting conditions with varying human input: high for Human Inspired, low for Self Guided). Human raters (N=255) and GPT4o evaluated the creativity of the resulting images. We found a clear creativity gradient, with Visual Artists being the most creative, followed by Non Artists, then Human Inspired generative AI, and finally Self Guided generative AI. Increased human guidance strongly improved GenAI's creative output, bringing its productions close to those of Non Artists. Notably, human and AI raters also showed vastly different creativity judgment patterns. These results suggest that, in contrast to language centered tasks, GenAI models may face unique challenges in visual domains, where creativity depends on perceptual nuance and contextual sensitivity, distinctly human capacities that may not be readily transferable from language models.
Problem

Research questions and friction points this paper is trying to address.

AI struggles with visual creativity compared to humans
Human guidance improves AI's visual creative output
Human and AI raters differ in creativity judgment patterns
Innovation

Methods, ideas, or system contributions that make the work stand out.

Used human-inspired and self-guided AI prompting techniques
Employed human and GPT4o raters for creativity evaluation
Revealed creativity gap between human artists and AI models
🔎 Similar Papers
No similar papers found.
S
Silvia Rondini
Cognition and Brain Plasticity Unit, Bellvitge Biomedical Research Institute, L'Hospitalet de Llobregat, Spain
C
Claudia Alvarez-Martin
Department of Cognition, Development and Educational Psychology, University of Barcelona, Barcelona, Spain
P
Paula Angermair-Barkai
Vienna Cognitive Science Hub, Vienna, Austria
O
Olivier Penacchio
Computer Science Department, Universitat Autonoma de Barcelona, Bellaterra, Spain
M
M. Paz
Bridging AI and Neuroscience, Computer Vision Center, Bellaterra, Spain
M
Matthew Pelowski
Faculty of Psychology, University of Vienna, Vienna, Austria
Dan Dediu
Dan Dediu
ICREA & Department of Catalan Philology and General Linguistics, University of Barcelona, Spain
Languagegeneticsstatisticscomputer modelsadherence to medication
Antoni Rodriguez-Fornells
Antoni Rodriguez-Fornells
ICREA - University of Barcelona
Cognitive NeuroscienceLanguageCognitive Control
X
Xim Cerda-Company
Computer Science Department, Universitat Autonoma de Barcelona, Bellaterra, Spain