π€ AI Summary
Existing approaches struggle to predict the animal shapes humans perceive in clouds due to pareidolia. This work proposes a novel paradigm that leverages diffusion models to simulate and guide such perceptual experiences by generating animal forms visually consistent with original cloud images and synthesizing morphing videos that dynamically transition from cloud to animal. By integrating cloud image segmentation, diffusion-based generation, and morphing visualization, the method not only accurately predicts the animal categories likely to be perceived by humans but also significantly enhances observersβ recognition performance. The approach offers a new framework for understanding and augmenting pareidolic perception through dynamic visual cues.
π Abstract
People often see animal shapes in clouds, a phenomenon known as pareidolia. We propose an AI-based method that aims to predict which animals people are likely to perceive in clouds, even though state-of-the-art recognition methods typically fail to detect such animals. Additionally, we introduce a method to assist individuals in perceiving specific pareidolic animals, even if they did not recognize them initially. Our approach uses a diffusion model to transform cloud segments into an animal shape that visually resemble the original cloud. This diffusion technique is inspired by the observation that the diffusion process succeeds only when the target animal resembles the shape of the cloud, and that subtle visual hints often suffice to help individuals recognize specific pareidolic animals. A generated image, successfully derived from the diffusion model, is then used to predict the pareidolic animal. Additionally, a short morphing video transitioning from the generated image back to the original cloud segment is employed to further enhance the human's perception of the pareidolic animals.