Concept Retrieval - What and How?

πŸ“… 2025-10-08
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This paper introduces the novel task of *concept retrieval*: given a query image, retrieve other images that share a deep, narrative-level conceptual coreβ€”not merely visual or semantic similarity. Methodologically, we propose a bimodal Gaussian distribution model to characterize neighborhood structure in embedding space; leverage concept consistency metrics to identify latent, mutually exclusive concepts shared across distinct neighborhoods; and integrate automated evaluation with human annotation for multidimensional validation. Experiments across multiple benchmark datasets demonstrate that our approach significantly outperforms state-of-the-art image retrieval and clustering methods in qualitative analysis, quantitative metrics, and human evaluation. To our knowledge, this is the first work to formally define the concept retrieval task, establish its theoretical foundations, and provide a reproducible, multi-faceted evaluation framework.

Technology Category

Application Category

πŸ“ Abstract
A concept may reflect either a concrete or abstract idea. Given an input image, this paper seeks to retrieve other images that share its central concepts, capturing aspects of the underlying narrative. This goes beyond conventional retrieval or clustering methods, which emphasize visual or semantic similarity. We formally define the problem, outline key requirements, and introduce appropriate evaluation metrics. We propose a novel approach grounded in two key observations: (1) While each neighbor in the embedding space typically shares at least one concept with the query, not all neighbors necessarily share the same concept with one another. (2) Modeling this neighborhood with a bimodal Gaussian distribution uncovers meaningful structure that facilitates concept identification. Qualitative, quantitative, and human evaluations confirm the effectiveness of our approach. See the package on PyPI: https://pypi.org/project/coret/
Problem

Research questions and friction points this paper is trying to address.

Retrieving images sharing central concepts beyond visual similarity
Modeling neighborhood structure with bimodal Gaussian distributions
Developing evaluation metrics for concept-based image retrieval
Innovation

Methods, ideas, or system contributions that make the work stand out.

Retrieves images sharing central concepts beyond visual similarity
Models neighborhood with bimodal Gaussian distribution for structure
Uses embedding space neighbors to identify shared concepts
πŸ”Ž Similar Papers