🤖 AI Summary
This paper addresses the inherent uncertainty in cluster membership assignments during clustering. To tackle this, it proposes a credal clustering framework grounded in Dempster–Shafer evidence theory. As a key contribution, the authors develop evclust—an open-source Python library that systematically integrates six efficient evidence-based clustering algorithms for uncertainty-aware clustering on arbitrary-dimensional data, producing credal partitions that explicitly model membership ambiguity. The library provides a comprehensive toolchain—including visualization, evaluation, and analysis modules—specifically designed for credal partitions, and is deeply integrated with the NumPy/SciPy/Matplotlib ecosystem. Extensive experiments on synthetic and real-world datasets demonstrate that the proposed approach achieves both strong interpretability and robustness, significantly enhancing the practicality and reproducibility of uncertainty-aware clustering.
📝 Abstract
A recent developing trend in clustering is the advancement of algorithms that not only identify clusters within data, but also express and capture the uncertainty of cluster membership. Evidential clustering addresses this by using the Dempster-Shafer theory of belief functions, a framework designed to manage and represent uncertainty. This approach results in a credal partition, a structured set of mass functions that quantify the uncertain assignment of each object to potential groups. The Python framework evclust, presented in this paper, offers a suite of efficient evidence clustering algorithms as well as tools for visualizing, evaluating and analyzing credal partitions.