evclust: Python library for evidential clustering

📅 2025-02-10

📈 Citations: 0

✨ Influential: 0

career value

213K/year

🤖 AI Summary

This paper addresses the inherent uncertainty in cluster membership assignments during clustering. To tackle this, it proposes a credal clustering framework grounded in Dempster–Shafer evidence theory. As a key contribution, the authors develop evclust—an open-source Python library that systematically integrates six efficient evidence-based clustering algorithms for uncertainty-aware clustering on arbitrary-dimensional data, producing credal partitions that explicitly model membership ambiguity. The library provides a comprehensive toolchain—including visualization, evaluation, and analysis modules—specifically designed for credal partitions, and is deeply integrated with the NumPy/SciPy/Matplotlib ecosystem. Extensive experiments on synthetic and real-world datasets demonstrate that the proposed approach achieves both strong interpretability and robustness, significantly enhancing the practicality and reproducibility of uncertainty-aware clustering.

Technology Category

Application Category

📝 Abstract

A recent developing trend in clustering is the advancement of algorithms that not only identify clusters within data, but also express and capture the uncertainty of cluster membership. Evidential clustering addresses this by using the Dempster-Shafer theory of belief functions, a framework designed to manage and represent uncertainty. This approach results in a credal partition, a structured set of mass functions that quantify the uncertain assignment of each object to potential groups. The Python framework evclust, presented in this paper, offers a suite of efficient evidence clustering algorithms as well as tools for visualizing, evaluating and analyzing credal partitions.

Problem

Research questions and friction points this paper is trying to address.

Develops evidential clustering algorithms

Manages uncertainty in cluster membership

Provides tools for analyzing credal partitions

Innovation

Methods, ideas, or system contributions that make the work stand out.

Python library evclust

Dempster-Shafer theory

Credal partition analysis

🔎 Similar Papers

Interpretable Clustering: A Survey