🤖 AI Summary
This work addresses the limited robustness of existing unsupervised anomaly detection methods when confronted with diverse anomaly types and varying levels of noise. The authors propose a density-driven manifold evolution approach that identifies anomalies through the geometric displacement of samples during an iterative density-enhancement process: normal samples exhibit minimal displacement, whereas anomalies undergo significant shifts as they are attracted toward high-density regions. The method innovatively integrates weighted mean shift, a fuzzy neighborhood graph constructed via UMAP, and adaptive density weighting to formulate a displacement-based cumulative anomaly scoring mechanism. Evaluated on the ADBench benchmark—comprising 46 real-world datasets, four anomaly generation protocols, and six noise levels—the approach consistently outperforms 13 baseline methods in AUC-ROC, AUC-PR, and Precision@n, demonstrating robust and balanced detection performance.
📝 Abstract
Unsupervised anomaly detection stands as an important problem in machine learning, with applications in financial fraud prevention, network security and medical diagnostics. Existing unsupervised anomaly detection algorithms rarely perform well across different anomaly types, often excelling only under specific structural assumptions. This lack of robustness also becomes particularly evident under noisy settings. We propose Mean Shift Density Enhancement (MSDE), a fully unsupervised framework that detects anomalies through their geometric response to density-driven manifold evolution. MSDE is based on the principle that normal samples, being well supported by local density, remain stable under iterative density enhancement, whereas anomalous samples undergo large cumulative displacements as they are attracted toward nearby density modes. To operationalize this idea, MSDE employs a weighted mean-shift procedure with adaptive, sample-specific density weights derived from a UMAP-based fuzzy neighborhood graph. Anomaly scores are defined by the total displacement accumulated across a small number of mean-shift iterations. We evaluate MSDE on the ADBench benchmark, comprising forty six real-world tabular datasets, four realistic anomaly generation mechanisms, and six noise levels. Compared to 13 established unsupervised baselines, MSDE achieves consistently strong, balanced and robust performance for AUC-ROC, AUC-PR, and Precision@n, at several noise levels and on average over several types of anomalies. These results demonstrate that displacement-based scoring provides a robust alternative to the existing state-of-the-art for unsupervised anomaly detection.