PHEATPRUNER: Interpretable Data-centric Feature Selection for Multivariate Time Series Classification through Persistent Homology

📅 2025-04-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing the challenge of balancing performance and interpretability in multivariate time series classification, this paper proposes a data-driven topological feature selection method. Our approach innovatively integrates persistent homology with sheaf theory to construct an unsupervised, bias-free pruning framework—requiring neither labeled supervision nor posterior probabilities—to automatically reduce high-dimensional variables and generate structured interpretability vectors. Evaluated on the UEA benchmark and a real-world bovine mastitis dataset, our method achieves an average pruning rate of 45%, consistently improves (or at least maintains) classification accuracy, and incurs no additional computational overhead. It is compatible with mainstream classifiers—including Random Forest and XGBoost—demonstrating strong generalizability, intrinsic interpretability, and practical deployability.

Technology Category

Application Category

📝 Abstract
Balancing performance and interpretability in multivariate time series classification is a significant challenge due to data complexity and high dimensionality. This paper introduces PHeatPruner, a method integrating persistent homology and sheaf theory to address these challenges. Persistent homology facilitates the pruning of up to 45% of the applied variables while maintaining or enhancing the accuracy of models such as Random Forest, CatBoost, XGBoost, and LightGBM, all without depending on posterior probabilities or supervised optimization algorithms. Concurrently, sheaf theory contributes explanatory vectors that provide deeper insights into the data's structural nuances. The approach was validated using the UEA Archive and a mastitis detection dataset for dairy cows. The results demonstrate that PHeatPruner effectively preserves model accuracy. Furthermore, our results highlight PHeatPruner's key features, i.e. simplifying complex data and offering actionable insights without increasing processing time or complexity. This method bridges the gap between complexity reduction and interpretability, suggesting promising applications in various fields.
Problem

Research questions and friction points this paper is trying to address.

Balancing performance and interpretability in multivariate time series classification
Pruning high-dimensional data without sacrificing model accuracy
Providing interpretable insights into complex data structure
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses persistent homology for feature selection
Integrates sheaf theory for explanatory insights
Maintains accuracy while pruning 45% variables
🔎 Similar Papers
No similar papers found.
Anh-Duy Pham
Anh-Duy Pham
Julius-Maximilians-Universität Würzburg
Operations ResearchGraph LearningAgent-based Modelling
O
O. Kashongwe
Osnabrück University, Joint Lab Artificial Intelligence & Data Science, Osnabrück, Germany; Leibniz Institute for Agricultural Engineering and Bioeconomy (ATB), Department of Sensors and Modelling, Potsdam, Germany
M
M. Atzmueller
Osnabrück University, Semantic Information Systems Group, Osnabrück, Germany; German Research Center for Artificial Intelligence (DFKI), Research Department Plan-Based Robot Control, Osnabrück, Germany
T
Tim Romer
Osnabrück University, Institute of Mathematics, Osnabrück, Germany