Learning a Distance for the Clustering of Patients with Amyotrophic Lateral Sclerosis

📅 2025-11-03
🏛️ Symposium on Advances in Databases and Information Systems
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
ALS presents challenges including small and heterogeneous patient populations, sparse longitudinal clinical data, and lack of consensus on clinical subtyping. To address these, this paper proposes a weakly supervised distance learning framework that integrates medical prior knowledge to learn personalized distance functions tailored for disease progression—specifically, declarative functional scoring. The method jointly models multivariate longitudinal clinical biomarkers with domain-specific constraints, enabling compatibility with standard clustering algorithms. Applied to a cohort of 353 ALS patients, it yields clinically interpretable patient subgroups. Experimental results demonstrate superior discriminative performance in survival prediction (significantly improved C-index) compared to existing clustering approaches, while maintaining high clustering purity. Critically, the identified subgroups exhibit clear clinical relevance—e.g., distinct progression trajectories and symptom profiles—thereby establishing an interpretable, empirically verifiable paradigm for precision subtyping and individualized intervention in ALS.

Technology Category

Application Category

📝 Abstract
Amyotrophic lateral sclerosis (ALS) is a severe disease with a typical survival of 3-5 years after symptom onset. Current treatments offer only limited life extension, and the variability in patient responses highlights the need for personalized care. However, research is hindered by small, heterogeneous cohorts, sparse longitudinal data, and the lack of a clear definition for clinically meaningful patient clusters. Existing clustering methods remain limited in both scope and number. To address this, we propose a clustering approach that groups sequences using a disease progression declarative score. Our approach integrates medical expertise through multiple descriptive variables, investigating several distance measures combining such variables, both by reusing off-the-shelf distances and employing a weak-supervised learning method. We pair these distances with clustering methods and benchmark them against state-of-the-art techniques. The evaluation of our approach on a dataset of 353 ALS patients from the University Hospital of Tours, shows that our method outperforms state-of-the-art methods in survival analysis while achieving comparable silhouette scores. In addition, the learned distances enhance the relevance and interpretability of results for medical experts.
Problem

Research questions and friction points this paper is trying to address.

Developing distance metrics for clustering ALS patient progression data
Addressing heterogeneity in small patient cohorts with sparse longitudinal data
Improving clinical relevance and interpretability of ALS patient stratification
Innovation

Methods, ideas, or system contributions that make the work stand out.

Learning distance measures for patient clustering
Combining multiple descriptive clinical variables
Using weak-supervised learning with medical expertise
🔎 Similar Papers
No similar papers found.
G
Guillaume Tejedor
Laboratoire LIFAT, Université de Tours
Verónika Peralta
Verónika Peralta
Associate Professor, Université de Tours
data qualitydatabasesinformation systemsbusiness intelligencepersonalization
Nicolas Labroche
Nicolas Labroche
Professor at Université de Tours (France)
exploratory data analysisexplainable AIclusteringfuzzy logic
Patrick Marcel
Patrick Marcel
Professor, University of Orléans, France
databasesdata warehousing and OLAPdata miningpersonalizationrecommender systems
H
Hélène Blasco
Service de Biochimie et Biologie Moléculaire CHRU de Tours, UMR U1253 iBrain, Université de Tours, Inserm
H
Hugo Alarcan
Service de Biochimie et Biologie Moléculaire CHRU de Tours, UMR U1253 iBrain, Université de Tours, Inserm