Skip to main navigation Skip to search Skip to main content

Cluster analysis of categorical variables of parkinson’s disease patients

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Parkinson’s disease (PD) is a chronic disease. No treatment stops its progression, and it presents symptoms in multiple areas. One way to understand the PD population is to investigate the clustering of patients by demographic and clinical similarities. Previous PD cluster studies included scores from clinical surveys, which provide a numerical but ordinal, non-linear value. In addition, these studies did not include categorical variables, as the clustering method utilized was not applicable to categorical variables. It was discovered that the numerical values of patient age and disease duration were similar among past cluster results, pointing to the need to exclude these values. This paper proposes a novel and automatic discovery method to cluster PD patients by incorporating categorical variables. No estimate of the number of clusters is required as input, whereas the previous cluster methods require a guess from the end user in order for the method to be initiated. Using a patient dataset from the Parkinson’s Progression Markers Initiative (PPMI) website to demonstrate the new clustering technique, our results showed that this method provided an accurate separation of the patients. In addition, this method provides an explainable process and an easy way to interpret clusters and describe patient subtypes.

Original languageEnglish
Article number1290
JournalBrain Sciences
Volume11
Issue number10
DOIs
StatePublished - Oct 2021

Keywords

  • Cluster analysis
  • Parkinson’s disease
  • Patient subtypes

Fingerprint

Dive into the research topics of 'Cluster analysis of categorical variables of parkinson’s disease patients'. Together they form a unique fingerprint.

Cite this