Skip to main content

Autoři/Authors

Martin Fleischmann

Kategorie/Category

Odborný článek

Rok publikace/Published

2023

Název publikace/Name

Clustergram: Visualization and diagnostics for cluster analysis

Citace/Citation

FLEISCHMANN, M., (2023): Clustergram: Visualization and diagnostics for cluster analysis. Journal of Open Source Software, 8, 89, 5240.

Abstrakt/Abstract

Given a heterogeneous group of observations, researchers often try to find more homogenous groups within them. Typical is the use of clustering algorithms determining these groups based on statistical similarity. While there is an extensive range of algorithms to be chosen from, they often share one specific limitation - the algorithm itself will not determine the optimal number of clusters a group of observations shall be divided into. The solution is usually depending on internal cluster validity measures, but those provide only limited insight and can result in a suboptimal choice (Gagolewski et al., 2021). This paper presents a Python package named clustergram offering tools to analyze the clustering solutions and visualize the behavior of observations in relation to a tested range of options for the number of classes, enabling a deeper understanding of the behavior of observations splitting into classes and better-informed decisions on the optimal number of classes.

URRlab


Urbánní a regionální laboratoř

Katedra sociální geografie
a regionálního rozvoje

Univerzita Karlova
Přírodovědecká fakulta

Kontakt


Albertov 2038/6
128 43 Praha 2 - Nové Město

Kontaktní osoba
Jiří Nemeškal
jiri.nemeskal@natur.cuni.cz
211 951 972


©  Urbánní a regionální laboratoř | administrace