Clustering is one of the principal tools used by data analysts for uncovering the structure present within a data set. Hierarchical clustering is particularly popular since it can reveal multiple scales of groupings at once without forcing the data analyst to commit to a certain number of clusterings. However, hierarchical clustering’s usefulness as a visualization tool is severely degraded by increasing data set sizes. We present an interactive tool that overcomes this difficulty, making hierarchical clustering useful for exploring data sets at scales of interest.
Read more →