Graphics Group @ ISU

Casting Multiple Shadows- High-Dimensional Interactive Data Visualisation With Tours and Embeddings

Thu Oct 15, 2020 by Stuart Lee in R packages, decompositions dimension reduction, high-dimensional data, tourr

There has been a rapid uptake in the use of non-linear dimensionality reduction (NLDR) methods such as t-distributed stochastic neighbour embedding (t-SNE) in the natural sciences as part of cluster orientation and dimension reduction workflows. The appropriate use of these methods is made difficult by their complex parameterisations and the multitude of decisions required to balance the preservation of local and global structure in the resulting visualisation. We present visual diagnostics for the pragmatic usage of NLDR methods by combining them with a technique called the tour. Read more →

A slice tour for finding hollowness in high-dimensional data

Thu Mar 12, 2020 by Di Cook, Monash University in R packages, decompositions dimension reduction, high-dimensional data, interactive, tourr

Taking projections of high-dimensional data is a common analytical and visualisation technique in statistics for working with high-dimensional problems. Sectioning, or slicing, through high dimensions is less common, but can be useful for visualising data with concavities, or non-linear structure. It is associated with conditional distributions in statistics, and also linked brushing between plots in interactive data visualisation. This talk will describe the simple approach for slicing in the orthogonal space of projections obtained when running a tour, thus presenting the viewer with an interpolated sequence of sliced projections. Read more →

Modern Dimension Reduction and Visualization Techniques using UMAP

Thu Nov 14, 2019 by Eric Hare and Lawrence Mosley in machine learning high-dimensional data, dimension-reduction

One of our fundamental tasks as data scientists, especially given our focus on statistical graphics, is to take a potentially large and messy dataset, and extract meaningful relationships and patterns from it. One such approach to this is dimension reduction, the task of reducing the number of variables in a dataset to a much smaller number that still captures the structure of the original data well. A commonly used technique for dimension reduction is PCA, or Principal Component Analysis, where transformations of the variables are made in order to extract a set of uncorrelated principal components from the data. Read more →