Lesson 4: Fundamentals of Visual Analytics
Lesson Outline
- Visual Analytics for Knowledge Discovery
- Visual Analytics Approach for Statistical Testing
- Visual Analytics for Building Better Models
- Visualising Uncertainty
- Why Visualising Uncertainty?
- Basic Statistical Concepts Related to Uncertainty
- Univariate Graphical Methods for Visualising Uncertainty
- Error bars
- Confidence strips
- Ridge plot
- Bivariate Graphical Methods for Visualising Uncertainty
- Funnel plot
- Variation and Its Discontents
Lesson Slides and Hands-on Notes
Handout of Hands-on Exercise 4
Readings on Visualising Uncertainty
- Chapter 9: Visualizing many distributions at once and Chapter and Chapter 16: Visualizing uncertainty in Claus O. Wilke (2019) Fundamentals of Data Visualisation.
- Why It’s So Hard for Us to Visualize Uncertainty
- Visualizing the Uncertainty in Data
- Uncertainty + Visualization, Explained
- Uncertainty + Visualization, Explained (Part 2: Continuous Encodings)
- Michael Fernandes et. al. (2018) “Uncertainty Displays Using Quantile Dotplots or CDFs Improve Transit Decision-Making”, ACM Human Factors in Computing Systems (CHI) 2018.
Error Plots
Ridgeline Plot
- 9.2 Visualizing distributions along the horizontal axis of Wilke, C. O.(2019) Fundamentals of data visualization: a primer on making informative and compelling figures. O’Reilly Media. On-line version.
Raincloud Plot
Micah Allen et. al. (2021) “Raincloud plots: a multi-platform tool for robust data visualization” [version 2; peer review: 2 approved]
Funnel Plot
- Variation and Its Discontents: Funnel Plots for Fair Comparisons
- What are the chances of successful fertility treatment?
- Three-fold variation in UK bowel cancer death rates(?)
- Using funnel plots in public health surveillance
- Graph Makeover: Where same-sex couples live in the US
- Using maps and funnel plots to explore variation in place of death from cancer within London, 2002–2007
All About Tableau
Visualising Uncertainty
All about R
ggstatsplot: An extension of ggplot2 package for creating statistical graphics with details from statistical tests.
ggdist: An R package that provides a flexible set of ggplot2 geoms and stats designed especially for visualising distributions and uncertainty. Also watch the video entitle Visualizing distributions and uncertainty using ggdist.
ggridges: An ggplot2 extension specially designed for plotting ridgeline plots.
ungeviz: Provides functions for visualizing uncertainty with ggplot2. It is particularly focused on hypothetical outcome plots (HOPs) and provides bootstrapping and sampling functionality that integrates well with the ggplot2 API.
see: An R package from the easystats family of packages that produces visualizations for a wide variety of models and statistical analyses in a way that is tightly linked with the model fitting process and requires minimal interruption of users’ workflow.
performance: An R package from easystats family of packages that provides utilities for computing indices of model quality and goodness of fit including provides many functions to check model assumptions visually.
infer: An R package specially designed to perform statistical inference using an expressive statistical grammar that coheres with the tidyverse design framework. The library also includes functions for visualising the distribution of the simulation-based inferential statistics or the theoretical distribution (or both).