v‐plots: Designing Hybrid Charts for the Comparative Analysis of Data Distributions

Comparing data distributions is a core focus in descriptive statistics, and part of most data analysis processes across disciplines. In particular, comparing distributions entails numerous tasks, ranging from identifying global distribution properties, comparing aggregated statistics (e.g., mean val...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computer graphics forum Jg. 39; H. 3; S. 565 - 577
Hauptverfasser: Blumenschein, Michael, Debbeler, Luka J., Lages, Nadine C., Renner, Britta, Keim, Daniel A., El‐Assady, Mennatallah
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Oxford Blackwell Publishing Ltd 01.06.2020
Schlagworte:
ISSN:0167-7055, 1467-8659
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Comparing data distributions is a core focus in descriptive statistics, and part of most data analysis processes across disciplines. In particular, comparing distributions entails numerous tasks, ranging from identifying global distribution properties, comparing aggregated statistics (e.g., mean values), to the local inspection of single cases. While various specialized visualizations have been proposed (e.g., box plots, histograms, or violin plots), they are not usually designed to support more than a few tasks, unless they are combined. In this paper, we present the v‐plot designer; a technique for authoring custom hybrid charts, combining mirrored bar charts, difference encodings, and violin‐style plots. v‐plots are customizable and enable the simultaneous comparison of data distributions on global, local, and aggregation levels. Our system design is grounded in an expert survey that compares and evaluates 20 common visualization techniques to derive guidelines for the task‐driven selection of appropriate visualizations. This knowledge externalization step allowed us to develop a guiding wizard that can tailor v‐plots to individual tasks and particular distribution properties. Finally, we confirm the usefulness of our system design and the user‐guiding process by measuring the fitness for purpose and applicability in a second study with four domain and statistic experts.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0167-7055
1467-8659
DOI:10.1111/cgf.14002