In science, data visualization serves two primary purposes: exploring data sets interactively and communicating discoveries. However, the requirements for visualizations employed in these two activities are very different. As a result, the software tools used for them are typically disconnected, creating significant challenges for reproducibility and effective communication of discoveries in data-driven biomedical science. In this presentation, I will address how a new approach to creating data visualization tools can connect data analysts and other stakeholders inside and outside the scientific community. I will introduce and demonstrate the "Vistories" approach that was motivated by these questions.
Presented at the 5th Cancer Research UK Big Data Analytics Conference on Data Visualization.
Data Visualization in Biomedical Sciences: More than Meets the Eye
1. Data Visualization in Biomedical Sciences:
More than Meets the Eye
Nils Gehlenborg, PhD
Department of Biomedical Informatics
Harvard Medical School
http://gehlenborglab.org @ngehlenborg
22. Nature asked 1,576 researchers if there
is a reproducibility crisis in science.
M Baker, Nature 533, 452-454, 2016
23. Nature asked 1,576 researchers if there is a reproducibility crisis in science.
Significant crisis (52%)
Slight crisis (38%)
Don’t know (7%)
No crisis (3%)
M Baker, Nature 533, 452-454, 2016
28. Intentional?
Inability to capture everything?
Inability to communicate everything?
SOCIAL ISSUE
TECHNICAL ISSUES
M Baker, Nature 533, 452-454, 2016
30. Tumor Subtypes
PROBLEM 1
Visualize overlap of patient sets across two or more stratifications.
PROBLEM 2
Visualize characteristics of patient sets within a stratification of interest.
33. Tumor Subtypes
PROBLEM 3
Identify relevant stratifications, pathways, and clinical variables.
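As a rough illustration of the data behind PROBLEM 1, the overlap of patient sets across two stratifications can be tabulated as a count for every pair of groups. The sketch below is a minimal example under stated assumptions: the function name `overlap_table`, the patient IDs, and the group labels are all hypothetical and do not come from the talk.

```python
from collections import Counter


def overlap_table(strat_a, strat_b):
    """Count patients shared by each pair of groups from two stratifications.

    Both arguments map patient ID -> group label. Only patients that
    appear in both stratifications are counted.
    """
    shared = strat_a.keys() & strat_b.keys()
    return Counter((strat_a[p], strat_b[p]) for p in shared)


# Hypothetical toy data: IDs and labels are invented for illustration.
mrna_cluster = {"P1": "C1", "P2": "C1", "P3": "C2", "P4": "C2", "P5": "C2"}
mutation_status = {
    "P1": "mutated",
    "P2": "mutated",
    "P3": "wild type",
    "P4": "mutated",
    "P6": "wild type",
}

table = overlap_table(mrna_cluster, mutation_status)
for (cluster, status), count in sorted(table.items()):
    print(cluster, status, count)
```

A table like this is the kind of input a visualization of overlaps would draw from; patients missing from either stratification (P5 and P6 here) are simply left out of the counts.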
34. Is there a mutation that overlaps with this mRNA cluster?
Is there a CNV that affects survival?
Is there a pathway that is enriched in this cluster?
Is there a mutually exclusive mutation?
Query
Stratifications
Clinical Params
Pathways
Guided Exploration
M Streit, A Lex, S Gratzl, C Partl, D Schmalstieg, H Pfister, P Park, N Gehlenborg, Nature Methods (2014)
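A query such as "Is there a mutation that overlaps with this mRNA cluster?" can be read as ranking candidate patient sets by how well they overlap with a chosen set. The sketch below uses the Jaccard index as one plausible overlap score; this is an assumption for illustration and not necessarily the scoring used in the cited work. The function names, gene names, and patient IDs are hypothetical.

```python
def jaccard(a, b):
    """Return the Jaccard index |A & B| / |A | B| of two patient sets."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank_by_overlap(query, candidates):
    """Sort candidate patient sets by Jaccard overlap with the query set."""
    scores = {name: jaccard(query, patients) for name, patients in candidates.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: one mRNA cluster and three mutation-defined sets.
query_cluster = {"P1", "P2", "P3", "P4"}
candidates = {
    "gene_A mutated": {"P1", "P2", "P3"},
    "gene_B mutated": {"P4", "P5"},
    "gene_C mutated": {"P6", "P7"},
}

ranking = rank_by_overlap(query_cluster, candidates)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

The other example queries on the slide (survival, pathway enrichment, mutual exclusivity) would need different scores; only the set-overlap case is sketched here.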