SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Downloaden Sie, um offline zu lesen
“The human understanding, on account of its
own nature, readily supposes a greater order
and uniformity in things than it finds. And ...
it devises parallels and correspondences and
relations which are not there.”
—Francis Bacon, 1620
Wednesday, 10 November 2010
“The human understanding, on account of its
own nature, readily supposes a greater order
and uniformity in things than it finds. And ...
it devises parallels and correspondences and
relations which are not there.”
—Francis Bacon, 1620
Is what we see reallythere?
Wednesday, 10 November 2010
October 2010
Hadley Wickham, Dianne Cook,
Heike Hofmann, Andreas Buja
Graphical inference
for infovis
Wednesday, 10 November 2010
Which one of these plots is not like the others?
Which of these plots just doesn’t belong?
Wednesday, 10 November 2010
7 of those plots were plots of random
(null) data. 1 plot was the real data.
If you correctly picked the true
plot from the null plots then we
have evidence that it really is
different.
In fact, we have rigorous statistical
evidence that there is a difference, just
using Sesame Street skills!
Wednesday, 10 November 2010
1. The statistical justice system
2. Line up protocol
3. Rorschach protocol
4. Future work
Wednesday, 10 November 2010
http://www.flickr.com/photos/joegratz/117048243
Hypothesis testing?
Wednesday, 10 November 2010
http://www.flickr.com/photos/joegratz/117048243
The statistical justice system
Hypothesis testing?
Wednesday, 10 November 2010
Ho: null hypothesis
Ha: alternative hypothesis
Defence
Prosecution
Wednesday, 10 November 2010
Ho: null hypothesis
Ha: alternative hypothesis
Defence
Prosecution
Null distribution Innocents
Wednesday, 10 November 2010
Ho: null hypothesis
Ha: alternative hypothesis
Defence
Prosecution
Reject the null
Fail to reject the null
Guilty
Not guilty
Null distribution Innocents
Wednesday, 10 November 2010
Ho: null hypothesis
Ha: alternative hypothesis
Defence
Prosecution
Reject the null
Fail to reject the null
Guilty
Not guilty
Null distribution Innocents
p-value Probability that a truly
innocent dataset
would look as guilty
as the suspect
Wednesday, 10 November 2010
Line up
Wednesday, 10 November 2010
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
Five tag clouds of selected words from the 1st (red) and 6th (blue)
editions of Darwin’s “Origin of Species”. Four of the tag clouds were
generated under the null hypothesis of no difference between editions,
and one is the true data. Can you spot it?
Wednesday, 10 November 2010
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
believe believe
case
caseclosely
closely descendants
descendants few few
long long modified
modified variations
variations very
very view view
Five tag clouds of selected words from the 1st (red) and 6th (blue)
editions of Darwin’s “Origin of Species”. Four of the tag clouds were
generated under the null hypothesis of no difference between editions,
and one is the true data. Can you spot it?
Wednesday, 10 November 2010
Protocol
Generate n-1 decoys
(null datasets)
Plot the decoys + the real data
(randomly positioned)
Show to an impartial observer.
Can they spot the real data?
If so, you have evidence for true difference
(p-value = 1/n)
Wednesday, 10 November 2010
E. L. Scott, C. D. Shane, and M. D. Swanson. Comparison of the synthetic and actual distribution of galaxies on a
photographic plate. Astrophysical Journal, 119:91–112, Jan. 1954.
Wednesday, 10 November 2010
A. M. Noll. Human or machine: A subjective comparison of Piet Mondrian’s “composition with lines” (1917) and a computer-
generated picture. The Psychological Record, 16:1–10, 1966.
Wednesday, 10 November 2010
vs. classical tests
Of course, if we know what we’re looking
for, we can always develop an algorithm
or numerical test.
The advantage of visual inference is that
works for very general tasks, including
when you don’t know exactly what you’re
looking for.
Wednesday, 10 November 2010
ower of the test
!
Power
0.0
0.2
0.4
0.6
0.8
1.0
0.0
0.2
0.4
0.6
0.8
1.0
sigma = 12
!15 !10 !5 0 5 10 15
sigma = 5
!15 !10 !5 0 5 10 15
samplesize=100samplesize=300
power_curve
Theoretical test
Visual test
lower_CL
upper_CL
Recent work shows that power only
a little worse than classical test
Wednesday, 10 November 2010
Plot Task
Choropleth
map
Is there a spatial trend?
Treemap
Is the distribution in higher
level categories the same?
Scatterplot
Are the two variables
independent?
Time series
Is there a trend in mean or
variability?
Wednesday, 10 November 2010
Wednesday, 10 November 2010
Wednesday, 10 November 2010
Wednesday, 10 November 2010
Once we’ve seen the plot,
we’re no longer impartial
Wednesday, 10 November 2010
Code
# Support package written in R
# http://github.com/ggobi/nullabor
# Provides reference implementation of ideas
library(nullabor)
library(ggplot2)
qplot(angle * 180 / pi, r, data = threept) %+%
lineup(null_model(r ~ poly(angle, 2)), n = 10) +
facet_wrap(~ .sample, ncol = 5)
Wednesday, 10 November 2010
Rorschach
Wednesday, 10 November 2010
Rorschach
We’re surprisingly bad at appreciating the
amount of variation in random data.
Showing only null plots is a good way to
calibrate our intuition.
We also plan on using these plots as an
empirical tool to understand what features
people pick up on. Anecdotally,
undergrads focus too much on outliers
Wednesday, 10 November 2010
result
count
0
20
40
60
80
100
0
20
40
60
80
100
0
20
40
60
80
100
1
4
7
0.0 0.2 0.4 0.6 0.8 1.0
2
5
8
0.0 0.2 0.4 0.6 0.8 1.0
3
6
9
0.0 0.2 0.4 0.6 0.8 1.0
Wednesday, 10 November 2010
Future work
Wednesday, 10 November 2010
Future work
How can visual inference be integrated
into visualisation software at a
fundamental level?
How does training impact results? How do
novices vs. experts differ?
What patterns do people pick up on?
What are the alternatives that people
respond to?
Wednesday, 10 November 2010
Questions?
Wednesday, 10 November 2010
Wednesday, 10 November 2010
This work is licensed under the Creative
Commons Attribution-Noncommercial 3.0 United
States License. To view a copy of this license,
visit http://creativecommons.org/licenses/by-nc/
3.0/us/ or send a letter to Creative Commons,
171 Second Street, Suite 300, San Francisco,
California, 94105, USA.
Wednesday, 10 November 2010

Weitere ähnliche Inhalte

Andere mochten auch

Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2Chris Rucker
 
Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)Hadley Wickham
 
R workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 seriesR workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 seriesVivian S. Zhang
 
Machine learning in R
Machine learning in RMachine learning in R
Machine learning in Rapolol92
 
4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply Function4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply FunctionSakthi Dasans
 
Data manipulation with dplyr
Data manipulation with dplyrData manipulation with dplyr
Data manipulation with dplyrRomain Francois
 
Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Ram Narasimhan
 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rVivian S. Zhang
 
Grouping & Summarizing Data in R
Grouping & Summarizing Data in RGrouping & Summarizing Data in R
Grouping & Summarizing Data in RJeffrey Breen
 
Elegant Graphics for Data Analysis with ggplot2
Elegant Graphics for Data Analysis with ggplot2Elegant Graphics for Data Analysis with ggplot2
Elegant Graphics for Data Analysis with ggplot2yannabraham
 

Andere mochten auch (17)

27 development
27 development27 development
27 development
 
16 Sequences
16 Sequences16 Sequences
16 Sequences
 
24 modelling
24 modelling24 modelling
24 modelling
 
Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2Correlations, Trends, and Outliers in ggplot2
Correlations, Trends, and Outliers in ggplot2
 
21 spam
21 spam21 spam
21 spam
 
03 Conditional
03 Conditional03 Conditional
03 Conditional
 
Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)Model Visualisation (with ggplot2)
Model Visualisation (with ggplot2)
 
R workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 seriesR workshop iii -- 3 hours to learn ggplot2 series
R workshop iii -- 3 hours to learn ggplot2 series
 
R packages
R packagesR packages
R packages
 
Machine learning in R
Machine learning in RMachine learning in R
Machine learning in R
 
4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply Function4 R Tutorial DPLYR Apply Function
4 R Tutorial DPLYR Apply Function
 
Data manipulation with dplyr
Data manipulation with dplyrData manipulation with dplyr
Data manipulation with dplyr
 
Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)Data Manipulation Using R (& dplyr)
Data Manipulation Using R (& dplyr)
 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with r
 
Grouping & Summarizing Data in R
Grouping & Summarizing Data in RGrouping & Summarizing Data in R
Grouping & Summarizing Data in R
 
Elegant Graphics for Data Analysis with ggplot2
Elegant Graphics for Data Analysis with ggplot2Elegant Graphics for Data Analysis with ggplot2
Elegant Graphics for Data Analysis with ggplot2
 
Rsplit apply combine
Rsplit apply combineRsplit apply combine
Rsplit apply combine
 

Ähnlich wie Visual Inference for Detecting Differences in Data Plots

Dynamics of Internet-mediated partnership formation
Dynamics of Internet-mediated partnership formationDynamics of Internet-mediated partnership formation
Dynamics of Internet-mediated partnership formationPetter Holme
 
BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward...
BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward...BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward...
BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward...Walid Saba
 
Dynamic lexicon brazil 2018
Dynamic lexicon brazil 2018Dynamic lexicon brazil 2018
Dynamic lexicon brazil 2018Peter Ludlow
 
Reimagining the Archive keynote presentation
Reimagining the Archive keynote presentationReimagining the Archive keynote presentation
Reimagining the Archive keynote presentationRick Prelinger
 
Augmented reality sandbox
Augmented reality sandboxAugmented reality sandbox
Augmented reality sandboxNico Rutten
 
Sense Perception
Sense PerceptionSense Perception
Sense Perceptionalgrant
 
AI, Sherlock Holmes style - Introduction to automated Abductive Inference
AI, Sherlock Holmes style - Introduction to automated Abductive InferenceAI, Sherlock Holmes style - Introduction to automated Abductive Inference
AI, Sherlock Holmes style - Introduction to automated Abductive InferenceAll Things Open
 
Anthropology 130 Research Simulation 3Forensic Anthropolog.docx
Anthropology 130 Research Simulation 3Forensic Anthropolog.docxAnthropology 130 Research Simulation 3Forensic Anthropolog.docx
Anthropology 130 Research Simulation 3Forensic Anthropolog.docxboyfieldhouse
 
2010 - Projeto Abelhas de Blackawton
2010 - Projeto Abelhas de Blackawton 2010 - Projeto Abelhas de Blackawton
2010 - Projeto Abelhas de Blackawton Ezequiel Redin
 
Our New Super Powers
Our New Super PowersOur New Super Powers
Our New Super Powersfrog
 
Finding Ostriches in the Courtroom
Finding Ostriches in the CourtroomFinding Ostriches in the Courtroom
Finding Ostriches in the CourtroomChristopher Collins
 
Index Of Wp-ContentUploads201001. Online assignment writing service.
Index Of Wp-ContentUploads201001. Online assignment writing service.Index Of Wp-ContentUploads201001. Online assignment writing service.
Index Of Wp-ContentUploads201001. Online assignment writing service.Nicole Wells
 
Descriptive Essay Describing A Person
Descriptive Essay Describing A PersonDescriptive Essay Describing A Person
Descriptive Essay Describing A PersonKimberly Willis
 
University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009Tom Moritz
 

Ähnlich wie Visual Inference for Detecting Differences in Data Plots (15)

Dynamics of Internet-mediated partnership formation
Dynamics of Internet-mediated partnership formationDynamics of Internet-mediated partnership formation
Dynamics of Internet-mediated partnership formation
 
BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward...
BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward...BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward...
BACK TO THE DRAWING BOARD - The Myth of Data-Driven NLU and How to go Forward...
 
Dynamic lexicon brazil 2018
Dynamic lexicon brazil 2018Dynamic lexicon brazil 2018
Dynamic lexicon brazil 2018
 
Reimagining the Archive keynote presentation
Reimagining the Archive keynote presentationReimagining the Archive keynote presentation
Reimagining the Archive keynote presentation
 
Augmented reality sandbox
Augmented reality sandboxAugmented reality sandbox
Augmented reality sandbox
 
20 Estimation
20 Estimation20 Estimation
20 Estimation
 
Sense Perception
Sense PerceptionSense Perception
Sense Perception
 
AI, Sherlock Holmes style - Introduction to automated Abductive Inference
AI, Sherlock Holmes style - Introduction to automated Abductive InferenceAI, Sherlock Holmes style - Introduction to automated Abductive Inference
AI, Sherlock Holmes style - Introduction to automated Abductive Inference
 
Anthropology 130 Research Simulation 3Forensic Anthropolog.docx
Anthropology 130 Research Simulation 3Forensic Anthropolog.docxAnthropology 130 Research Simulation 3Forensic Anthropolog.docx
Anthropology 130 Research Simulation 3Forensic Anthropolog.docx
 
2010 - Projeto Abelhas de Blackawton
2010 - Projeto Abelhas de Blackawton 2010 - Projeto Abelhas de Blackawton
2010 - Projeto Abelhas de Blackawton
 
Our New Super Powers
Our New Super PowersOur New Super Powers
Our New Super Powers
 
Finding Ostriches in the Courtroom
Finding Ostriches in the CourtroomFinding Ostriches in the Courtroom
Finding Ostriches in the Courtroom
 
Index Of Wp-ContentUploads201001. Online assignment writing service.
Index Of Wp-ContentUploads201001. Online assignment writing service.Index Of Wp-ContentUploads201001. Online assignment writing service.
Index Of Wp-ContentUploads201001. Online assignment writing service.
 
Descriptive Essay Describing A Person
Descriptive Essay Describing A PersonDescriptive Essay Describing A Person
Descriptive Essay Describing A Person
 
University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009
 

Mehr von Hadley Wickham (20)

22 spam
22 spam22 spam
22 spam
 
19 tables
19 tables19 tables
19 tables
 
18 cleaning
18 cleaning18 cleaning
18 cleaning
 
17 polishing
17 polishing17 polishing
17 polishing
 
16 critique
16 critique16 critique
16 critique
 
15 time-space
15 time-space15 time-space
15 time-space
 
14 case-study
14 case-study14 case-study
14 case-study
 
13 case-study
13 case-study13 case-study
13 case-study
 
12 adv-manip
12 adv-manip12 adv-manip
12 adv-manip
 
11 adv-manip
11 adv-manip11 adv-manip
11 adv-manip
 
11 adv-manip
11 adv-manip11 adv-manip
11 adv-manip
 
10 simulation
10 simulation10 simulation
10 simulation
 
10 simulation
10 simulation10 simulation
10 simulation
 
09 bootstrapping
09 bootstrapping09 bootstrapping
09 bootstrapping
 
08 functions
08 functions08 functions
08 functions
 
07 problem-solving
07 problem-solving07 problem-solving
07 problem-solving
 
06 data
06 data06 data
06 data
 
05 subsetting
05 subsetting05 subsetting
05 subsetting
 
04 reports
04 reports04 reports
04 reports
 
03 extensions
03 extensions03 extensions
03 extensions
 

Kürzlich hochgeladen

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 

Kürzlich hochgeladen (20)

Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 

Visual Inference for Detecting Differences in Data Plots

  • 1. “The human understanding, on account of its own nature, readily supposes a greater order and uniformity in things than it finds. And ... it devises parallels and correspondences and relations which are not there.” —Francis Bacon, 1620 Wednesday, 10 November 2010
  • 2. “The human understanding, on account of its own nature, readily supposes a greater order and uniformity in things than it finds. And ... it devises parallels and correspondences and relations which are not there.” —Francis Bacon, 1620 Is what we see reallythere? Wednesday, 10 November 2010
  • 3. October 2010 Hadley Wickham, Dianne Cook, Heike Hofmann, Andreas Buja Graphical inference for infovis Wednesday, 10 November 2010
  • 4. Which one of these plots is not like the others? Which of these plots just doesn’t belong? Wednesday, 10 November 2010
  • 5. 7 of those plots were plots of random (null) data. 1 plot was the real data. If you correctly picked the true plot from the null plots then we have evidence that it really is different. In fact, we have rigorous statistical evidence that there is a difference, just using Sesame Street skills! Wednesday, 10 November 2010
  • 6. 1. The statistical justice system 2. Line up protocol 3. Rorschach protocol 4. Future work Wednesday, 10 November 2010
  • 8. http://www.flickr.com/photos/joegratz/117048243 The statistical justice system Hypothesis testing? Wednesday, 10 November 2010
  • 9. Ho: null hypothesis Ha: alternative hypothesis Defence Prosecution Wednesday, 10 November 2010
  • 10. Ho: null hypothesis Ha: alternative hypothesis Defence Prosecution Null distribution Innocents Wednesday, 10 November 2010
  • 11. Ho: null hypothesis Ha: alternative hypothesis Defence Prosecution Reject the null Fail to reject the null Guilty Not guilty Null distribution Innocents Wednesday, 10 November 2010
  • 12. Ho: null hypothesis Ha: alternative hypothesis Defence Prosecution Reject the null Fail to reject the null Guilty Not guilty Null distribution Innocents p-value Probability that a truly innocent dataset would look as guilty as the suspect Wednesday, 10 November 2010
  • 13. Line up Wednesday, 10 November 2010
  • 14. believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view Five tag clouds of selected words from the 1st (red) and 6th (blue) editions of Darwin’s “Origin of Species”. Four of the tag clouds were generated under the null hypothesis of no difference between editions, and one is the true data. Can you spot it? Wednesday, 10 November 2010
  • 15. believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view believe believe case caseclosely closely descendants descendants few few long long modified modified variations variations very very view view Five tag clouds of selected words from the 1st (red) and 6th (blue) editions of Darwin’s “Origin of Species”. Four of the tag clouds were generated under the null hypothesis of no difference between editions, and one is the true data. Can you spot it? Wednesday, 10 November 2010
  • 16. Protocol Generate n-1 decoys (null datasets) Plot the decoys + the real data (randomly positioned) Show to an impartial observer. Can they spot the real data? If so, you have evidence for true difference (p-value = 1/n) Wednesday, 10 November 2010
  • 17. E. L. Scott, C. D. Shane, and M. D. Swanson. Comparison of the synthetic and actual distribution of galaxies on a photographic plate. Astrophysical Journal, 119:91–112, Jan. 1954. Wednesday, 10 November 2010
  • 18. A. M. Noll. Human or machine: A subjective comparison of Piet Mondrian’s “composition with lines” (1917) and a computer- generated picture. The Psychological Record, 16:1–10, 1966. Wednesday, 10 November 2010
  • 19. vs. classical tests Of course, if we know what we’re looking for, we can always develop an algorithm or numerical test. The advantage of visual inference is that works for very general tasks, including when you don’t know exactly what you’re looking for. Wednesday, 10 November 2010
  • 20. ower of the test ! Power 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 sigma = 12 !15 !10 !5 0 5 10 15 sigma = 5 !15 !10 !5 0 5 10 15 samplesize=100samplesize=300 power_curve Theoretical test Visual test lower_CL upper_CL Recent work shows that power only a little worse than classical test Wednesday, 10 November 2010
  • 21. Plot Task Choropleth map Is there a spatial trend? Treemap Is the distribution in higher level categories the same? Scatterplot Are the two variables independent? Time series Is there a trend in mean or variability? Wednesday, 10 November 2010
  • 25. Once we’ve seen the plot, we’re no longer impartial Wednesday, 10 November 2010
  • 26. Code # Support package written in R # http://github.com/ggobi/nullabor # Provides reference implementation of ideas library(nullabor) library(ggplot2) qplot(angle * 180 / pi, r, data = threept) %+% lineup(null_model(r ~ poly(angle, 2)), n = 10) + facet_wrap(~ .sample, ncol = 5) Wednesday, 10 November 2010
  • 28. Rorschach We’re surprisingly bad at appreciating the amount of variation in random data. Showing only null plots is a good way to calibrate our intuition. We also plan on using these plots as an empirical tool to understand what features people pick up on. Anecdotally, undergrads focus too much on outliers Wednesday, 10 November 2010
  • 29. result count 0 20 40 60 80 100 0 20 40 60 80 100 0 20 40 60 80 100 1 4 7 0.0 0.2 0.4 0.6 0.8 1.0 2 5 8 0.0 0.2 0.4 0.6 0.8 1.0 3 6 9 0.0 0.2 0.4 0.6 0.8 1.0 Wednesday, 10 November 2010
  • 30. Future work Wednesday, 10 November 2010
  • 31. Future work How can visual inference be integrated into visualisation software at a fundamental level? How does training impact results? How do novices vs. experts differ? What patterns do people pick up on? What are the alternatives that people respond to? Wednesday, 10 November 2010
  • 34. This work is licensed under the Creative Commons Attribution-Noncommercial 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc/ 3.0/us/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA. Wednesday, 10 November 2010