SlideShare ist ein Scribd-Unternehmen logo
1 von 7
Mutually Linked StudiesBalancing Threats to Internal and Ecological Validity in InfoVis Evaluation Niklas ElmqvistSchool of Electrical & Computer EngineeringPurdue University
The Problem of Validity InfoVis evaluation often comes in two flavors: Rigorous + unrealistic Scientific rigor Toy dataset => Lacking in ecological validity Realistic + ad hoc Real dataset Imperfect experimental design => Lacking in internal validity 2
Example: The ColorLens 3 [McDonnel & Elmqvist 2009]
Example: Evaluating the ColorLens Realistic photographs: Ecologically valid (+) Learning effects (-) Cannot control images (-) Does not “feel” canonical! (-) Perlin Noise: Full control over dataset (+) Eliminate learning (+) Canonical and abstract (+) Not realistic! (-) 4 ?
Mutually Linked Studies Why choose one type of study? Do both!  Idea: Mutually linked studies 1 x toy study – canonical and exhaustive 1 x realistic study – prove it works Complement each other Recycle much of the mechanics for both Experimental design Tasks Analysis Participants (!) 5 Reduces time and paper space requirements for conducting two studies (they are not separate)!
Questions? Acknowledgments: Color Lens co-authors(coming soon to a journal near you)Pierre DragicevicHM Jean-Daniel Fekete 6
Validity Primer Several different types of validity… I will discuss the following here: Internal validity Degree to which the outcome is a function of the controlled parameters of experiment Ecological validity Degree to which the results of the experiment can be applied to realistic situations 7

Weitere ähnliche Inhalte

Mehr von BELIV Workshop

Towards Information-Theoretic Visualization Evaluation Measure: A Practical e...
Towards Information-Theoretic Visualization Evaluation Measure: A Practical e...Towards Information-Theoretic Visualization Evaluation Measure: A Practical e...
Towards Information-Theoretic Visualization Evaluation Measure: A Practical e...BELIV Workshop
 
Is Your User Hunting or Gathering Insights? Identifying Insight Drivers Acros...
Is Your User Hunting or Gathering Insights? Identifying Insight Drivers Acros...Is Your User Hunting or Gathering Insights? Identifying Insight Drivers Acros...
Is Your User Hunting or Gathering Insights? Identifying Insight Drivers Acros...BELIV Workshop
 
A Descriptive Model of Visual Scanning.
A Descriptive Model of Visual Scanning.A Descriptive Model of Visual Scanning.
A Descriptive Model of Visual Scanning.BELIV Workshop
 
Generating a synthetic video dataset
Generating a synthetic video datasetGenerating a synthetic video dataset
Generating a synthetic video datasetBELIV Workshop
 
Beyond system logging: human logging for evaluating information visualization.
Beyond system logging: human logging for evaluating information visualization.Beyond system logging: human logging for evaluating information visualization.
Beyond system logging: human logging for evaluating information visualization.BELIV Workshop
 
Scanning Between Graph Visualizations: An Eye Tracking Evaluation.
Scanning Between Graph Visualizations: An Eye Tracking Evaluation.Scanning Between Graph Visualizations: An Eye Tracking Evaluation.
Scanning Between Graph Visualizations: An Eye Tracking Evaluation.BELIV Workshop
 
Focus Groups for Functional InfoVis Prototype Evaluation: A Case Study.
Focus Groups for Functional InfoVis Prototype Evaluation: A Case Study.Focus Groups for Functional InfoVis Prototype Evaluation: A Case Study.
Focus Groups for Functional InfoVis Prototype Evaluation: A Case Study.BELIV Workshop
 
Learning-Based Evaluation of Visual Analytic Systems.
Learning-Based Evaluation of Visual Analytic Systems.Learning-Based Evaluation of Visual Analytic Systems.
Learning-Based Evaluation of Visual Analytic Systems.BELIV Workshop
 
Visualization Evaluation of the Masses, by the Masses, and for the Masses.
Visualization Evaluation of the Masses, by the Masses, and for the Masses.Visualization Evaluation of the Masses, by the Masses, and for the Masses.
Visualization Evaluation of the Masses, by the Masses, and for the Masses.BELIV Workshop
 
Evaluating Information Visualization in Large Companies: Challenges, Experien...
Evaluating Information Visualization in Large Companies: Challenges, Experien...Evaluating Information Visualization in Large Companies: Challenges, Experien...
Evaluating Information Visualization in Large Companies: Challenges, Experien...BELIV Workshop
 
Pragmatic Challenges in the Evaluation of Interactive Visualization Systems.
Pragmatic Challenges in the Evaluation of Interactive Visualization Systems.Pragmatic Challenges in the Evaluation of Interactive Visualization Systems.
Pragmatic Challenges in the Evaluation of Interactive Visualization Systems.BELIV Workshop
 
Comparative Evaluation of Two Interface Tools in Performing Visual Analytics ...
Comparative Evaluation of Two Interface Tools in Performing Visual Analytics ...Comparative Evaluation of Two Interface Tools in Performing Visual Analytics ...
Comparative Evaluation of Two Interface Tools in Performing Visual Analytics ...BELIV Workshop
 
BELIV'10 Keynote: Conceptual and Practical Challenges in InfoViz Evaluations
BELIV'10 Keynote: Conceptual and Practical Challenges in InfoViz EvaluationsBELIV'10 Keynote: Conceptual and Practical Challenges in InfoViz Evaluations
BELIV'10 Keynote: Conceptual and Practical Challenges in InfoViz EvaluationsBELIV Workshop
 

Mehr von BELIV Workshop (13)

Towards Information-Theoretic Visualization Evaluation Measure: A Practical e...
Towards Information-Theoretic Visualization Evaluation Measure: A Practical e...Towards Information-Theoretic Visualization Evaluation Measure: A Practical e...
Towards Information-Theoretic Visualization Evaluation Measure: A Practical e...
 
Is Your User Hunting or Gathering Insights? Identifying Insight Drivers Acros...
Is Your User Hunting or Gathering Insights? Identifying Insight Drivers Acros...Is Your User Hunting or Gathering Insights? Identifying Insight Drivers Acros...
Is Your User Hunting or Gathering Insights? Identifying Insight Drivers Acros...
 
A Descriptive Model of Visual Scanning.
A Descriptive Model of Visual Scanning.A Descriptive Model of Visual Scanning.
A Descriptive Model of Visual Scanning.
 
Generating a synthetic video dataset
Generating a synthetic video datasetGenerating a synthetic video dataset
Generating a synthetic video dataset
 
Beyond system logging: human logging for evaluating information visualization.
Beyond system logging: human logging for evaluating information visualization.Beyond system logging: human logging for evaluating information visualization.
Beyond system logging: human logging for evaluating information visualization.
 
Scanning Between Graph Visualizations: An Eye Tracking Evaluation.
Scanning Between Graph Visualizations: An Eye Tracking Evaluation.Scanning Between Graph Visualizations: An Eye Tracking Evaluation.
Scanning Between Graph Visualizations: An Eye Tracking Evaluation.
 
Focus Groups for Functional InfoVis Prototype Evaluation: A Case Study.
Focus Groups for Functional InfoVis Prototype Evaluation: A Case Study.Focus Groups for Functional InfoVis Prototype Evaluation: A Case Study.
Focus Groups for Functional InfoVis Prototype Evaluation: A Case Study.
 
Learning-Based Evaluation of Visual Analytic Systems.
Learning-Based Evaluation of Visual Analytic Systems.Learning-Based Evaluation of Visual Analytic Systems.
Learning-Based Evaluation of Visual Analytic Systems.
 
Visualization Evaluation of the Masses, by the Masses, and for the Masses.
Visualization Evaluation of the Masses, by the Masses, and for the Masses.Visualization Evaluation of the Masses, by the Masses, and for the Masses.
Visualization Evaluation of the Masses, by the Masses, and for the Masses.
 
Evaluating Information Visualization in Large Companies: Challenges, Experien...
Evaluating Information Visualization in Large Companies: Challenges, Experien...Evaluating Information Visualization in Large Companies: Challenges, Experien...
Evaluating Information Visualization in Large Companies: Challenges, Experien...
 
Pragmatic Challenges in the Evaluation of Interactive Visualization Systems.
Pragmatic Challenges in the Evaluation of Interactive Visualization Systems.Pragmatic Challenges in the Evaluation of Interactive Visualization Systems.
Pragmatic Challenges in the Evaluation of Interactive Visualization Systems.
 
Comparative Evaluation of Two Interface Tools in Performing Visual Analytics ...
Comparative Evaluation of Two Interface Tools in Performing Visual Analytics ...Comparative Evaluation of Two Interface Tools in Performing Visual Analytics ...
Comparative Evaluation of Two Interface Tools in Performing Visual Analytics ...
 
BELIV'10 Keynote: Conceptual and Practical Challenges in InfoViz Evaluations
BELIV'10 Keynote: Conceptual and Practical Challenges in InfoViz EvaluationsBELIV'10 Keynote: Conceptual and Practical Challenges in InfoViz Evaluations
BELIV'10 Keynote: Conceptual and Practical Challenges in InfoViz Evaluations
 

Mutually Linked Studies - Balancing Threats to Internal and Ecological Validity in InfoVis Evaluation.

  • 1. Mutually Linked StudiesBalancing Threats to Internal and Ecological Validity in InfoVis Evaluation Niklas ElmqvistSchool of Electrical & Computer EngineeringPurdue University
  • 2. The Problem of Validity InfoVis evaluation often comes in two flavors: Rigorous + unrealistic Scientific rigor Toy dataset => Lacking in ecological validity Realistic + ad hoc Real dataset Imperfect experimental design => Lacking in internal validity 2
  • 3. Example: The ColorLens 3 [McDonnel & Elmqvist 2009]
  • 4. Example: Evaluating the ColorLens Realistic photographs: Ecologically valid (+) Learning effects (-) Cannot control images (-) Does not “feel” canonical! (-) Perlin Noise: Full control over dataset (+) Eliminate learning (+) Canonical and abstract (+) Not realistic! (-) 4 ?
  • 5. Mutually Linked Studies Why choose one type of study? Do both! Idea: Mutually linked studies 1 x toy study – canonical and exhaustive 1 x realistic study – prove it works Complement each other Recycle much of the mechanics for both Experimental design Tasks Analysis Participants (!) 5 Reduces time and paper space requirements for conducting two studies (they are not separate)!
  • 6. Questions? Acknowledgments: Color Lens co-authors(coming soon to a journal near you)Pierre DragicevicHM Jean-Daniel Fekete 6
  • 7. Validity Primer Several different types of validity… I will discuss the following here: Internal validity Degree to which the outcome is a function of the controlled parameters of experiment Ecological validity Degree to which the results of the experiment can be applied to realistic situations 7