SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Downloaden Sie, um offline zu lesen
Evaluation
& User Study
Byungkyu Kang
FourEyes Lab,
Dept. of Computer Science
UC Santa Barbara
User Study
•

What is a study?
-

•

Empirically testing a hypothesis

Why run a study?
-

•

Determine ‘truth’
Evaluate if a statement is true

User Study on Different Platforms
-

Online / Offline
Purpose of User Study

•

Evaluate New Interface

•

Find the Ground Truth

•

Verify a Hypothesis

•

Discover errors and areas of improvement
What to Measure?

•

Usability Testing (User Study in HCI)
-

Efficiency : time and steps in a given task

-

Accuracy : mistakes (fatal or recoverable?)

-

Recall : How much does the person remember?

-

Emotional response : feeling about the task
(confident, stressed? recommendable?)
Crowdsourcing User Study

•

Online User Study using Crowdsource Platform
-

General Usability Test!

-

Ground Truth Annotation!
‣
‣

Micro-tasks on the Internet

‣

•

Amazon Mechanical Turk, CrowdFlower

Large sample, fast and low cost

Kittur et al., Crowdsourcing user studies with Mechanical
Turk. (CHI '08)
Crowdsourcing User Study
A Type
Interface A

Interface B

B Type
Screening Task

Task A
Questionnaire
Task B
Examples of User Study
•

User Interface Design
-

System, Application, Web Search

•

User Experience Evaluation

•

Virtual or Augmented Reality

•

Ground Truth Annotation

•

Visualization
-

Scientific Visualization

-

Information Visualization
What does User Study do?

When
Why

System

User Study

Qualitative!
Performance

Measure!
Quantitative!
Performance

How

Evaluate!
Overall!
Performance
User Study in InfoVis

“User studies offer a scientifically sound
method to measure a visualization’s
performance”
“to evaluate the strengths and
weaknesses of different visualization
techniques”
Kosara, Robert, et al. "Thoughts on user studies: Why, how, and when." 

IEEE Computer Graphics and Applications 23.4 (2003): 20-25.
IEEE VIS 2013: Panel
Evaluation: How Much Evaluation is Enough?
Panelists:
Min Chen, David Ebert, Brian Fisher, Tamara Munzner

Methodologies

Visual Analytics
&
Cognitive Science

Every paper needs an empirical evaluation?
D-Cog Study
Brian Fischer
•

To define an analytics that underpins
analysis

•

Pair analytics : Student "drives",
expert "navigates"
-

•

Student visual analyst & trained domain
expert collaborate on analytic task

Research Snapshot
-

How much of this can we do at VIS?

-

How can we facilitate others to do the
rest?

-

How can we interact with them?

-

What organization coordinates the
whole process?
Evaluation, When and How
Tamara Munzner

•

how to pick the right evaluation method.
-

A Nested Model for Visualization Design and
Validation. Munzner. TVCG 15(6):921-928, 2009
[InfoVis 09]

-

Remained Question: do you need a study if you’re
proposing a new idea?
Evaluation: broadly interpreted
[A Nested Model for Visualization Design and Validation. Munzner. TVCG 15(6):
921-928, 2009 (Proc. InfoVis 09).]

problem domain: !
observe target users using existing tools!
data/task abstraction:!
encoding/interaction technique: 

justify design wrt alternatives!
algorithm: 

measure system time

analyze computational complexity!
analyze results qualitatively

measure human time with lab experiment (“user study”)"
observe target users post-deployment (“field study”)"
measure adoption
http://www.cs.ubc.ca/labs/imager/tr/2009/NestedModel/
Evaluation: broadly interpreted

Threats and validation in the nested model.
Others
•

David Ebert

It all depends to context (How much eval?)
-

Answer important questions
‣

Better than previous contributions?

‣

Is the system effective and useful?

•

wrong scientific approach

•

statistically significant performance with toy study
do not work!

•

Publishing without user studies are fine and
sometimes better!
Discussions
•

Hypothesis “Blinded” vs “Opened”

•

Bias-free Design?

•

Avoid W.E.I.R.D.(Western, Educated, Industrialized,
Rich and Democratic) Society!

•

How Many Subjects Required?

•

Is Questionnaire Clear or Ambiguous?

•

Hawthorne effect (Observer Effect)?

Weitere ähnliche Inhalte

Was ist angesagt?

Admin Panel
Admin Panel Admin Panel
Online Job Portal ppt presentation
Online Job Portal ppt presentationOnline Job Portal ppt presentation
Online Job Portal ppt presentation
Prateek Kulshrestha
 

Was ist angesagt? (20)

What is Web-scraping?
What is Web-scraping?What is Web-scraping?
What is Web-scraping?
 
Do you trust your artificial intelligence system?
Do you trust your artificial intelligence system?Do you trust your artificial intelligence system?
Do you trust your artificial intelligence system?
 
Presentation Slides of College Management System Report
Presentation Slides of College Management System ReportPresentation Slides of College Management System Report
Presentation Slides of College Management System Report
 
Build data quality rules and data cleansing into your data pipelines
Build data quality rules and data cleansing into your data pipelinesBuild data quality rules and data cleansing into your data pipelines
Build data quality rules and data cleansing into your data pipelines
 
Admin Panel
Admin Panel Admin Panel
Admin Panel
 
Explainable AI is not yet Understandable AI
Explainable AI is not yet Understandable AIExplainable AI is not yet Understandable AI
Explainable AI is not yet Understandable AI
 
Library management system
Library management systemLibrary management system
Library management system
 
Alumni management system
Alumni management systemAlumni management system
Alumni management system
 
Information architecture
Information architectureInformation architecture
Information architecture
 
Tangible User Interface Showcase
Tangible User Interface ShowcaseTangible User Interface Showcase
Tangible User Interface Showcase
 
Usability testing
Usability testing  Usability testing
Usability testing
 
Teacher networks, teacher digital competence and professional development
Teacher networks, teacher digital competence and professional developmentTeacher networks, teacher digital competence and professional development
Teacher networks, teacher digital competence and professional development
 
Online school website
Online school websiteOnline school website
Online school website
 
Digital Archaeology
Digital ArchaeologyDigital Archaeology
Digital Archaeology
 
Azure Data Fundamentals DP 900 Full Course
Azure Data Fundamentals DP 900 Full CourseAzure Data Fundamentals DP 900 Full Course
Azure Data Fundamentals DP 900 Full Course
 
Huber, S. & Maykoo, Z. Getting Visual: Assessing the need for visual literacy...
Huber, S. & Maykoo, Z. Getting Visual: Assessing the need for visual literacy...Huber, S. & Maykoo, Z. Getting Visual: Assessing the need for visual literacy...
Huber, S. & Maykoo, Z. Getting Visual: Assessing the need for visual literacy...
 
Online Job Portal ppt presentation
Online Job Portal ppt presentationOnline Job Portal ppt presentation
Online Job Portal ppt presentation
 
Presentation on e learning management system
Presentation on e learning management systemPresentation on e learning management system
Presentation on e learning management system
 
Usability test
Usability testUsability test
Usability test
 
Library management system synopsis
Library management system synopsisLibrary management system synopsis
Library management system synopsis
 

Ähnlich wie Evaluation and User Study in HCI

ECE695DVisualAnalyticsprojectproposal (2)
ECE695DVisualAnalyticsprojectproposal (2)ECE695DVisualAnalyticsprojectproposal (2)
ECE695DVisualAnalyticsprojectproposal (2)
Shweta Gupte
 

Ähnlich wie Evaluation and User Study in HCI (20)

UX Design Process | Sample Proposal
UX Design Process | Sample Proposal UX Design Process | Sample Proposal
UX Design Process | Sample Proposal
 
ECE695DVisualAnalyticsprojectproposal (2)
ECE695DVisualAnalyticsprojectproposal (2)ECE695DVisualAnalyticsprojectproposal (2)
ECE695DVisualAnalyticsprojectproposal (2)
 
Aect 2018 workshop
Aect 2018 workshopAect 2018 workshop
Aect 2018 workshop
 
Aect2018 workshop-v6ij-compressed
Aect2018 workshop-v6ij-compressedAect2018 workshop-v6ij-compressed
Aect2018 workshop-v6ij-compressed
 
Levi McCusker UXD
Levi McCusker UXDLevi McCusker UXD
Levi McCusker UXD
 
Advanced Methods for User Evaluation in Enterprise AR
Advanced Methods for User Evaluation in Enterprise ARAdvanced Methods for User Evaluation in Enterprise AR
Advanced Methods for User Evaluation in Enterprise AR
 
Information Experience Lab, IE Lab at SISLT
Information Experience Lab, IE Lab at SISLTInformation Experience Lab, IE Lab at SISLT
Information Experience Lab, IE Lab at SISLT
 
Successfully Managing Customer Experience Combining VoC and UX Testing
Successfully Managing Customer Experience Combining VoC and UX TestingSuccessfully Managing Customer Experience Combining VoC and UX Testing
Successfully Managing Customer Experience Combining VoC and UX Testing
 
Conducting User Research
Conducting User ResearchConducting User Research
Conducting User Research
 
Rosenhan "User Research"
Rosenhan "User Research"Rosenhan "User Research"
Rosenhan "User Research"
 
UX (User Experience) Process, May 2017
UX (User Experience) Process, May 2017UX (User Experience) Process, May 2017
UX (User Experience) Process, May 2017
 
Generating Mobile Application Onboarding Insights Through Minimalist Instruction
Generating Mobile Application Onboarding Insights Through Minimalist InstructionGenerating Mobile Application Onboarding Insights Through Minimalist Instruction
Generating Mobile Application Onboarding Insights Through Minimalist Instruction
 
UXprobe workshop at Dare Festival 2016
UXprobe workshop at Dare Festival 2016UXprobe workshop at Dare Festival 2016
UXprobe workshop at Dare Festival 2016
 
Cognitive Science Perspective on User eXperience!
Cognitive Science Perspective on User eXperience!Cognitive Science Perspective on User eXperience!
Cognitive Science Perspective on User eXperience!
 
Understanding The Value Of User Research, Usability Testing, and Information ...
Understanding The Value Of User Research, Usability Testing, and Information ...Understanding The Value Of User Research, Usability Testing, and Information ...
Understanding The Value Of User Research, Usability Testing, and Information ...
 
Life as a UX consultant
Life as a UX consultant Life as a UX consultant
Life as a UX consultant
 
Kedar Chavan - UX Process.pdf
Kedar Chavan - UX Process.pdfKedar Chavan - UX Process.pdf
Kedar Chavan - UX Process.pdf
 
Validating hypotheses with user research
Validating hypotheses with user researchValidating hypotheses with user research
Validating hypotheses with user research
 
Brightfind world usability day 2016 full deck final
Brightfind world usability day 2016   full deck finalBrightfind world usability day 2016   full deck final
Brightfind world usability day 2016 full deck final
 
Jan Moons at WUD16
Jan Moons at WUD16Jan Moons at WUD16
Jan Moons at WUD16
 

Kürzlich hochgeladen

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Evaluation and User Study in HCI

  • 1. Evaluation & User Study Byungkyu Kang FourEyes Lab, Dept. of Computer Science UC Santa Barbara
  • 2. User Study • What is a study? - • Empirically testing a hypothesis Why run a study? - • Determine ‘truth’ Evaluate if a statement is true User Study on Different Platforms - Online / Offline
  • 3. Purpose of User Study • Evaluate New Interface • Find the Ground Truth • Verify a Hypothesis • Discover errors and areas of improvement
  • 4. What to Measure? • Usability Testing (User Study in HCI) - Efficiency : time and steps in a given task - Accuracy : mistakes (fatal or recoverable?) - Recall : How much does the person remember? - Emotional response : feeling about the task (confident, stressed? recommendable?)
  • 5. Crowdsourcing User Study • Online User Study using Crowdsource Platform - General Usability Test! - Ground Truth Annotation! ‣ ‣ Micro-tasks on the Internet ‣ • Amazon Mechanical Turk, CrowdFlower Large sample, fast and low cost Kittur et al., Crowdsourcing user studies with Mechanical Turk. (CHI '08)
  • 6. Crowdsourcing User Study A Type Interface A Interface B B Type Screening Task Task A Questionnaire Task B
  • 7. Examples of User Study • User Interface Design - System, Application, Web Search • User Experience Evaluation • Virtual or Augmented Reality • Ground Truth Annotation • Visualization - Scientific Visualization - Information Visualization
  • 8. What does User Study do? When Why System User Study Qualitative! Performance Measure! Quantitative! Performance How Evaluate! Overall! Performance
  • 9. User Study in InfoVis “User studies offer a scientifically sound method to measure a visualization’s performance” “to evaluate the strengths and weaknesses of different visualization techniques” Kosara, Robert, et al. "Thoughts on user studies: Why, how, and when." 
 IEEE Computer Graphics and Applications 23.4 (2003): 20-25.
  • 10. IEEE VIS 2013: Panel Evaluation: How Much Evaluation is Enough? Panelists: Min Chen, David Ebert, Brian Fisher, Tamara Munzner Methodologies Visual Analytics & Cognitive Science Every paper needs an empirical evaluation?
  • 11. D-Cog Study Brian Fischer • To define an analytics that underpins analysis • Pair analytics : Student "drives", expert "navigates" - • Student visual analyst & trained domain expert collaborate on analytic task Research Snapshot - How much of this can we do at VIS? - How can we facilitate others to do the rest? - How can we interact with them? - What organization coordinates the whole process?
  • 12. Evaluation, When and How Tamara Munzner • how to pick the right evaluation method. - A Nested Model for Visualization Design and Validation. Munzner. TVCG 15(6):921-928, 2009 [InfoVis 09] - Remained Question: do you need a study if you’re proposing a new idea?
  • 13. Evaluation: broadly interpreted [A Nested Model for Visualization Design and Validation. Munzner. TVCG 15(6): 921-928, 2009 (Proc. InfoVis 09).] problem domain: ! observe target users using existing tools! data/task abstraction:! encoding/interaction technique: 
 justify design wrt alternatives! algorithm: 
 measure system time
 analyze computational complexity! analyze results qualitatively
 measure human time with lab experiment (“user study”)" observe target users post-deployment (“field study”)" measure adoption http://www.cs.ubc.ca/labs/imager/tr/2009/NestedModel/
  • 14. Evaluation: broadly interpreted Threats and validation in the nested model.
  • 15. Others • David Ebert It all depends to context (How much eval?) - Answer important questions ‣ Better than previous contributions? ‣ Is the system effective and useful? • wrong scientific approach • statistically significant performance with toy study do not work! • Publishing without user studies are fine and sometimes better!
  • 16. Discussions • Hypothesis “Blinded” vs “Opened” • Bias-free Design? • Avoid W.E.I.R.D.(Western, Educated, Industrialized, Rich and Democratic) Society! • How Many Subjects Required? • Is Questionnaire Clear or Ambiguous? • Hawthorne effect (Observer Effect)?