SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Analyzing Responses to Likert Items An Exploration of Data from a Credibility Study Involving WikiDashboard (http://wikidashboard.parc.com) by Sanjay Kairam
WikiDashboard Study The System The Study The Data
WikiDashboard “Social Dynamic Analysis Tool” for Wikipedia Michael Scott (The Office): “Wikipedia is the best thing ever. Anyone in the world can write anything they want about any subject, so you know you are getting the best possible information” What happens when we see who is doing the editing?
WikiDashboard (Close-Up)
WikiDashboard Study Study conducted on Amazon Mechanical Turk N = 288 subjects Subjects paid $0.08 / HIT “Please read and evaluate this Wikipedia Article.”
Experiment Conditions Participants each placed in 1 of 3 conditions (each N = 96): Wiki Only (WO) Wiki + History (WH) WikiDashboard (WD)
Articles Used Each subject read 1 (of 8 possible) Wikipedia articles. Article “Quality”: “Low-Quality” articles were those flagged as “B-Class” or “C-Class” by the Wikipedia community. “High-Quality” articles were those which had at one time been “Featured Articles”. Article “Controversiality”: “Controversial” articles were those on the extensive “List of Controversial Articles”.
Survey Self-Reported Expertise “How familiar are you with the topic discussed on this Wikipedia page?” Manipulation/Quality Checks “In 5-20 words, please describe what this Wikipedia page is about.” “Please describe one fact from the article that you found interesting.” (WO) “Please name at least one user (by username or IP address) who has made multiple edits to this page. (WH, WD)
Credibility Assessment Assessing agreement with these statements: “I believe that the information on this page is accurate.” (Accuracy) “I believe that the information on this page is objective.” (Objectivity) “I believe that the information on this page is current and up-to-date.” (Currency) “I believe that this page fully covers the relevant information on the topic.” (Coverage) “I trust the information on this page.” (Trust)
Likert Item Responses Participants answered using a 5-point scale: -2:	“Strongly Disagree” -1:	“Somewhat Disagree” 0:	“Neither Agree nor Disagree” +1:	“Somewhat Agree” +2:	“Strongly Agree” Now, what do we do with this data?
Analyzing Likert Item Responses Very often, we see papers reporting Likert responses using means: What is the average of 1 “Somewhat Agree” and 3 “Somewhat Disagree”s? Hint: It’s not “Somewhat Disagree and a Half” In this case, what does a “mean” mean? In most cases, an ANOVA would definitely not work as well, though people still try!
Options for Analysis Non-Parametric Tests for Ordinal Data Conversion to an Interval Scale Aggregating Items
Mann-Whitney U Test Also called “Mann-Whitney-Wolcoxon”, “Wilcoxon Rank-Sum”, or “Wilcoxon-Mann-Whitney” test. Non-parametric test for assessing whether two independent samples of observations have equally large values. http://en.wikipedia.org/wiki/Mann-Whitney_U
Mann-Whitney U Test Assumptions: All observations from both groups are independent of each other. The responses are ordinal or continuous measurements. Null hypothesis includes symmetry between two populations considered Under alternative hypothesis, probability of an observation from pop. X exceeding an observation from pop. Y is not equal to 0.5 http://en.wikipedia.org/wiki/Mann-Whitney_U
Kruskal-Wallis ANOVA What if we want to test more than 2 groups? (as we do, given our 3 experimental conditions)  Kruskal-Wallis ANOVA is an extension of Mann-Whitney U to 3 or more groups. Also non-parametric, though it does assume that both distributions have a similar underlying shape. http://en.wikipedia.org/wiki/Kruskal-Wallis_one-way_analysis_of_variance
Analysis Using Non-Parametric Tests Do participants actually notice differences in article quality? Mann-Whitney: Significant effects of article quality for ratings of Accuracy (p< 0.001), Coverage (p< 0.01), Currency (p< 0.001), and Trust (p< 0.001), with marginally significant effect on Objectivity (p< 0.096). Kruskal-Wallis: Significant effect on ratings of Accuracy (p< 0.001), Coverage (p< 0.012), Currency (p< 0.001), and Trust (p< 0.001), with no significant effect on Objectivity.
Sample Boxplots: Ratings by Article Quality Accuracy Coverage
Analysis Using Non-Parametric Tests Do participants notice differences in how “controversial” an article is? Mann-Whitney: Significant effect on ratings of Coverage (p < 0.039), Currency (p < 0.039), Objectivity (p < 0.021), and Trust (p < 0.021), with no effect on ratings of Accuracy. Kruskal-Wallis: Significant effect on ratings of Objectivity (p < 0.042), and marginally significant effect for Coverage (p < 0.077) and Currency (p < 0.083), but no significant effect on Accuracy or Trust.
Analysis Using Non-Parametric Tests What we really want to know, however, is whether using WikiDashboard or Wiki + History makes participants more sensitive to article quality or controversiality than participants using Wikipedia on its own. Both tests only allow us to compare populations separated on the basis of a single variable, however, so we can’t explore these interaction effects.
Conversion to Interval Scale If there were a way to map our Likert item responses on to an interval scale, we could use more familiar/powerful statistical tests. If we found that the mapped data was normal, for instance, we could use our usual parametric tests such as MANOVA, which would help us find these interaction effects.
Conversion to Interval Scale E.J. Snell (1964) describes a procedure for mapping ordered data, like Likert responses, to an assumed underlying continuous scale of measurement. At the end, he emphasizes that “the usefulness of the proposed method depends upon the assumption that the underlying scale of measurement can be transformed to produce a normal distribution.” Snell, E.J. A Scaling Procedure for Ordered Categorical Data, Biometrics 20(3), pp. 592-607 (1964). http://www.jstor.org/stable/2528498
Utilizing the Snell Conversion The conversion procedure was used to transform the data – essentially mapped each response (ranging from -2 to +2) to a new point which ranged from roughly -1.00 to +4.05 Essentially, it looks as if only the distances between the values has changed.
Histogram: Original Data
Histogram: Snell-Converted Data
Aggregating Likert Items If we consider the various Likert items to be different measurements of a certain underlying trait (Credibility), then can we sum them and run parametric statistical tests? Haven’t tried this yet – is this a valid approach?
Analyzing Responses to Likert Items by Sanjay Kairam Email: sanjay.kairam@gmail.com Twitter: @skairam

Weitere ähnliche Inhalte

Was ist angesagt?

Data Analysis in Research: Descriptive Statistics & Normality
Data Analysis in Research: Descriptive Statistics & NormalityData Analysis in Research: Descriptive Statistics & Normality
Data Analysis in Research: Descriptive Statistics & NormalityIkbal Ahmed
 
Intro to quant_s_tudents
Intro to quant_s_tudentsIntro to quant_s_tudents
Intro to quant_s_tudentsMPA502a
 
Slayter on planning quant design for flc projects - may 2011
Slayter   on planning quant design for flc projects - may 2011Slayter   on planning quant design for flc projects - may 2011
Slayter on planning quant design for flc projects - may 2011Elspeth Slayter
 
2 statistics, measurement, graphical techniques
2 statistics, measurement, graphical techniques2 statistics, measurement, graphical techniques
2 statistics, measurement, graphical techniquesPenny Jiang
 
Spss introductory session data entry and descriptive stats
Spss introductory session data entry and descriptive statsSpss introductory session data entry and descriptive stats
Spss introductory session data entry and descriptive statse1033930
 
Week4 Ensure Analysis Is Accurate And Complete
Week4 Ensure Analysis Is Accurate And CompleteWeek4 Ensure Analysis Is Accurate And Complete
Week4 Ensure Analysis Is Accurate And Completehapy
 
Coursework Data Interpretation
Coursework   Data InterpretationCoursework   Data Interpretation
Coursework Data InterpretationAndy Knill
 
Analysing/Interpreting Quantitative Research
Analysing/Interpreting  Quantitative Research Analysing/Interpreting  Quantitative Research
Analysing/Interpreting Quantitative Research HariBolKafle
 
Data analysis and Presentation
Data analysis and PresentationData analysis and Presentation
Data analysis and PresentationJignesh Kariya
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aRai University
 
Statistical analysis, presentation on Data Analysis in Research.
Statistical analysis, presentation on Data Analysis in Research.Statistical analysis, presentation on Data Analysis in Research.
Statistical analysis, presentation on Data Analysis in Research.Leena Gauraha
 
Initial analysis of data metpen
Initial analysis of data metpenInitial analysis of data metpen
Initial analysis of data metpenGfv Gfv
 
Mba2216 week 11 data analysis part 03 appendix
Mba2216 week 11 data analysis part 03 appendixMba2216 week 11 data analysis part 03 appendix
Mba2216 week 11 data analysis part 03 appendixStephen Ong
 
3 survey, questionaire, graphic techniques
3 survey, questionaire, graphic techniques3 survey, questionaire, graphic techniques
3 survey, questionaire, graphic techniquesPenny Jiang
 
Abdm4064 week 11 data analysis
Abdm4064 week 11 data analysisAbdm4064 week 11 data analysis
Abdm4064 week 11 data analysisStephen Ong
 
Confirmatory factor analysis (cfa)
Confirmatory factor analysis (cfa)Confirmatory factor analysis (cfa)
Confirmatory factor analysis (cfa)HennaAnsari
 
DataGathering-Qualitative and Quantitative
DataGathering-Qualitative and QuantitativeDataGathering-Qualitative and Quantitative
DataGathering-Qualitative and QuantitativeSreenivas Ravi
 

Was ist angesagt? (20)

Data Analysis in Research: Descriptive Statistics & Normality
Data Analysis in Research: Descriptive Statistics & NormalityData Analysis in Research: Descriptive Statistics & Normality
Data Analysis in Research: Descriptive Statistics & Normality
 
Intro to quant_s_tudents
Intro to quant_s_tudentsIntro to quant_s_tudents
Intro to quant_s_tudents
 
Slayter on planning quant design for flc projects - may 2011
Slayter   on planning quant design for flc projects - may 2011Slayter   on planning quant design for flc projects - may 2011
Slayter on planning quant design for flc projects - may 2011
 
2 statistics, measurement, graphical techniques
2 statistics, measurement, graphical techniques2 statistics, measurement, graphical techniques
2 statistics, measurement, graphical techniques
 
Analyzing survey data
Analyzing survey dataAnalyzing survey data
Analyzing survey data
 
Spss introductory session data entry and descriptive stats
Spss introductory session data entry and descriptive statsSpss introductory session data entry and descriptive stats
Spss introductory session data entry and descriptive stats
 
Week4 Ensure Analysis Is Accurate And Complete
Week4 Ensure Analysis Is Accurate And CompleteWeek4 Ensure Analysis Is Accurate And Complete
Week4 Ensure Analysis Is Accurate And Complete
 
Coursework Data Interpretation
Coursework   Data InterpretationCoursework   Data Interpretation
Coursework Data Interpretation
 
Analysing/Interpreting Quantitative Research
Analysing/Interpreting  Quantitative Research Analysing/Interpreting  Quantitative Research
Analysing/Interpreting Quantitative Research
 
Data analysis and Presentation
Data analysis and PresentationData analysis and Presentation
Data analysis and Presentation
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation a
 
Statistical analysis, presentation on Data Analysis in Research.
Statistical analysis, presentation on Data Analysis in Research.Statistical analysis, presentation on Data Analysis in Research.
Statistical analysis, presentation on Data Analysis in Research.
 
Unit 1.2
Unit 1.2Unit 1.2
Unit 1.2
 
Initial analysis of data metpen
Initial analysis of data metpenInitial analysis of data metpen
Initial analysis of data metpen
 
Mba2216 week 11 data analysis part 03 appendix
Mba2216 week 11 data analysis part 03 appendixMba2216 week 11 data analysis part 03 appendix
Mba2216 week 11 data analysis part 03 appendix
 
3 survey, questionaire, graphic techniques
3 survey, questionaire, graphic techniques3 survey, questionaire, graphic techniques
3 survey, questionaire, graphic techniques
 
Abdm4064 week 11 data analysis
Abdm4064 week 11 data analysisAbdm4064 week 11 data analysis
Abdm4064 week 11 data analysis
 
Confirmatory factor analysis (cfa)
Confirmatory factor analysis (cfa)Confirmatory factor analysis (cfa)
Confirmatory factor analysis (cfa)
 
DataGathering-Qualitative and Quantitative
DataGathering-Qualitative and QuantitativeDataGathering-Qualitative and Quantitative
DataGathering-Qualitative and Quantitative
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 

Ähnlich wie Analyzing Responses to Likert Items

reliability and validity
reliability and validityreliability and validity
reliability and validitymikki khan
 
Multivariate Models in Questionnaire Development
Multivariate Models in Questionnaire DevelopmentMultivariate Models in Questionnaire Development
Multivariate Models in Questionnaire DevelopmentD Dutta Roy
 
Contemporary research practices
Contemporary research practicesContemporary research practices
Contemporary research practicesCarlo Magno
 
S6 quantitative research 2019
S6 quantitative research 2019S6 quantitative research 2019
S6 quantitative research 2019collierdr709
 
Answer all questions individually and cite all work!!1. Provid.docx
Answer all questions individually and cite all work!!1. Provid.docxAnswer all questions individually and cite all work!!1. Provid.docx
Answer all questions individually and cite all work!!1. Provid.docxfestockton
 
Method of measuring test reliability
Method of measuring test reliabilityMethod of measuring test reliability
Method of measuring test reliabilitynamrata227
 
Factor anaysis scale dimensionality
Factor anaysis scale dimensionalityFactor anaysis scale dimensionality
Factor anaysis scale dimensionalityCarlo Magno
 
Wikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing DocumentsWikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing DocumentsZareen Syed
 
Doing observation and Data Analysis for Qualitative Research
Doing observation and Data Analysis for Qualitative ResearchDoing observation and Data Analysis for Qualitative Research
Doing observation and Data Analysis for Qualitative ResearchAhmad Johari Sihes
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminologyaswhite
 
Factor analysis
Factor analysis Factor analysis
Factor analysis Nima
 
Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...Giannis Tsakonas
 
Strict Standards Only variables should be passed by reference.docx
Strict Standards Only variables should be passed by reference.docxStrict Standards Only variables should be passed by reference.docx
Strict Standards Only variables should be passed by reference.docxflorriezhamphrey3065
 
Validity and Reliability of the Research Instrument; How to Test the Validati...
Validity and Reliability of the Research Instrument; How to Test the Validati...Validity and Reliability of the Research Instrument; How to Test the Validati...
Validity and Reliability of the Research Instrument; How to Test the Validati...Hamed Taherdoost
 
Statistics Chapter 01[1]
Statistics  Chapter 01[1]Statistics  Chapter 01[1]
Statistics Chapter 01[1]plisasm
 

Ähnlich wie Analyzing Responses to Likert Items (20)

reliability and validity
reliability and validityreliability and validity
reliability and validity
 
Multivariate Models in Questionnaire Development
Multivariate Models in Questionnaire DevelopmentMultivariate Models in Questionnaire Development
Multivariate Models in Questionnaire Development
 
Contemporary research practices
Contemporary research practicesContemporary research practices
Contemporary research practices
 
Statistics
StatisticsStatistics
Statistics
 
S6 quantitative research 2019
S6 quantitative research 2019S6 quantitative research 2019
S6 quantitative research 2019
 
Experimental
ExperimentalExperimental
Experimental
 
Answer all questions individually and cite all work!!1. Provid.docx
Answer all questions individually and cite all work!!1. Provid.docxAnswer all questions individually and cite all work!!1. Provid.docx
Answer all questions individually and cite all work!!1. Provid.docx
 
Method of measuring test reliability
Method of measuring test reliabilityMethod of measuring test reliability
Method of measuring test reliability
 
Factor anaysis scale dimensionality
Factor anaysis scale dimensionalityFactor anaysis scale dimensionality
Factor anaysis scale dimensionality
 
EDA
EDAEDA
EDA
 
Wikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing DocumentsWikipedia as an Ontology for Describing Documents
Wikipedia as an Ontology for Describing Documents
 
Doing observation and Data Analysis for Qualitative Research
Doing observation and Data Analysis for Qualitative ResearchDoing observation and Data Analysis for Qualitative Research
Doing observation and Data Analysis for Qualitative Research
 
Review of Basic Statistics and Terminology
Review of Basic Statistics and TerminologyReview of Basic Statistics and Terminology
Review of Basic Statistics and Terminology
 
Factor analysis
Factor analysis Factor analysis
Factor analysis
 
Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...
 
Indexes scales and typologies
Indexes scales and typologiesIndexes scales and typologies
Indexes scales and typologies
 
Week_2_Lecture.pdf
Week_2_Lecture.pdfWeek_2_Lecture.pdf
Week_2_Lecture.pdf
 
Strict Standards Only variables should be passed by reference.docx
Strict Standards Only variables should be passed by reference.docxStrict Standards Only variables should be passed by reference.docx
Strict Standards Only variables should be passed by reference.docx
 
Validity and Reliability of the Research Instrument; How to Test the Validati...
Validity and Reliability of the Research Instrument; How to Test the Validati...Validity and Reliability of the Research Instrument; How to Test the Validati...
Validity and Reliability of the Research Instrument; How to Test the Validati...
 
Statistics Chapter 01[1]
Statistics  Chapter 01[1]Statistics  Chapter 01[1]
Statistics Chapter 01[1]
 

Kürzlich hochgeladen

ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIShubhangi Sonawane
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 

Kürzlich hochgeladen (20)

ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 

Analyzing Responses to Likert Items

  • 1. Analyzing Responses to Likert Items An Exploration of Data from a Credibility Study Involving WikiDashboard (http://wikidashboard.parc.com) by Sanjay Kairam
  • 2. WikiDashboard Study The System The Study The Data
  • 3. WikiDashboard “Social Dynamic Analysis Tool” for Wikipedia Michael Scott (The Office): “Wikipedia is the best thing ever. Anyone in the world can write anything they want about any subject, so you know you are getting the best possible information” What happens when we see who is doing the editing?
  • 5. WikiDashboard Study Study conducted on Amazon Mechanical Turk N = 288 subjects Subjects paid $0.08 / HIT “Please read and evaluate this Wikipedia Article.”
  • 6. Experiment Conditions Participants each placed in 1 of 3 conditions (each N = 96): Wiki Only (WO) Wiki + History (WH) WikiDashboard (WD)
  • 7. Articles Used Each subject read 1 (of 8 possible) Wikipedia articles. Article “Quality”: “Low-Quality” articles were those flagged as “B-Class” or “C-Class” by the Wikipedia community. “High-Quality” articles were those which had at one time been “Featured Articles”. Article “Controversiality”: “Controversial” articles were those on the extensive “List of Controversial Articles”.
  • 8. Survey Self-Reported Expertise “How familiar are you with the topic discussed on this Wikipedia page?” Manipulation/Quality Checks “In 5-20 words, please describe what this Wikipedia page is about.” “Please describe one fact from the article that you found interesting.” (WO) “Please name at least one user (by username or IP address) who has made multiple edits to this page. (WH, WD)
  • 9. Credibility Assessment Assessing agreement with these statements: “I believe that the information on this page is accurate.” (Accuracy) “I believe that the information on this page is objective.” (Objectivity) “I believe that the information on this page is current and up-to-date.” (Currency) “I believe that this page fully covers the relevant information on the topic.” (Coverage) “I trust the information on this page.” (Trust)
  • 10. Likert Item Responses Participants answered using a 5-point scale: -2: “Strongly Disagree” -1: “Somewhat Disagree” 0: “Neither Agree nor Disagree” +1: “Somewhat Agree” +2: “Strongly Agree” Now, what do we do with this data?
  • 11. Analyzing Likert Item Responses Very often, we see papers reporting Likert responses using means: What is the average of 1 “Somewhat Agree” and 3 “Somewhat Disagree”s? Hint: It’s not “Somewhat Disagree and a Half” In this case, what does a “mean” mean? In most cases, an ANOVA would definitely not work as well, though people still try!
  • 12. Options for Analysis Non-Parametric Tests for Ordinal Data Conversion to an Interval Scale Aggregating Items
  • 13. Mann-Whitney U Test Also called “Mann-Whitney-Wolcoxon”, “Wilcoxon Rank-Sum”, or “Wilcoxon-Mann-Whitney” test. Non-parametric test for assessing whether two independent samples of observations have equally large values. http://en.wikipedia.org/wiki/Mann-Whitney_U
  • 14. Mann-Whitney U Test Assumptions: All observations from both groups are independent of each other. The responses are ordinal or continuous measurements. Null hypothesis includes symmetry between two populations considered Under alternative hypothesis, probability of an observation from pop. X exceeding an observation from pop. Y is not equal to 0.5 http://en.wikipedia.org/wiki/Mann-Whitney_U
  • 15. Kruskal-Wallis ANOVA What if we want to test more than 2 groups? (as we do, given our 3 experimental conditions) Kruskal-Wallis ANOVA is an extension of Mann-Whitney U to 3 or more groups. Also non-parametric, though it does assume that both distributions have a similar underlying shape. http://en.wikipedia.org/wiki/Kruskal-Wallis_one-way_analysis_of_variance
  • 16. Analysis Using Non-Parametric Tests Do participants actually notice differences in article quality? Mann-Whitney: Significant effects of article quality for ratings of Accuracy (p< 0.001), Coverage (p< 0.01), Currency (p< 0.001), and Trust (p< 0.001), with marginally significant effect on Objectivity (p< 0.096). Kruskal-Wallis: Significant effect on ratings of Accuracy (p< 0.001), Coverage (p< 0.012), Currency (p< 0.001), and Trust (p< 0.001), with no significant effect on Objectivity.
  • 17. Sample Boxplots: Ratings by Article Quality Accuracy Coverage
  • 18. Analysis Using Non-Parametric Tests Do participants notice differences in how “controversial” an article is? Mann-Whitney: Significant effect on ratings of Coverage (p < 0.039), Currency (p < 0.039), Objectivity (p < 0.021), and Trust (p < 0.021), with no effect on ratings of Accuracy. Kruskal-Wallis: Significant effect on ratings of Objectivity (p < 0.042), and marginally significant effect for Coverage (p < 0.077) and Currency (p < 0.083), but no significant effect on Accuracy or Trust.
  • 19. Analysis Using Non-Parametric Tests What we really want to know, however, is whether using WikiDashboard or Wiki + History makes participants more sensitive to article quality or controversiality than participants using Wikipedia on its own. Both tests only allow us to compare populations separated on the basis of a single variable, however, so we can’t explore these interaction effects.
  • 20. Conversion to Interval Scale If there were a way to map our Likert item responses on to an interval scale, we could use more familiar/powerful statistical tests. If we found that the mapped data was normal, for instance, we could use our usual parametric tests such as MANOVA, which would help us find these interaction effects.
  • 21. Conversion to Interval Scale E.J. Snell (1964) describes a procedure for mapping ordered data, like Likert responses, to an assumed underlying continuous scale of measurement. At the end, he emphasizes that “the usefulness of the proposed method depends upon the assumption that the underlying scale of measurement can be transformed to produce a normal distribution.” Snell, E.J. A Scaling Procedure for Ordered Categorical Data, Biometrics 20(3), pp. 592-607 (1964). http://www.jstor.org/stable/2528498
  • 22. Utilizing the Snell Conversion The conversion procedure was used to transform the data – essentially mapped each response (ranging from -2 to +2) to a new point which ranged from roughly -1.00 to +4.05 Essentially, it looks as if only the distances between the values has changed.
  • 25. Aggregating Likert Items If we consider the various Likert items to be different measurements of a certain underlying trait (Credibility), then can we sum them and run parametric statistical tests? Haven’t tried this yet – is this a valid approach?
  • 26. Analyzing Responses to Likert Items by Sanjay Kairam Email: sanjay.kairam@gmail.com Twitter: @skairam

Hinweis der Redaktion

  1. Incidentally, the box plots show that not all of the effects are actually going in the supposed direction.For instance, Accuracy/Trust are higher for “Low-Quality” items, and Coverage/Currency/Objectivity are higher for “High-Quality” items.Some Thoughts:While the subjects rated the B/C-Class Articles as less complete and more current than the Featured Articles, they rated the “lower-quality” articles as more accurate and more trustworthy overall than the Featured Articles. This makes more sense when you consider why these articles were flagged in the first place. The article for “hip hop music” was actually once a featured article, but is currently flagged for reasons relating to the scope of hip hop (“No references to any of the classic hip hop stars”, “Hip-hop and Rap are entirely different things”, etc.), so perhaps while the existing material in the article may have been seen as accurate, the article overall is unsatisfactory for its lack of coverage and currency. Regarding “hypnagogia”, one of the discussion points flagged even on the main article page is that the article should be merged with another article, “Threshold Consciousness”, indicating that the article may not cover the topic fully.
  2. Again, we see an interesting pattern where they rated the “controversial” items as more credible on some measures and less credible on others. The fact that they rated the “controversial” articles higher on “coverage”, “currency”, and “objectivity” may have to do with the fact that these articles receive a high number of edits overall, meaning that the pages are frequently updated by a number of different editors? It’s interesting to see that in spite of seeing the “controversial” articles as more complete and objective, they still trust the non-controversial articles more.