SlideShare ist ein Scribd-Unternehmen logo
1 von 17
A Brief
Introduction to
the 12 Steps of
Evaluation Data
Cleaning
Jennifer Ann Morrow, Ph.D.
University of Tennessee
Importance of Cleaning Data
• As evaluators we need our evaluation data to
be:
– Accurate
– Complete
– High quality
– Reliable
– Unbiased
– Valid
If We Don’t Clean Our Data
• Problems that can occur:
– Inaccurate/biased conclusions
– Increased error
– Reduced credibility
– Reduced generalizability
– Violation of statistical assumptions
1: Create a Data Codebook
• Contains all relevant information for your
evaluation project
• Suggestions for what to include:
– Electronic file names
– Variable names, variable labels, value labels
– Complete list of modified variables
– Citations for instrument sources
– Project diary
2: Create a Data Analysis Plan
• Your analysis plan should list each step you
will take when analyzing your data
• Suggestions for what to include:
– General instructions for data analysts
– List of datasets
– Evaluation questions
– Variables used for each analysis
– Specific analyses and graphics for each
evaluation question
3: Perform Initial Frequencies –
Round 1
• After organizing your codebook and analysis
plan you can now begin to start the data
cleaning process
• Conduct frequency analyses (frequencies,
percentages) for EVERY variable in your
evaluation dataset
• Suggestion:
– request a graphic (bar chart or histogram) for
each variable
4: Check for Coding Mistakes
• Coding errors are any values that are not
within the specified range for your variable
(e.g., you have a rating scale from 1-5 and
you have a value of 9)
• Suggestions:
– Compare all values to what is listed in your
codebook
– In many cases errors are unspecified missing
data values
5: Modify and Create Variables
• It is now time to modify your variables so
they can be used in your planned analyses
• Suggestions:
– Reverse code any variables that need to be
merged with others that are on the opposite
scale
– Recode any variables to match your codebook
– Create new variables (e.g., averages, total
scores) to be used for future analyses
6: Frequencies and Descriptives –
Round 2
• At this step you conduct frequency analyses
on every variable and descriptive analyses
on every continuous variable
• Suggestions:
– Review the following descriptives: mean,
median, mode, standard deviation, skewness,
kurtosis, minimum, and maximum
– Create standardized scores (i.e., Z-scores) for
every continuous variable
7: Search for Outliers
• Review your standardized scores and
histograms to check for outliers
• Outliers are scores that deviate greatly from
the mean (e.g., >/3.29/ standard
deviations) and potentially can create or
cover up statistical significance
• Suggestions:
– delete, transform, or alter (winsorize, trim,
modify) your outliers
8: Assess for Normality
• For many inferential statistics (e.g., analyses
of variance, regressions) your outcome
(dependent) variable should be normally
distributed (i.e., mean=median=mode)
• Suggestions:
– check to see if the values of your skewness and
kurtosis are greater than /2/
– Transform the variable, use a non-parametric
analysis, or modify your alpha level
9: Dealing with Missing Data
• You should always check to see if missing
data is random or non-random (i.e.,
patterns of missing data)
• Evaluation results can be misleading and
less generalizable
• Suggestions:
– Delete cases/variables with missing data,
estimate missing data, conduct analyses with
and without modifying variables
10: Examine Cell Sample Size
• For many of our analyses (e.g., group
difference statistics) we want to have equal
sample sizes in our cells of our design
• Unequal sample sizes lead to lower statistical
power and reduced generalizability
• Suggestions:
– Collapse categories within a variable, use a non-
parametric analysis, or apply a more stringent
alpha level
11: Frequencies and Descriptives –
The Finale
• Your data is now cleaned and ready to be
summarized!
• Conduct a final set of frequencies and
descriptives prior to conducting your
inferential statistics
• Suggestion:
– Use a variety of graphics and visual aids to
showcase your evaluation data for your clients
12: Assumption Testing
• For some inferential statistics (e.g.,
correlational analyses, group difference
analyses) you still need to address a few
additional assumptions in order to conduct
the analysis
• Suggestions:
– Some common assumptions are: homogeneity of
variance, linearity, independence of errors,
multicollinearity, and reliability
Some Helpful Resources
• YouTube videos
– http://www.youtube.com/watch?v=R6Cc5flsbsw
– http://www.youtube.com/watch?
v=5qhLDYr70MM&feature=channel&list=UL
• Websites
– http://clinistat.hk/internetresource.php
– http://pareonline.net/getvn.asp?v=9&n=6
• Software
– http://www.gnu.org/software/pspp/
– http://davidmlane.com/hyperstat/Statistical_analyses.html
Contact Information
Jennifer Ann Morrow, Ph.D.
Associate Professor of Evaluation, Statistics, and Measurement
Department of Educational Psychology and Counseling
University of Tennessee
Knoxville, TN 37996-3452
Email: jamorrow@utk.edu
http://web.utk.edu/~edpsych/eval_assessment/default.html

Weitere ähnliche Inhalte

Was ist angesagt?

Quantitative Data Analysis
Quantitative Data AnalysisQuantitative Data Analysis
Quantitative Data AnalysisAsma Muhamad
 
Analysis of data in research
Analysis of data in researchAnalysis of data in research
Analysis of data in researchAbhijeet Birari
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and StatisticsT.S. Lim
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysisVishwas N
 
Data collection and instrumentation
Data collection and instrumentationData collection and instrumentation
Data collection and instrumentationAmobi Peter
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysisGramener
 
Sampling Design in qualitative Research.pdf
Sampling Design in qualitative Research.pdfSampling Design in qualitative Research.pdf
Sampling Design in qualitative Research.pdfDaniel Temesgen Gelan
 
Data analysis
Data analysisData analysis
Data analysisneha147
 
Statistical analysis and interpretation
Statistical analysis and interpretationStatistical analysis and interpretation
Statistical analysis and interpretationDave Marcial
 
Statistical analysis using spss
Statistical analysis using spssStatistical analysis using spss
Statistical analysis using spssjpcagphil
 
EDA | Exploratory Data Analysis | Machine Learning | Data Science
EDA | Exploratory Data Analysis | Machine Learning | Data ScienceEDA | Exploratory Data Analysis | Machine Learning | Data Science
EDA | Exploratory Data Analysis | Machine Learning | Data ScienceSumit Pandey
 
Data Analysis - Approach & Techniques
Data Analysis - Approach & TechniquesData Analysis - Approach & Techniques
Data Analysis - Approach & TechniquesInvenkLearn
 
data collection
data collection data collection
data collection KingMajanga
 

Was ist angesagt? (20)

Quantitative Data Analysis
Quantitative Data AnalysisQuantitative Data Analysis
Quantitative Data Analysis
 
Analysis of data in research
Analysis of data in researchAnalysis of data in research
Analysis of data in research
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and Statistics
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 
Star schema PPT
Star schema PPTStar schema PPT
Star schema PPT
 
Data collection and instrumentation
Data collection and instrumentationData collection and instrumentation
Data collection and instrumentation
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 
Sampling Design in qualitative Research.pdf
Sampling Design in qualitative Research.pdfSampling Design in qualitative Research.pdf
Sampling Design in qualitative Research.pdf
 
Chapter 10-DATA ANALYSIS & PRESENTATION
Chapter 10-DATA ANALYSIS & PRESENTATIONChapter 10-DATA ANALYSIS & PRESENTATION
Chapter 10-DATA ANALYSIS & PRESENTATION
 
Data analysis
Data analysisData analysis
Data analysis
 
Statistical analysis and interpretation
Statistical analysis and interpretationStatistical analysis and interpretation
Statistical analysis and interpretation
 
Statistical analysis using spss
Statistical analysis using spssStatistical analysis using spss
Statistical analysis using spss
 
Introduction to Stata
Introduction to Stata Introduction to Stata
Introduction to Stata
 
EDA | Exploratory Data Analysis | Machine Learning | Data Science
EDA | Exploratory Data Analysis | Machine Learning | Data ScienceEDA | Exploratory Data Analysis | Machine Learning | Data Science
EDA | Exploratory Data Analysis | Machine Learning | Data Science
 
Data Preprocessing
Data PreprocessingData Preprocessing
Data Preprocessing
 
Data Analysis - Approach & Techniques
Data Analysis - Approach & TechniquesData Analysis - Approach & Techniques
Data Analysis - Approach & Techniques
 
Data Analysis
Data Analysis Data Analysis
Data Analysis
 
data collection
data collection data collection
data collection
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
DATA ANALYSIS
DATA ANALYSISDATA ANALYSIS
DATA ANALYSIS
 

Ähnlich wie Brief Introduction to the 12 Steps of Evaluation Data Cleaning

Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aRai University
 
Workshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelWorkshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelHiram Ting
 
Module 4 data analysis
Module 4 data analysisModule 4 data analysis
Module 4 data analysisILRI-Jmaru
 
Group 1 Report CRISP - DM METHODOLOGY.pptx
Group 1 Report CRISP - DM METHODOLOGY.pptxGroup 1 Report CRISP - DM METHODOLOGY.pptx
Group 1 Report CRISP - DM METHODOLOGY.pptxellamangapis2003
 
Lecture_4_Data_Gathering_and_Analysis.pdf
Lecture_4_Data_Gathering_and_Analysis.pdfLecture_4_Data_Gathering_and_Analysis.pdf
Lecture_4_Data_Gathering_and_Analysis.pdfAbdullahOmar64
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and ProcessingMehul Gondaliya
 
Final spss hands on training (descriptive analysis) may 24th 2013
Final spss  hands on training (descriptive analysis) may 24th 2013Final spss  hands on training (descriptive analysis) may 24th 2013
Final spss hands on training (descriptive analysis) may 24th 2013Tin Myo Han
 
Design of experiments BY Minitab
Design of experiments BY MinitabDesign of experiments BY Minitab
Design of experiments BY MinitabDr-Jitendra Patel
 
Data analysis plan in medicine and nurse.pptx
Data analysis plan in medicine and nurse.pptxData analysis plan in medicine and nurse.pptx
Data analysis plan in medicine and nurse.pptxJuma675663
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsVivastream
 
Data Analysis and Synthesis & Techniques of System.pptx
Data Analysis and Synthesis & Techniques of System.pptxData Analysis and Synthesis & Techniques of System.pptx
Data Analysis and Synthesis & Techniques of System.pptxTs. Heshalini Rajagopal
 
Data warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniquesData warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniquesVaibhav Khanna
 

Ähnlich wie Brief Introduction to the 12 Steps of Evaluation Data Cleaning (20)

lecture-8.pdf
lecture-8.pdflecture-8.pdf
lecture-8.pdf
 
Mba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation aMba ii rm unit-4.1 data analysis & presentation a
Mba ii rm unit-4.1 data analysis & presentation a
 
Workshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate LevelWorkshop on SPSS: Basic to Intermediate Level
Workshop on SPSS: Basic to Intermediate Level
 
Module 4 data analysis
Module 4 data analysisModule 4 data analysis
Module 4 data analysis
 
Group 1 Report CRISP - DM METHODOLOGY.pptx
Group 1 Report CRISP - DM METHODOLOGY.pptxGroup 1 Report CRISP - DM METHODOLOGY.pptx
Group 1 Report CRISP - DM METHODOLOGY.pptx
 
Lecture_4_Data_Gathering_and_Analysis.pdf
Lecture_4_Data_Gathering_and_Analysis.pdfLecture_4_Data_Gathering_and_Analysis.pdf
Lecture_4_Data_Gathering_and_Analysis.pdf
 
Data Preparation and Processing
Data Preparation and ProcessingData Preparation and Processing
Data Preparation and Processing
 
Data mining
Data miningData mining
Data mining
 
Final spss hands on training (descriptive analysis) may 24th 2013
Final spss  hands on training (descriptive analysis) may 24th 2013Final spss  hands on training (descriptive analysis) may 24th 2013
Final spss hands on training (descriptive analysis) may 24th 2013
 
Modeling and analysis
Modeling and analysisModeling and analysis
Modeling and analysis
 
Design of experiments BY Minitab
Design of experiments BY MinitabDesign of experiments BY Minitab
Design of experiments BY Minitab
 
How to Think Like A Statistician
How to Think Like A StatisticianHow to Think Like A Statistician
How to Think Like A Statistician
 
Data analysis plan in medicine and nurse.pptx
Data analysis plan in medicine and nurse.pptxData analysis plan in medicine and nurse.pptx
Data analysis plan in medicine and nurse.pptx
 
Hm306 week 1 ppt 1
Hm306 week 1 ppt 1Hm306 week 1 ppt 1
Hm306 week 1 ppt 1
 
Hm306 week 1 ppt A
Hm306 week 1 ppt AHm306 week 1 ppt A
Hm306 week 1 ppt A
 
Data Analysis
Data AnalysisData Analysis
Data Analysis
 
Data Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisionsData Refinement: The missing link between data collection and decisions
Data Refinement: The missing link between data collection and decisions
 
Data Analysis and Synthesis & Techniques of System.pptx
Data Analysis and Synthesis & Techniques of System.pptxData Analysis and Synthesis & Techniques of System.pptx
Data Analysis and Synthesis & Techniques of System.pptx
 
UNIT 4.pptx
UNIT 4.pptxUNIT 4.pptx
UNIT 4.pptx
 
Data warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniquesData warehouse 16 data analysis techniques
Data warehouse 16 data analysis techniques
 

Mehr von Jennifer Morrow

Using Collaborative and Expressive Writing Activities to Educate First-Year S...
Using Collaborative and Expressive Writing Activities to Educate First-Year S...Using Collaborative and Expressive Writing Activities to Educate First-Year S...
Using Collaborative and Expressive Writing Activities to Educate First-Year S...Jennifer Morrow
 
Preparing to go on the job market: Strategies for academic and non-academic j...
Preparing to go on the job market: Strategies for academic and non-academic j...Preparing to go on the job market: Strategies for academic and non-academic j...
Preparing to go on the job market: Strategies for academic and non-academic j...Jennifer Morrow
 
Collecting Longitudinal Evaluation Data in a College Setting
Collecting Longitudinal Evaluation Data in a College SettingCollecting Longitudinal Evaluation Data in a College Setting
Collecting Longitudinal Evaluation Data in a College SettingJennifer Morrow
 
What is program evaluation lecture 100207 [compatibility mode]
What is program evaluation lecture   100207 [compatibility mode]What is program evaluation lecture   100207 [compatibility mode]
What is program evaluation lecture 100207 [compatibility mode]Jennifer Morrow
 
APA Version 6 Quick Guide
APA Version 6 Quick GuideAPA Version 6 Quick Guide
APA Version 6 Quick GuideJennifer Morrow
 
How to create multiple choice questions
How to create multiple choice questionsHow to create multiple choice questions
How to create multiple choice questionsJennifer Morrow
 

Mehr von Jennifer Morrow (6)

Using Collaborative and Expressive Writing Activities to Educate First-Year S...
Using Collaborative and Expressive Writing Activities to Educate First-Year S...Using Collaborative and Expressive Writing Activities to Educate First-Year S...
Using Collaborative and Expressive Writing Activities to Educate First-Year S...
 
Preparing to go on the job market: Strategies for academic and non-academic j...
Preparing to go on the job market: Strategies for academic and non-academic j...Preparing to go on the job market: Strategies for academic and non-academic j...
Preparing to go on the job market: Strategies for academic and non-academic j...
 
Collecting Longitudinal Evaluation Data in a College Setting
Collecting Longitudinal Evaluation Data in a College SettingCollecting Longitudinal Evaluation Data in a College Setting
Collecting Longitudinal Evaluation Data in a College Setting
 
What is program evaluation lecture 100207 [compatibility mode]
What is program evaluation lecture   100207 [compatibility mode]What is program evaluation lecture   100207 [compatibility mode]
What is program evaluation lecture 100207 [compatibility mode]
 
APA Version 6 Quick Guide
APA Version 6 Quick GuideAPA Version 6 Quick Guide
APA Version 6 Quick Guide
 
How to create multiple choice questions
How to create multiple choice questionsHow to create multiple choice questions
How to create multiple choice questions
 

Kürzlich hochgeladen

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 

Kürzlich hochgeladen (20)

Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 

Brief Introduction to the 12 Steps of Evaluation Data Cleaning

  • 1. A Brief Introduction to the 12 Steps of Evaluation Data Cleaning Jennifer Ann Morrow, Ph.D. University of Tennessee
  • 2. Importance of Cleaning Data • As evaluators we need our evaluation data to be: – Accurate – Complete – High quality – Reliable – Unbiased – Valid
  • 3. If We Don’t Clean Our Data • Problems that can occur: – Inaccurate/biased conclusions – Increased error – Reduced credibility – Reduced generalizability – Violation of statistical assumptions
  • 4. 1: Create a Data Codebook • Contains all relevant information for your evaluation project • Suggestions for what to include: – Electronic file names – Variable names, variable labels, value labels – Complete list of modified variables – Citations for instrument sources – Project diary
  • 5. 2: Create a Data Analysis Plan • Your analysis plan should list each step you will take when analyzing your data • Suggestions for what to include: – General instructions for data analysts – List of datasets – Evaluation questions – Variables used for each analysis – Specific analyses and graphics for each evaluation question
  • 6. 3: Perform Initial Frequencies – Round 1 • After organizing your codebook and analysis plan you can now begin to start the data cleaning process • Conduct frequency analyses (frequencies, percentages) for EVERY variable in your evaluation dataset • Suggestion: – request a graphic (bar chart or histogram) for each variable
  • 7. 4: Check for Coding Mistakes • Coding errors are any values that are not within the specified range for your variable (e.g., you have a rating scale from 1-5 and you have a value of 9) • Suggestions: – Compare all values to what is listed in your codebook – In many cases errors are unspecified missing data values
  • 8. 5: Modify and Create Variables • It is now time to modify your variables so they can be used in your planned analyses • Suggestions: – Reverse code any variables that need to be merged with others that are on the opposite scale – Recode any variables to match your codebook – Create new variables (e.g., averages, total scores) to be used for future analyses
  • 9. 6: Frequencies and Descriptives – Round 2 • At this step you conduct frequency analyses on every variable and descriptive analyses on every continuous variable • Suggestions: – Review the following descriptives: mean, median, mode, standard deviation, skewness, kurtosis, minimum, and maximum – Create standardized scores (i.e., Z-scores) for every continuous variable
  • 10. 7: Search for Outliers • Review your standardized scores and histograms to check for outliers • Outliers are scores that deviate greatly from the mean (e.g., >/3.29/ standard deviations) and potentially can create or cover up statistical significance • Suggestions: – delete, transform, or alter (winsorize, trim, modify) your outliers
  • 11. 8: Assess for Normality • For many inferential statistics (e.g., analyses of variance, regressions) your outcome (dependent) variable should be normally distributed (i.e., mean=median=mode) • Suggestions: – check to see if the values of your skewness and kurtosis are greater than /2/ – Transform the variable, use a non-parametric analysis, or modify your alpha level
  • 12. 9: Dealing with Missing Data • You should always check to see if missing data is random or non-random (i.e., patterns of missing data) • Evaluation results can be misleading and less generalizable • Suggestions: – Delete cases/variables with missing data, estimate missing data, conduct analyses with and without modifying variables
  • 13. 10: Examine Cell Sample Size • For many of our analyses (e.g., group difference statistics) we want to have equal sample sizes in our cells of our design • Unequal sample sizes lead to lower statistical power and reduced generalizability • Suggestions: – Collapse categories within a variable, use a non- parametric analysis, or apply a more stringent alpha level
  • 14. 11: Frequencies and Descriptives – The Finale • Your data is now cleaned and ready to be summarized! • Conduct a final set of frequencies and descriptives prior to conducting your inferential statistics • Suggestion: – Use a variety of graphics and visual aids to showcase your evaluation data for your clients
  • 15. 12: Assumption Testing • For some inferential statistics (e.g., correlational analyses, group difference analyses) you still need to address a few additional assumptions in order to conduct the analysis • Suggestions: – Some common assumptions are: homogeneity of variance, linearity, independence of errors, multicollinearity, and reliability
  • 16. Some Helpful Resources • YouTube videos – http://www.youtube.com/watch?v=R6Cc5flsbsw – http://www.youtube.com/watch? v=5qhLDYr70MM&feature=channel&list=UL • Websites – http://clinistat.hk/internetresource.php – http://pareonline.net/getvn.asp?v=9&n=6 • Software – http://www.gnu.org/software/pspp/ – http://davidmlane.com/hyperstat/Statistical_analyses.html
  • 17. Contact Information Jennifer Ann Morrow, Ph.D. Associate Professor of Evaluation, Statistics, and Measurement Department of Educational Psychology and Counseling University of Tennessee Knoxville, TN 37996-3452 Email: jamorrow@utk.edu http://web.utk.edu/~edpsych/eval_assessment/default.html