SlideShare ist ein Scribd-Unternehmen logo
1 von 29
David DeGeest 06J:278 Staffing April 13, 2010 SJTs: Avoiding a Grand Failure
Why are people so interested? The Big Picture of SJT lit
The claims about SJTs “They seem to represent psychometric alchemy” (Landy, 2007). Adverse impact is down, validity up Assessees like them Seem to address relevant KSAOs They assess soft skills and tacit knowledge They provide incremental validity above GMA and personality for predicting college GPA (Oswald et al., 2004) Some SJTs have demonstrated criterion-related validities as high as r=.36 (Pereira and Harve, 1999) They measure tacit knowledge and “non-academic intelligence” (Sternberg et al., 1995)
What is an SJT? Situational Judgment Tests (SJTs) or Inventories (SJIs) are psychological tests which offer respondents realistic, hypothetical scenarios and ask for an appropriate response. SJTs are often-identified as a type of low-fidelity situation (Motowildo, 1990). SJTs can be designed to be predictive of job performance, managerial ability, integrity, personality, and apparently other measures or constructs.
Example of an SJT item from Becker (2005) 11. You’re retiring from a successful business that you started, and must now decide who will replace you. Two of your children want the position and would probably do a fine job. However, three non-family employees are more qualified. Who would you most likely put in charge? A. The best performing non-family member, because the most qualified person deserves the job.  B. The lowest performing non-family member, because this won’t hurt your children’s feelings.  C. The highest performing child, because you have the right to do what is best for your kids.  D. The child you love the most, as long as he or she is able to do the job.
History of SJTs First recorded SJT: George Washington University Social Intelligence Test (1926) Some usage during WWII by military psychologists (1990): Motowildo’s research resurrected interest in SJTs Idea of the “low fidelity simulation” Commonly used now in industry as “customized” tool for organizations, consultants, etc. Takeaway: There is a lot of perceived promise and sunk cost in SJT research.
What the heck is an SJT? Construct Validity and the Development of SJTs
Item Characteristics	 McDaniel et al (2005) claims that SJTs have eight differentiating characteristics: Test fidelity Stem length Stem complexity Stem comprehensibility Nested stems Nature of responses  Response instructions Degree of item heterogeneity No proscribed standards to develop an SJT
Item Characteristics Examples of response options: What is the best answer? What would you most likely do? Rate each response for effectiveness Rate each response on likelihood you would engage in behavior Knowledge v. Behavioral Tendency Dichotomization Issue
Item Characteristics and Construct Validity Construct Heterogeneity Most items tend to correlate with GMA, Agreeableness, Conscientiousness, or Emotional Stability (McDaniel, 2005) Ployhart and Erhart (2003) notes that multiple constructs measured with SJTs makes it hard to measure differences across studies Takeaway: SJTs are best described as a method, not a construct (Schmitt & Chan, 2006)
Exciting finds from research on SJTs The promise of SJTs
Generalizability McDaniel et al. (2007) meta-analytically demonstrated SJTs have incremental validity of .03 to .05 over GMA .06 to .07 over Big Five .01 to .02 over GMA/Big Five composite McDaniel et al. (2001) showed that SJTs are generalizable as predictors of job performance 90% CV did not contain zero in the meta Potosky et al. (2004) showed that a .84 score-equivalence correlation between an SJT administered via paper-and-pencil and the Internet No effects based on beliefs in computer efficacy Takeaway: multiple metas have demonstrated the generalizability of SJTs in predicting job performance.
Variability in SJTs Lievens and Sackett (2006) showed that video-based SJTs for interpersonal skills have more validity than written SJTs. McDaniel et al. (2007) showed that reliabilities for SJTs can range from .63 to .88 The meta refers to alpha, but other reliability measures matter Takeaway: Effects of variations in level of fidelity offer interesting possibilities for research.
Assessment Reactions and Face Validity Chan & Schmitt (1997) showed that B-W differences in test performance and face validity reactions were lower for video-based SJTs than pencil-and-paper tests Race X Method interaction attributable to reading comprehension differences in subgroups Increasing fidelity increased mean performance on SJT Chan (1997) showed that paper-and-pencil SJTs  are more consistent  with beliefs, values, and expectations of whites. Moving to video-based SJT increased validity perceptions for both whites and blacks Bauer and Truxillo (2006) asserts that SJTs always have better face validity than do cognitive and personality measures. Takeaways: SJTs are useful in terms of face validity and justice perceptions, particularly high-fidelity (video) simulations.
The trouble with… Problems with SJTs
The problem of g ,[object Object]
Peeters and Lievens (2005) found that faking-good instructions produced differences in means and criterion-related validities across subgroups.
“Specifically, the fakability of SJTs might depend on their correlation with one particular construct, namely, cognitive ability (see Nguyen &McDaniel, 2001).” (p.73)
Schmidt and Hunter (2003) found low discriminant validity between SJTs and job knowledge tests.
Retesting Issue
“If subgroup differences on a test exist, policies that permit retests by candidates who were unsuccessful on the test might inflate calculations of adverse impact.” (Lievens et al., 2005, p. 1005)Takeaway: If the degree of fakibility of an SJT depends on its GMA load, SJTs might just be contaminated g tests or low-reliability job knowledge tests.
Faking Nguyen et al. (2005) found that d=.34 for honest instructions and d=.15 for faking Ployhart and Erhart (2003) also note that behavioral response instructions are both more prone to faking and have problematic more reliability issues Hooper et al. (2006) notes that the fragmentation of the literature has made a meta-analytic study of this issue impossible.
Response Instructions Ployhart and Erhart (2003) found that response instructions had dramatic effects on validity, reliability, and performance for SJTs Showed that dimensionality of an SJT is crucial to determining the reliability estimate to use. McDaniel et al. (2007) found meaningful differences between means for tests with different behavioral and knowledge instructions. Lievens and Sackett (2009) found no meaningful differences between means in a high-stakes testing environment with med school applicants. Last two studies found that knowledge instructions for an SJT increased the scores’ correlation with a GMA measure Takeaway: meta-analytic integration of these results is needed, but the primary research has yet to support this.
What is the reliability for an SJT? Bess (2001) points out the elephant in the room: “SJTs by definition are multidimensional and therefore internal consistency is not an appropriate measure of reliability” (p.29) Schmitt and Chan (1997) also notes this problem. Examples of reliability estimates: Ployhart and Erhart (2003) Used split-half estimates to get .67 and .68 reliabilities. Lievens and Sackett (2009) found low alphas for their SJT (.55-.56) Lievens and Sackett (2007) noted generating alternate forms is difficult for SJTs, given the contextual specificity of items. This means parallel forms reliability is a non-practical measure. Takeaway: no one is quite sure how to systematically assess reliabilities for SJT measures
What do we know about SJTs? Conclusions
Things we know fairly clearly SJTs are primarily a method, not a construct. SJTs have demonstrated generalizable meta-analytic incremental validity over GMA and Big 5 single and composite measures in predicting job performance Most SJTs are correlated with GMA to a varying extent and share some benefits and disadvantages with GMA SJTs often correlate with the Big 3
McDaniel et al. (2006) integrated model for SJT
Where do researchers go from here?  Practitioners? Future Directions for SJTs
Ployhart & Weekly (2006) Agenda for Research Construct Validity Correlates are known, but nomological net uncertain SJTs targeted to constructs: the “holy grail” (p.348) What exactly is “judgment?” Understanding SJT structure How do we build SJTs to get construct homogeneity? How do we enhance the reliability of these measures? More Experimentation/Micro Research Correlation studies and metas show generalizability Experimental studies can enhance understanding

Weitere ähnliche Inhalte

Andere mochten auch

Thesis Defense - Tommy May
Thesis Defense - Tommy MayThesis Defense - Tommy May
Thesis Defense - Tommy MayTommy May
 
An Introduction to Saville Analysis Aptitude
An Introduction to Saville Analysis Aptitude An Introduction to Saville Analysis Aptitude
An Introduction to Saville Analysis Aptitude JobTestPrep
 
Volunteer Fire Brigade Training Module 3 firefighting apparatus, tools and eq...
Volunteer Fire Brigade Training Module 3 firefighting apparatus, tools and eq...Volunteer Fire Brigade Training Module 3 firefighting apparatus, tools and eq...
Volunteer Fire Brigade Training Module 3 firefighting apparatus, tools and eq...Ronaldo Pioquinto
 
Types of Firefighting Equipment
Types of Firefighting EquipmentTypes of Firefighting Equipment
Types of Firefighting Equipmenttradeford
 
Hand and Power Tools
Hand and Power ToolsHand and Power Tools
Hand and Power Toolsshoptools
 
Workshop technology hand basic tools
Workshop technology hand basic toolsWorkshop technology hand basic tools
Workshop technology hand basic toolsnanishajieha
 
Situational Judgement Test - Preparation Guide Part #1
Situational Judgement Test - Preparation Guide Part #1Situational Judgement Test - Preparation Guide Part #1
Situational Judgement Test - Preparation Guide Part #1JobTestPrep
 
Series & Parallel
Series & ParallelSeries & Parallel
Series & Parallelfourangela
 
Electric Circuits Ppt Slides
Electric Circuits Ppt SlidesElectric Circuits Ppt Slides
Electric Circuits Ppt Slidesguest5e66ab3
 

Andere mochten auch (9)

Thesis Defense - Tommy May
Thesis Defense - Tommy MayThesis Defense - Tommy May
Thesis Defense - Tommy May
 
An Introduction to Saville Analysis Aptitude
An Introduction to Saville Analysis Aptitude An Introduction to Saville Analysis Aptitude
An Introduction to Saville Analysis Aptitude
 
Volunteer Fire Brigade Training Module 3 firefighting apparatus, tools and eq...
Volunteer Fire Brigade Training Module 3 firefighting apparatus, tools and eq...Volunteer Fire Brigade Training Module 3 firefighting apparatus, tools and eq...
Volunteer Fire Brigade Training Module 3 firefighting apparatus, tools and eq...
 
Types of Firefighting Equipment
Types of Firefighting EquipmentTypes of Firefighting Equipment
Types of Firefighting Equipment
 
Hand and Power Tools
Hand and Power ToolsHand and Power Tools
Hand and Power Tools
 
Workshop technology hand basic tools
Workshop technology hand basic toolsWorkshop technology hand basic tools
Workshop technology hand basic tools
 
Situational Judgement Test - Preparation Guide Part #1
Situational Judgement Test - Preparation Guide Part #1Situational Judgement Test - Preparation Guide Part #1
Situational Judgement Test - Preparation Guide Part #1
 
Series & Parallel
Series & ParallelSeries & Parallel
Series & Parallel
 
Electric Circuits Ppt Slides
Electric Circuits Ppt SlidesElectric Circuits Ppt Slides
Electric Circuits Ppt Slides
 

Ähnlich wie Avoiding A Grand Failure

The Importance Of A Family Intervention For Heart Failure...
The Importance Of A Family Intervention For Heart Failure...The Importance Of A Family Intervention For Heart Failure...
The Importance Of A Family Intervention For Heart Failure...Paula Smith
 
Gamification in recruitment and selection
Gamification in recruitment and selectionGamification in recruitment and selection
Gamification in recruitment and selectionIoannis Nikolaou
 
Educational Research 102: Selecting the Best Study Design for your Research Q...
Educational Research 102: Selecting the Best Study Design for your Research Q...Educational Research 102: Selecting the Best Study Design for your Research Q...
Educational Research 102: Selecting the Best Study Design for your Research Q...fnuthalapaty
 
Designing for Resilience and Compassionate Action
Designing for Resilience and Compassionate ActionDesigning for Resilience and Compassionate Action
Designing for Resilience and Compassionate ActionHopeLab
 
Group 12_ Qualitative Data Analysis.pptx
Group 12_ Qualitative Data Analysis.pptxGroup 12_ Qualitative Data Analysis.pptx
Group 12_ Qualitative Data Analysis.pptxogie6
 
Notes for question please no plag use references to cite wk 2 .docx
Notes for question please no plag use references to cite wk 2 .docxNotes for question please no plag use references to cite wk 2 .docx
Notes for question please no plag use references to cite wk 2 .docxcherishwinsland
 
The Developmental Coordination Disorder Questionnaire
The Developmental Coordination Disorder QuestionnaireThe Developmental Coordination Disorder Questionnaire
The Developmental Coordination Disorder QuestionnaireMandy Cross
 
Chapter 3 CONSTRUCT-IRRELEVANT VARIANCE IN AC.docx
Chapter 3    CONSTRUCT-IRRELEVANT VARIANCE IN AC.docxChapter 3    CONSTRUCT-IRRELEVANT VARIANCE IN AC.docx
Chapter 3 CONSTRUCT-IRRELEVANT VARIANCE IN AC.docxchristinemaritza
 
Relationships Among Classical Test Theory and Item Response Theory Frameworks...
Relationships Among Classical Test Theory and Item Response Theory Frameworks...Relationships Among Classical Test Theory and Item Response Theory Frameworks...
Relationships Among Classical Test Theory and Item Response Theory Frameworks...AnusornKoedsri3
 
Correlation and Regression Study.docx
Correlation and Regression Study.docxCorrelation and Regression Study.docx
Correlation and Regression Study.docxsdfghj21
 
Evaluation19(3) 321 –332© The Author(s) 2013 Reprints .docx
Evaluation19(3) 321 –332© The Author(s) 2013 Reprints .docxEvaluation19(3) 321 –332© The Author(s) 2013 Reprints .docx
Evaluation19(3) 321 –332© The Author(s) 2013 Reprints .docxelbanglis
 
Detail the overall research design in the Ohio Lottery case (See E
Detail the overall research design in the Ohio Lottery case (See EDetail the overall research design in the Ohio Lottery case (See E
Detail the overall research design in the Ohio Lottery case (See ELinaCovington707
 
ANALYTIC GENERALIZATION VALIDATING THEORIES THROUGH RESEARCH BY MANAGEMENT P...
ANALYTIC GENERALIZATION  VALIDATING THEORIES THROUGH RESEARCH BY MANAGEMENT P...ANALYTIC GENERALIZATION  VALIDATING THEORIES THROUGH RESEARCH BY MANAGEMENT P...
ANALYTIC GENERALIZATION VALIDATING THEORIES THROUGH RESEARCH BY MANAGEMENT P...Mary Calkins
 
Bokums et al_2012_psych_v3_n5_406_409
Bokums et al_2012_psych_v3_n5_406_409Bokums et al_2012_psych_v3_n5_406_409
Bokums et al_2012_psych_v3_n5_406_409Cassio Meira Jr.
 
A COGNITIVE ANALYSIS OF SOLUTIONS FOR VERBAL, INFORMAL, AND FORMAL-DEDUCTIVE ...
A COGNITIVE ANALYSIS OF SOLUTIONS FOR VERBAL, INFORMAL, AND FORMAL-DEDUCTIVE ...A COGNITIVE ANALYSIS OF SOLUTIONS FOR VERBAL, INFORMAL, AND FORMAL-DEDUCTIVE ...
A COGNITIVE ANALYSIS OF SOLUTIONS FOR VERBAL, INFORMAL, AND FORMAL-DEDUCTIVE ...Tiffany Daniels
 
College of Doctoral StudiesExpanded Comparison.docx
                College of Doctoral StudiesExpanded Comparison.docx                College of Doctoral StudiesExpanded Comparison.docx
College of Doctoral StudiesExpanded Comparison.docxjoyjonna282
 
GMU Preapplication and Competencies (NEAAHP 2011)
GMU Preapplication and Competencies (NEAAHP 2011)GMU Preapplication and Competencies (NEAAHP 2011)
GMU Preapplication and Competencies (NEAAHP 2011)Emil Chuck
 
Face Construct And Criterion-Related Validity Essay
Face Construct And Criterion-Related Validity EssayFace Construct And Criterion-Related Validity Essay
Face Construct And Criterion-Related Validity EssayDeb Birch
 
Bijker, M. (2010) Making Measures And Inferences Reserve
Bijker, M. (2010)   Making Measures And Inferences ReserveBijker, M. (2010)   Making Measures And Inferences Reserve
Bijker, M. (2010) Making Measures And Inferences ReserveFontys University
 
Bijker, M. (2010) Making Measures And Inferences Reserve
Bijker, M. (2010)   Making Measures And Inferences ReserveBijker, M. (2010)   Making Measures And Inferences Reserve
Bijker, M. (2010) Making Measures And Inferences ReserveFontys University
 

Ähnlich wie Avoiding A Grand Failure (20)

The Importance Of A Family Intervention For Heart Failure...
The Importance Of A Family Intervention For Heart Failure...The Importance Of A Family Intervention For Heart Failure...
The Importance Of A Family Intervention For Heart Failure...
 
Gamification in recruitment and selection
Gamification in recruitment and selectionGamification in recruitment and selection
Gamification in recruitment and selection
 
Educational Research 102: Selecting the Best Study Design for your Research Q...
Educational Research 102: Selecting the Best Study Design for your Research Q...Educational Research 102: Selecting the Best Study Design for your Research Q...
Educational Research 102: Selecting the Best Study Design for your Research Q...
 
Designing for Resilience and Compassionate Action
Designing for Resilience and Compassionate ActionDesigning for Resilience and Compassionate Action
Designing for Resilience and Compassionate Action
 
Group 12_ Qualitative Data Analysis.pptx
Group 12_ Qualitative Data Analysis.pptxGroup 12_ Qualitative Data Analysis.pptx
Group 12_ Qualitative Data Analysis.pptx
 
Notes for question please no plag use references to cite wk 2 .docx
Notes for question please no plag use references to cite wk 2 .docxNotes for question please no plag use references to cite wk 2 .docx
Notes for question please no plag use references to cite wk 2 .docx
 
The Developmental Coordination Disorder Questionnaire
The Developmental Coordination Disorder QuestionnaireThe Developmental Coordination Disorder Questionnaire
The Developmental Coordination Disorder Questionnaire
 
Chapter 3 CONSTRUCT-IRRELEVANT VARIANCE IN AC.docx
Chapter 3    CONSTRUCT-IRRELEVANT VARIANCE IN AC.docxChapter 3    CONSTRUCT-IRRELEVANT VARIANCE IN AC.docx
Chapter 3 CONSTRUCT-IRRELEVANT VARIANCE IN AC.docx
 
Relationships Among Classical Test Theory and Item Response Theory Frameworks...
Relationships Among Classical Test Theory and Item Response Theory Frameworks...Relationships Among Classical Test Theory and Item Response Theory Frameworks...
Relationships Among Classical Test Theory and Item Response Theory Frameworks...
 
Correlation and Regression Study.docx
Correlation and Regression Study.docxCorrelation and Regression Study.docx
Correlation and Regression Study.docx
 
Evaluation19(3) 321 –332© The Author(s) 2013 Reprints .docx
Evaluation19(3) 321 –332© The Author(s) 2013 Reprints .docxEvaluation19(3) 321 –332© The Author(s) 2013 Reprints .docx
Evaluation19(3) 321 –332© The Author(s) 2013 Reprints .docx
 
Detail the overall research design in the Ohio Lottery case (See E
Detail the overall research design in the Ohio Lottery case (See EDetail the overall research design in the Ohio Lottery case (See E
Detail the overall research design in the Ohio Lottery case (See E
 
ANALYTIC GENERALIZATION VALIDATING THEORIES THROUGH RESEARCH BY MANAGEMENT P...
ANALYTIC GENERALIZATION  VALIDATING THEORIES THROUGH RESEARCH BY MANAGEMENT P...ANALYTIC GENERALIZATION  VALIDATING THEORIES THROUGH RESEARCH BY MANAGEMENT P...
ANALYTIC GENERALIZATION VALIDATING THEORIES THROUGH RESEARCH BY MANAGEMENT P...
 
Bokums et al_2012_psych_v3_n5_406_409
Bokums et al_2012_psych_v3_n5_406_409Bokums et al_2012_psych_v3_n5_406_409
Bokums et al_2012_psych_v3_n5_406_409
 
A COGNITIVE ANALYSIS OF SOLUTIONS FOR VERBAL, INFORMAL, AND FORMAL-DEDUCTIVE ...
A COGNITIVE ANALYSIS OF SOLUTIONS FOR VERBAL, INFORMAL, AND FORMAL-DEDUCTIVE ...A COGNITIVE ANALYSIS OF SOLUTIONS FOR VERBAL, INFORMAL, AND FORMAL-DEDUCTIVE ...
A COGNITIVE ANALYSIS OF SOLUTIONS FOR VERBAL, INFORMAL, AND FORMAL-DEDUCTIVE ...
 
College of Doctoral StudiesExpanded Comparison.docx
                College of Doctoral StudiesExpanded Comparison.docx                College of Doctoral StudiesExpanded Comparison.docx
College of Doctoral StudiesExpanded Comparison.docx
 
GMU Preapplication and Competencies (NEAAHP 2011)
GMU Preapplication and Competencies (NEAAHP 2011)GMU Preapplication and Competencies (NEAAHP 2011)
GMU Preapplication and Competencies (NEAAHP 2011)
 
Face Construct And Criterion-Related Validity Essay
Face Construct And Criterion-Related Validity EssayFace Construct And Criterion-Related Validity Essay
Face Construct And Criterion-Related Validity Essay
 
Bijker, M. (2010) Making Measures And Inferences Reserve
Bijker, M. (2010)   Making Measures And Inferences ReserveBijker, M. (2010)   Making Measures And Inferences Reserve
Bijker, M. (2010) Making Measures And Inferences Reserve
 
Bijker, M. (2010) Making Measures And Inferences Reserve
Bijker, M. (2010)   Making Measures And Inferences ReserveBijker, M. (2010)   Making Measures And Inferences Reserve
Bijker, M. (2010) Making Measures And Inferences Reserve
 

Kürzlich hochgeladen

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 

Kürzlich hochgeladen (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 

Avoiding A Grand Failure

  • 1. David DeGeest 06J:278 Staffing April 13, 2010 SJTs: Avoiding a Grand Failure
  • 2. Why are people so interested? The Big Picture of SJT lit
  • 3. The claims about SJTs “They seem to represent psychometric alchemy” (Landy, 2007). Adverse impact is down, validity up Assessees like them Seem to address relevant KSAOs They assess soft skills and tacit knowledge They provide incremental validity above GMA and personality for predicting college GPA (Oswald et al., 2004) Some SJTs have demonstrated criterion-related validities as high as r=.36 (Pereira and Harve, 1999) They measure tacit knowledge and “non-academic intelligence” (Sternberg et al., 1995)
  • 4. What is an SJT? Situational Judgment Tests (SJTs) or Inventories (SJIs) are psychological tests which offer respondents realistic, hypothetical scenarios and ask for an appropriate response. SJTs are often-identified as a type of low-fidelity situation (Motowildo, 1990). SJTs can be designed to be predictive of job performance, managerial ability, integrity, personality, and apparently other measures or constructs.
  • 5. Example of an SJT item from Becker (2005) 11. You’re retiring from a successful business that you started, and must now decide who will replace you. Two of your children want the position and would probably do a fine job. However, three non-family employees are more qualified. Who would you most likely put in charge? A. The best performing non-family member, because the most qualified person deserves the job. B. The lowest performing non-family member, because this won’t hurt your children’s feelings. C. The highest performing child, because you have the right to do what is best for your kids. D. The child you love the most, as long as he or she is able to do the job.
  • 6. History of SJTs First recorded SJT: George Washington University Social Intelligence Test (1926) Some usage during WWII by military psychologists (1990): Motowildo’s research resurrected interest in SJTs Idea of the “low fidelity simulation” Commonly used now in industry as “customized” tool for organizations, consultants, etc. Takeaway: There is a lot of perceived promise and sunk cost in SJT research.
  • 7. What the heck is an SJT? Construct Validity and the Development of SJTs
  • 8. Item Characteristics McDaniel et al (2005) claims that SJTs have eight differentiating characteristics: Test fidelity Stem length Stem complexity Stem comprehensibility Nested stems Nature of responses Response instructions Degree of item heterogeneity No proscribed standards to develop an SJT
  • 9. Item Characteristics Examples of response options: What is the best answer? What would you most likely do? Rate each response for effectiveness Rate each response on likelihood you would engage in behavior Knowledge v. Behavioral Tendency Dichotomization Issue
  • 10. Item Characteristics and Construct Validity Construct Heterogeneity Most items tend to correlate with GMA, Agreeableness, Conscientiousness, or Emotional Stability (McDaniel, 2005) Ployhart and Erhart (2003) notes that multiple constructs measured with SJTs makes it hard to measure differences across studies Takeaway: SJTs are best described as a method, not a construct (Schmitt & Chan, 2006)
  • 11. Exciting finds from research on SJTs The promise of SJTs
  • 12. Generalizability McDaniel et al. (2007) meta-analytically demonstrated SJTs have incremental validity of .03 to .05 over GMA .06 to .07 over Big Five .01 to .02 over GMA/Big Five composite McDaniel et al. (2001) showed that SJTs are generalizable as predictors of job performance 90% CV did not contain zero in the meta Potosky et al. (2004) showed that a .84 score-equivalence correlation between an SJT administered via paper-and-pencil and the Internet No effects based on beliefs in computer efficacy Takeaway: multiple metas have demonstrated the generalizability of SJTs in predicting job performance.
  • 13. Variability in SJTs Lievens and Sackett (2006) showed that video-based SJTs for interpersonal skills have more validity than written SJTs. McDaniel et al. (2007) showed that reliabilities for SJTs can range from .63 to .88 The meta refers to alpha, but other reliability measures matter Takeaway: Effects of variations in level of fidelity offer interesting possibilities for research.
  • 14. Assessment Reactions and Face Validity Chan & Schmitt (1997) showed that B-W differences in test performance and face validity reactions were lower for video-based SJTs than pencil-and-paper tests Race X Method interaction attributable to reading comprehension differences in subgroups Increasing fidelity increased mean performance on SJT Chan (1997) showed that paper-and-pencil SJTs are more consistent with beliefs, values, and expectations of whites. Moving to video-based SJT increased validity perceptions for both whites and blacks Bauer and Truxillo (2006) asserts that SJTs always have better face validity than do cognitive and personality measures. Takeaways: SJTs are useful in terms of face validity and justice perceptions, particularly high-fidelity (video) simulations.
  • 15. The trouble with… Problems with SJTs
  • 16.
  • 17. Peeters and Lievens (2005) found that faking-good instructions produced differences in means and criterion-related validities across subgroups.
  • 18. “Specifically, the fakability of SJTs might depend on their correlation with one particular construct, namely, cognitive ability (see Nguyen &McDaniel, 2001).” (p.73)
  • 19. Schmidt and Hunter (2003) found low discriminant validity between SJTs and job knowledge tests.
  • 21. “If subgroup differences on a test exist, policies that permit retests by candidates who were unsuccessful on the test might inflate calculations of adverse impact.” (Lievens et al., 2005, p. 1005)Takeaway: If the degree of fakibility of an SJT depends on its GMA load, SJTs might just be contaminated g tests or low-reliability job knowledge tests.
  • 22. Faking Nguyen et al. (2005) found that d=.34 for honest instructions and d=.15 for faking Ployhart and Erhart (2003) also note that behavioral response instructions are both more prone to faking and have problematic more reliability issues Hooper et al. (2006) notes that the fragmentation of the literature has made a meta-analytic study of this issue impossible.
  • 23. Response Instructions Ployhart and Erhart (2003) found that response instructions had dramatic effects on validity, reliability, and performance for SJTs Showed that dimensionality of an SJT is crucial to determining the reliability estimate to use. McDaniel et al. (2007) found meaningful differences between means for tests with different behavioral and knowledge instructions. Lievens and Sackett (2009) found no meaningful differences between means in a high-stakes testing environment with med school applicants. Last two studies found that knowledge instructions for an SJT increased the scores’ correlation with a GMA measure Takeaway: meta-analytic integration of these results is needed, but the primary research has yet to support this.
  • 24. What is the reliability for an SJT? Bess (2001) points out the elephant in the room: “SJTs by definition are multidimensional and therefore internal consistency is not an appropriate measure of reliability” (p.29) Schmitt and Chan (1997) also notes this problem. Examples of reliability estimates: Ployhart and Erhart (2003) Used split-half estimates to get .67 and .68 reliabilities. Lievens and Sackett (2009) found low alphas for their SJT (.55-.56) Lievens and Sackett (2007) noted generating alternate forms is difficult for SJTs, given the contextual specificity of items. This means parallel forms reliability is a non-practical measure. Takeaway: no one is quite sure how to systematically assess reliabilities for SJT measures
  • 25. What do we know about SJTs? Conclusions
  • 26. Things we know fairly clearly SJTs are primarily a method, not a construct. SJTs have demonstrated generalizable meta-analytic incremental validity over GMA and Big 5 single and composite measures in predicting job performance Most SJTs are correlated with GMA to a varying extent and share some benefits and disadvantages with GMA SJTs often correlate with the Big 3
  • 27. McDaniel et al. (2006) integrated model for SJT
  • 28. Where do researchers go from here? Practitioners? Future Directions for SJTs
  • 29. Ployhart & Weekly (2006) Agenda for Research Construct Validity Correlates are known, but nomological net uncertain SJTs targeted to constructs: the “holy grail” (p.348) What exactly is “judgment?” Understanding SJT structure How do we build SJTs to get construct homogeneity? How do we enhance the reliability of these measures? More Experimentation/Micro Research Correlation studies and metas show generalizability Experimental studies can enhance understanding
  • 30. Ployhart & Weekly (2006) Agenda for Research Need for Theoretical Development Will help to integrate SJTs in mainline I/O research Theory of situation perception and judgment research The limits of SJTs Little is known about applicant conditions for SJTs Generalizability in international contexts? Expansion of org context for SJTs Possibility of use in training and development contexts (Fritzche et al., 2006) Use in team contexts (Mumford et al., 2006)
  • 31. Other possibilities Personality and Self-Reports Ployhart and Ryan (2004) proposed integrating personality measures with an SJT to predict customer service orientation Hogan (2005)’s work suggests that it may be possible to build a conscientiousness measure via an SJT that is more resistant to faking than current self-reports. Bledow et Freese (2003)’s work also supports using SJTs to create non-self-report measures of constructs like personal drive or initiative. Teams Mumford et. al (2006)’s work suggests building an SJT that would ably predict a “team player” mentality.
  • 33. Bauxer T.N.; and Truxillo, D.M. (2006). Applicant reactions to situational judgment tests: research and related practical issues. In Situational Judgement Tests: Theory, Measurement, and Application. Becker, Thomas E. (2005). Development and Validation of a Situational Judgment Test of Employee Integrity. International Journal of Selection and Assessment, 13(3), 225-232. Bess, T.L. (2001). Exploring the dimensionality of situational judgment: task and contextual knowledge. Unpublished Master’s Thesis. Accessesed 4/08/10 at: http://scholar.lib.vt.edu/theses/available/etd-04122001-183219/unrestricted/sjtdimensionality.pdf Bledow, Ronald and Freese, Michael. A situational judgment test of personal initiative and its relation to performance. (Summer 2009) Personnel Psychology. 229-258 Chan, David. Racial subgroup differences in predictive validity perceptions on personality and cognitive ability tests. Journal of Applied Psychology. Vol 82(2), Apr 1997, 311-320. Fritzsche, B.A.; Stagl, K.C.; Salas, E.; Burke, C.S. (2006) Enhancing the design, delivery, and evaluation of scenario-based training: can situational judgment tests contribute. In Situational Judgement Tests: Theory, Measurement, and Application. Hogan, Robert (2005). In defense of personality measurement: new wine for old whiners. Human Performance 18(4), 31-41. Hooper, A., Cullen, M., and Sackett, P. (2006). Operational Threats to the Use of SJTs: Faking, Coaching, and Retesting Issues. In Situational Judgement Tests: Theory, Measurement, and Application. Landy, F. J. (2007). The validation of personnel decisions in the twenty first century: Back to the future. In S. M. McPhail (Ed.), Alternate validation strategies: Developing and leveraging existing validity evidence (pp. 409–426). San Francisco: Jossey-Bass. Lievens, Filip and Sackett, Paul R. (2006). Video-Based Versus Written Situational Judgment Tests: A Comparison in Terms of Predictive Validity. Journal of Applied Psychology, 91, 1181-1188. Lievens, F.; Sackett, P.; and Buyse, Tine. (2009). The effects of response instructions on situational judgment test performance and validity in a high-stakes context. Journal of Applied Psychology 94(4), 1095-1101. McDaniel, Michael and Nguyen, Nhung. Sitautional Judgment Tests: a review of practice and constructs assessed. (March/June 2001). International Journal of Selection and Assessment, 9(1/2), 103-113. McDaniel, Michael, Finnegan, Elizabeth, Morgeson, Frederick, Campion, Michael, Braverman, Eric. (2001). Use of Situational Judgment Tests to Predict Job Performance: A clarification of the literature. Journal of Applied Psychology, 86(4), 730-740.
  • 34. McDaniel, Whetzel, Hartman, Nguyen, and Grubb (2006). Situational Judgment Tests: Validity and an Integrative Model. Situational Judgment Tests: Theory, Measurement, and Application. McDaniel, Michael, Hartman, Nathan S., Whetzel, Deborah, Grubb, W. Lee III. (2007) Situational judgment tests, response instructions, and validity: a meta-analysis. Personnel Psychology,60, 63-91. Motowidlo, SJ, Dunnette, MD, Carter, GW (1990), "An alternative selection procedure: thelow-fidelity simulation", Journal of Applied Psychology, Vol. 75 pp.640-7 Mumford, T.V.; Campion, M.; Morgeson, F.P. (2006). Sitautional judgment tests in work teams: a team role typology. In Situational Judgement Tests: Theory, Measurement, and Application. Nguyen, N. T., Biderman, M. D., & McDaniel, M. A. (2005). Effects of response instructions on faking a situational judgment test. International Journal of Selection and Assessment, 13, 250–260. Frederick L. Oswald, Neal Schmitt, Brian H. Kim, Lauren J. Ramsay, and Michael A. Gillespie (2004). Developing a Biodata Measure and Situational Judgment Inventory as Predictors of College Student Performance. Journal of Applied Psychology 89: 187-207. Helga Peeters and Filip Lievens (2005). Situational Judgment Tests and their Predictiveness of College Students’ Success: The Influence of Faking. Educational and Psychological Measurement, 65: 70-89. Ployhart, R.E. and Ryan, A.M. (2004, April) Integrating personality tests with situational judgment tests for the prediction of customer service performance. Paper presented at the 15th annual conference of the Society for Industrial and Organizational Psychology, New Orleans, LA. Ployhart, Robert E. and Weekley, Jeff. (2003) Web-based and paper-and-pencil testing of applicants in a proctored setting: are personality tests, biodata, and situational judgment tests comparable? Personnel Psychology, 56, 733-752. Potosky, Denise and Bobko, Philip (2004). Selection testing via the Internet: practical considerations and exploratory empirical findings. Personnel Psychology,57, 1003-1034. Schmidt, Frank and Hunter, John E. (2003). Tacit Knowledge, Practical Intelligence, General Mental Ability, and Job Knowledge. Current Directions in Psychological Science, 2, 8-9.