Psy 427
Cal State Northridge
Andrew Ainsworth, PhD
Contents
 Item Analysis in General
 Classical Test Theory
 Item Response Theory Basics
Item Response Functions
Item Information Functions
Invariance
 IRT Assumptions
 Parameter Estimation in IRT
 Scoring
 Applications
What is item analysis in general?
 Item analysis provides a way of measuring
the quality of questions - seeing how
appropriate they were for the respondents
and how well they measured their ability/trait.
 It also provides a way of re-using items over
and over again in different tests with prior
knowledge of how they are going to perform;
creating a population of questions with known
properties (e.g. test bank)
Classical Test Theory
 Classical Test Theory (CTT) - analyses are
the easiest and most widely used form of
analyses. The statistics can be computed
by readily available statistical packages (or
even by hand)
 Classical Analyses are performed on the
test as a whole rather than on the item and
although item statistics can be generated,
they apply only to that group of students on
that collection of items
Classical Test Theory
 CTT is based on the true score model
 In CTT we assume that the error:
Is normally distributed
Uncorrelated with true score
Has a mean of Zero
$$X = T + E$$
Classical Test Theory
Statistics
 Difficulty (item level statistic)
 Discrimination (item level statistic)
 Reliability (test level statistic)
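As a rough illustration of the three statistics listed above, the sketch below computes item difficulty as the proportion endorsing each item, discrimination as a corrected item-total correlation, and reliability as Cronbach's alpha. These formulas are one common choice and are an assumption here, since the slides do not name specific ones; the response matrix is made up.

```python
from statistics import mean, pvariance

def item_difficulty(responses):
    """Proportion endorsing each item (rows = people, columns = items)."""
    n_items = len(responses[0])
    return [mean([row[j] for row in responses]) for j in range(n_items)]

def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def item_discrimination(responses):
    """Corrected item-total correlation: item vs. sum of the other items."""
    n_items = len(responses[0])
    result = []
    for j in range(n_items):
        item = [row[j] for row in responses]
        rest = [sum(row) - row[j] for row in responses]
        result.append(pearson(item, rest))
    return result

def cronbach_alpha(responses):
    """Cronbach's alpha from item variances and total-score variance."""
    k = len(responses[0])
    item_vars = [pvariance([row[j] for row in responses]) for j in range(k)]
    total_var = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Made-up responses from six people to four dichotomous items.
data = [
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 0],
]

difficulty = item_difficulty(data)
discrimination = item_discrimination(data)
alpha = cronbach_alpha(data)
```

All three values describe this particular group on this particular set of items, which is the sample dependence that the latent trait models discussed next try to avoid.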
Classical Test Theory vs.
Latent Trait Models
 Classical analysis has the test (not the
item) as its basis. Although the statistics
generated are often generalised to similar
students taking a similar test, they only
really apply to those students taking that
test
 Latent trait models aim to look beyond that
at the underlying traits which are producing
the test performance. They are measured
at item level and provide sample-free
measurement
Latent Trait Models
 Latent trait models have been around since the
1940s, but were not widely used until the 1960s.
Although theoretically possible, it is practically
unfeasible to use these without specialized
software.
 They aim to measure the underlying ability (or
trait) which is producing the test performance
rather than measuring performance per se.
 This leads to them being sample-free. As the
statistics are not dependent on the test situation
which generated them, they can be used more
flexibly
Item Response Theory
 Item Response Theory (IRT) – refers to a
family of latent trait models used to establish
psychometric properties of items and scales
 Sometimes referred to as modern
psychometrics because in large-scale
education assessment, testing programs and
professional testing firms IRT has almost
completely replaced CTT as the method of
choice
 IRT has many advantages over CTT that
have brought IRT into more frequent use
Three Basic Components of IRT
 Item Response Function (IRF) – Mathematical
function that relates the latent trait to the
probability of endorsing an item
 Item Information Function – an indication of
item quality; an item’s ability to differentiate
among respondents
 Invariance – position on the latent trait can be
estimated by any items with known IRFs and
item characteristics are population
independent within a linear transformation
IRT - Item Response Function
 Item Response Function (IRF) - characterizes
the relation between a latent variable (i.e.,
individual differences on a construct) and the
probability of endorsing an item.
 The IRF models the relationship between
examinee trait level, item properties and the
probability of endorsing the item.
 Examinee trait level is signified by the Greek
letter theta (θ) and is typically scaled to have a
mean of 0 and a standard deviation of 1
IRT - Item Characteristic Curves
 IRFs can then be converted into Item
Characteristic Curves (ICC), which are
graphical functions that represent the
probability of endorsing the item as a
function of the respondent's ability
IRF – Item Parameters
Location (b)
 An item’s location is defined as the amount
of the latent trait needed to have a .5
probability of endorsing the item.
 The higher the “b” parameter the higher on
the trait level a respondent needs to be in
order to endorse the item
 Analogous to difficulty in CTT
 Like Z scores, the values of b typically range
from -3 to +3
IRF – Item Parameters
Discrimination (a)
 Indicates the steepness of the IRF at the
item's location
 An item's discrimination indicates how
strongly related the item is to the latent trait,
like loadings in a factor analysis
 Items with high discriminations are better at
differentiating respondents around the
location point; small changes in the latent trait
lead to large changes in probability
 Vice versa for items with low discriminations
[Figure: item characteristic curve with its location and discrimination marked; x-axis: Latent Trait (-3 to 3), y-axis: Probability of Endorsing (0 to 1)]
[Figure: items 1 and 2 with the same discrimination and different locations; same axes]
[Figure: items 1 and 2 with different discriminations and different locations; same axes]
IRF – Item Parameters
Guessing (c)
 The inclusion of a “c” parameter suggests
that respondents very low on the trait may
still choose the correct answer.
 In other words respondents with low trait
levels may still have a small probability of
endorsing an item
 This is mostly used with multiple choice
testing…and the value should not vary
excessively from the reciprocal of the number
of choices.
[Figure: items 1 and 2 with different locations, different discriminations and a lower asymptote; x-axis: Latent Trait (-3 to 3), y-axis: Probability of Endorsing (0 to 1)]
IRF – Item Parameters
Upper asymptote (d)
 The inclusion of a “d” parameter suggests
that respondents very high on the latent trait
are not guaranteed (i.e. have less than 1
probability) to endorse the item
 This often applies to an item that is difficult
to endorse (e.g. suicidal ideation as an indicator
of depression)
[Figure: IRFs for the 2PLM, 3PLM and 4PLM; x-axis: Trait Level (-3 to 3), y-axis: Probability (0 to 1)]
IRT - Item Response Function
 The 4-parameter logistic model
 Where
θ represents examinee trait level
b is the item difficulty that determines the location of
the IRF
a is the item’s discrimination that determines the
steepness of the IRF
c is a lower asymptote parameter for the IRF
d is an upper asymptote parameter for the IRF
$$P(X = 1 \mid \theta, a, b, c, d) = c + (d - c)\,\frac{e^{a(\theta - b)}}{1 + e^{a(\theta - b)}}$$
IRT - Item Response Function
 The 3-parameter logistic model
 If the upper asymptote parameter is set to
1.0, then the model is termed a 3PL.
 In this model, individuals at low trait levels
have a non-zero probability of endorsing the
item.
$$P(X = 1 \mid \theta, a, b, c) = c + (1 - c)\,\frac{e^{a(\theta - b)}}{1 + e^{a(\theta - b)}}$$
IRT - Item Response Function
 The 2-parameter logistic model
 If in addition the lower asymptote
parameter is constrained to zero, then the
model is termed a 2PL.
 In the 2PLM, IRFs vary both in their
discrimination and difficulty (i.e., location)
parameters.
$$P(X = 1 \mid \theta, a, b) = \frac{e^{a(\theta - b)}}{1 + e^{a(\theta - b)}}$$
IRT - Item Response Function
 The 1-parameter logistic model
 If the item discrimination is set to 1.0 (or
any constant) the result is a 1PL
 A 1PL assumes that all scale items relate
to the latent trait equally and items vary
only in difficulty (equivalent to having
equal factor loadings across items).
$$P(X = 1 \mid \theta, b) = \frac{e^{(\theta - b)}}{1 + e^{(\theta - b)}}$$
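The four logistic models above nest inside one another, which a single function can make concrete. The following is a minimal sketch, not taken from the slides; the function name `irf` and its default arguments are illustrative choices.

```python
import math

def irf(theta, b, a=1.0, c=0.0, d=1.0):
    """Probability of endorsing an item under the 4PL model.

    Defaults reduce the model step by step:
    d=1 gives the 3PL, c=0 as well gives the 2PL, a=1 as well gives the 1PL.
    """
    # e^x / (1 + e^x) is written in the equivalent form 1 / (1 + e^-x).
    logistic = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return c + (d - c) * logistic

p_1pl = irf(0.0, b=0.0)                       # respondent sits at the item location
p_3pl = irf(0.0, b=0.0, a=2.0, c=0.2)         # lower asymptote lifts the curve
p_low = irf(-10.0, b=0.0, c=0.25)             # very low trait: close to c
p_high = irf(10.0, b=0.0, d=0.9)              # very high trait: close to d
```

Note that once c is above 0 or d is below 1, the probability at θ = b is (c + d) / 2 rather than .5.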
Quick Detour: Rasch Models vs.
Item Response Theory Models
 Mathematically, Rasch models are
identical to the most basic IRT model
(1PL), however there are some
(important) differences
 In the Rasch approach the model takes precedence:
data that do not fit the model are discarded
 Rasch does not permit abilities to be
estimated for extreme items and persons
 And other differences
IRT - Test Response Curve
 Test Response Curves (TRC) - Item
response functions are additive so
that items can be combined to create
a TRC.
 A TRC gives the expected number of items
endorsed as a function of the latent trait
[Figure: IRT - Test Response Curve; x-axis: Trait (-4 to 4), y-axis: Number of Items (0 to 20)]
IRT – Item Information Function
 Item Information Function (IIF) – Item
reliability is replaced by item information
in IRT.
 Each IRF can be transformed into an
item information function (IIF); the
precision an item provides at all levels of
the latent trait.
 The information is an index representing
the item’s ability to differentiate among
individuals.
IRT – Item Information Function
 The standard error of measurement is the
square root of the reciprocal of information
(the error variance of the trait estimate is
1 / information), and thus,
more information means less error.
 Measurement error is expressed on the
same metric as the latent trait level, so it
can be used to build confidence intervals.
IRT – Item Information Function
 Difficulty parameter - the location of the
highest information point
 Discrimination - height of the information.
 Large discriminations - tall and narrow
IIFs; high precision/narrow range
 Low discrimination - short and wide IIFs;
low precision/broad range.
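The points above can be checked numerically. This sketch assumes the 2PL, for which the standard closed form of item information is I(θ) = a² P(θ)(1 − P(θ)); the parameter values are illustrative.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """2PL item information: a^2 * P * (1 - P)."""
    p = irf_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

# Information peaks at theta = b, where P = .5, with height a^2 / 4.
peak_high_a = item_information(0.0, a=2.0, b=0.0)
peak_low_a = item_information(0.0, a=0.5, b=0.0)

# Far from the location, the low-discrimination item is the more informative one.
far_high_a = item_information(3.0, a=2.0, b=0.0)
far_low_a = item_information(3.0, a=0.5, b=0.0)
```

The tall, narrow curve of the a = 2.0 item and the short, wide curve of the a = 0.5 item show up as the reversal between the two pairs of values.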
[Figure: IRFs for the 2PLM, 3PLM and 4PLM; x-axis: Trait Level (-3 to 3), y-axis: Probability (0 to 1)]
[Figure: item information functions for the 2PLM, 3PLM and 4PLM; x-axis: Trait Level (-3 to 3), y-axis: Information (0 to 1)]
IRT – Test Information Function
 Test Information Function (TIF) – The
IIFs are also additive so that we can
judge the test as a whole and see at
which part of the trait range it is working
the best.
[Figure: test information function; x-axis: Trait Level (-3 to 3), y-axis: Information (0 to 1.4)]
[Figure: test information with standard error; x-axis: Latent Trait (-3 to 3), y-axis: Information (0 to 30), second y-axis: Standard Error (0 to 2.4)]
Item Response Theory
Example
 The same 24 items from the MMPI-2 that
assess Social Discomfort
 Dichotomous Items; 1 represents an
endorsement of the item in the direction of
discomfort
 Fit a 2PL IRT model to the data to look at
the difficulty, discrimination and information
for each item
IRT - Invariance
 Invariance - IRT model parameters have an
invariance property
Examinee trait level estimates do not depend on
which items are administered, and in turn, item
parameters do not depend on a particular sample
of examinees (within a linear transformation).
 Invariance allows researchers to: 1) efficiently
“link” different scales that measure the same
construct, 2) compare examinees even if they
responded to different items, and 3)
implement computerized adaptive testing.
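The phrase "within a linear transformation" can be illustrated with a short check. Under the 2PL, rescaling the trait as θ* = Aθ + B while setting b* = Ab + B and a* = a / A leaves every response probability unchanged. The constants A and B and the parameter values below are arbitrary illustrative numbers.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def rescale(theta, a, b, A, B):
    """Move a person and an item onto the linearly transformed metric."""
    return A * theta + B, a / A, A * b + B

theta, a, b = 0.7, 1.4, -0.3
A, B = 2.0, 5.0  # arbitrary slope and intercept of the transformation

theta_new, a_new, b_new = rescale(theta, a, b, A, B)
p_original = irf_2pl(theta, a, b)
p_rescaled = irf_2pl(theta_new, a_new, b_new)
```

The two probabilities agree because a*(θ* − b*) = a(θ − b). This is why parameters estimated in different samples have to be placed on a common metric before they are compared.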
IRT - Assumptions
 Monotonicity - logistic IRT models
assume a monotonically increasing
function (as trait level increases, so
does the probability of endorsing an
item).
 If this is violated, then it makes no
sense to apply logistic models to
characterize item response data.
IRT - Assumptions
 Unidimensionality – In the IRT models
described above, individual differences
are characterized by a single parameter,
theta.
Multidimensional IRT models exist but are not
as commonly applied
Commonly applied IRT models assume that a
single common factor (i.e., the latent trait)
accounts for the item covariance.
Often assessed using specialized Factor
Analytic models for dichotomous items
IRT - Assumptions
 Local independence - The Local
independence (LI) assumption requires
that item responses are uncorrelated
after controlling for the latent trait.
When LI is violated, this is called local
dependence (LD).
LI and unidimensionality are related
Highly univocal scales can still have
violations of local independence (e.g. item
content, etc.).
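Stated as an equation, local independence is usually written in the slightly stronger form of conditional independence rather than zero correlation. For two items i and j, in notation consistent with the IRFs above:

```latex
P(X_i = x_i, X_j = x_j \mid \theta) = P(X_i = x_i \mid \theta)\, P(X_j = x_j \mid \theta)
```

so that the likelihood of a full response pattern on n items factors into a product of item-level probabilities:

```latex
P(X_1 = x_1, \ldots, X_n = x_n \mid \theta) = \prod_{i=1}^{n} P(X_i = x_i \mid \theta)
```

Local dependence means this factorization fails for at least one pair of items.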
IRT - Assumptions
 Local dependence:
1. distorts item parameter estimates (i.e., can
cause item slopes to be larger than they
should be),
2. causes scales to look more precise than they
really are, and
3. when LD exists, a large correlation between
two or more items can essentially define or
dominate the latent trait, thus causing the
scale to lack construct validity.
IRT - Assumptions
 Once LD is identified, the next step is to
address it:
Form testlets (Wainer & Kiely, 1987) by
combining locally dependent items
Delete one or more of the LD items from the scale so
local independence is achieved.
IRT - Assumptions
 Qualitatively homogeneous population - IRT
models assume that the same IRF applies to
all members of the population
Differential item functioning (DIF) is a violation of
this and means that there is a violation of the
invariance property
DIF occurs when an item has a different IRF for two
or more groups; therefore examinees that are equal
on the latent trait have different probabilities
(expected scores) of endorsing the item.
No single IRF can be applied to the population
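A toy illustration of DIF, assuming a 2PL item whose location differs between two groups. The group labels and parameter values are hypothetical.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# Hypothetical (a, b) estimates for the same item in two groups.
group_params = {"reference": (1.2, 0.0), "focal": (1.2, 0.6)}

theta = 0.5  # two examinees with the same trait level, one per group
probs = {group: irf_2pl(theta, a, b) for group, (a, b) in group_params.items()}
dif_gap = probs["reference"] - probs["focal"]
```

A non-zero gap at equal trait levels is what distinguishes DIF from a genuine difference between the groups on the trait itself.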
Applications
 Ordered Polytomous Items
IRT models exist for data that are not
dichotomously scored
With dichotomous items there is a single difficulty
(location) that indicates the threshold at which the
probability switches from favoring one choice to
favoring the other
With polytomous items a separate difficulty exists
as a threshold between each pair of adjacent
ordered categories
[Figure: category thresholds for a polytomous item; x-axis: Latent Trait (-3 to 3), y-axis: Probability of Endorsing (0 to 1)]
[Figure: Item 1, category thresholds; same axes]
[Figure: Item 2, category thresholds; same axes]
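The slides do not name a specific polytomous model. As one common choice, the sketch below uses Samejima's graded response model, in which each threshold b_k defines a 2PL-style curve for responding in category k or higher, and category probabilities are differences between adjacent curves. The parameter values are hypothetical.

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Category probabilities under the graded response model.

    thresholds must be in ascending order; K - 1 thresholds give K categories.
    """
    # Cumulative probabilities P(X >= k), padded with 1 (lowest) and 0 (beyond highest).
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b_k))) for b_k in thresholds]
    cumulative.append(0.0)
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]

# Hypothetical four-category item with three thresholds.
probs_low = grm_category_probs(-2.0, a=1.5, thresholds=[-1.0, 0.0, 1.5])
probs_high = grm_category_probs(2.5, a=1.5, thresholds=[-1.0, 0.0, 1.5])
```

At low trait levels the lowest category is the most likely response, and at high trait levels the highest category is.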
Applications
 Differential Item Functioning
How can age groups, genders, cultures, ethnic groups,
and socioeconomic backgrounds be meaningfully
compared?
Can be a research goal as opposed to just a test of an
assumption
Test equivalency of test items translated into multiple
languages
Test items influenced by cultural differences
Test for intelligence items that are gender biased
Test for age differences in response to personality
items
“Don’t care about life”
Applications
 Scaling individuals for further analysis
We often collect data in multifaceted forms (e.g.
multi-item surveys) and then collapse them into
a single raw score
IRT based scores represent an optimal scaling of
individuals on the trait
Most sophisticated analyses require at least
interval level measurement and IRT scores are
closer to interval level than raw scores
Using scaled scores as opposed to raw scores
has been shown to reduce spurious results
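One way to turn a response pattern into an IRT-based score is expected a posteriori (EAP) estimation: weight each candidate trait level by a prior and by the likelihood of the observed responses, then take the weighted mean. The sketch below is an assumption about method, since the slides do not specify an estimator; it uses 2PL items, a standard normal prior, and hypothetical item parameters.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def eap_score(responses, items):
    """EAP trait estimate on a fixed grid from -4 to 4 in steps of 0.1."""
    grid = [i / 10 for i in range(-40, 41)]
    weights = []
    for theta in grid:
        weight = math.exp(-0.5 * theta * theta)  # unnormalised N(0, 1) prior
        for x, (a, b) in zip(responses, items):
            p = irf_2pl(theta, a, b)
            weight *= p if x == 1 else 1.0 - p   # likelihood of this response
        weights.append(weight)
    return sum(t * w for t, w in zip(grid, weights)) / sum(weights)

# Hypothetical (discrimination, location) pairs.
items = [(1.0, -1.0), (1.0, 0.0), (1.0, 1.0)]

score_all_endorsed = eap_score([1, 1, 1], items)
score_none_endorsed = eap_score([0, 0, 0], items)
score_easy_only = eap_score([1, 0, 0], items)
```

Unlike a raw sum, the estimate depends on which items were endorsed and on their parameters, not only on how many.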
Applications
 Scale Construction and Modification
The focus is changing from creating fixed
length, paper/pencil tests to creating a
“universe” of items with known IRFs that
can be used interchangeably
Scales are being designed based around
IRT properties
Pre-existing scales that were developed
using CTT are being “revamped” using IRT
Applications
 Computer Adaptive Testing (CAT)
As an extension of the previous slide, once a
“universe” (i.e. test bank) of items with known
IRFs is created they can be used to measure
traits in a computer adaptive form
An item is given to the participant (usually easy
to moderate difficulty) and their answer allows
their trait score to be estimated, so that the next
item is chosen to target that trait level
After the second item is answered their trait
score is re-estimated, etc.
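The adaptive loop described above can be sketched in a few lines. This is a simplified illustration under several assumptions that are not in the slides: a 2PL item bank with hypothetical parameters, item selection by maximum information at the current estimate, EAP re-estimation after each answer, a starting estimate of 0, and a deterministic simulated respondent.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """2PL item information: a^2 * P * (1 - P)."""
    p = irf_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

def eap_estimate(answers, bank):
    """EAP trait estimate from a list of (item_index, response) pairs."""
    grid = [i / 10 for i in range(-40, 41)]
    weights = []
    for theta in grid:
        weight = math.exp(-0.5 * theta * theta)  # unnormalised N(0, 1) prior
        for index, x in answers:
            p = irf_2pl(theta, *bank[index])
            weight *= p if x == 1 else 1.0 - p
        weights.append(weight)
    return sum(t * w for t, w in zip(grid, weights)) / sum(weights)

def run_cat(bank, answer_item, n_items=5):
    """Administer n_items adaptively; return the final estimate and item order."""
    theta_hat, answers = 0.0, []
    for _ in range(n_items):
        used = {index for index, _ in answers}
        remaining = [i for i in range(len(bank)) if i not in used]
        # Pick the unused item that is most informative at the current estimate.
        next_item = max(remaining, key=lambda i: item_information(theta_hat, *bank[i]))
        answers.append((next_item, answer_item(next_item)))
        theta_hat = eap_estimate(answers, bank)
    return theta_hat, [index for index, _ in answers]

# Hypothetical bank: equal discriminations, locations from -2.0 to 2.0.
bank = [(1.2, b / 2) for b in range(-4, 5)]

# Simulated respondent with a true trait level of 1.0 who endorses
# an item whenever the model probability is at least .5.
true_theta = 1.0

def respond(index):
    return 1 if irf_2pl(true_theta, *bank[index]) >= 0.5 else 0

theta_hat, administered = run_cat(bank, respond)
```

An operational CAT would also need a stopping rule, for example a target standard error, plus exposure control; both are left out here.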
Applications
 Computer Adaptive Testing (CAT)
CA tests are at least twice as efficient as their
paper and pencil counterparts with no loss of
precision
Primary testing approach used by ETS
Adaptive form of the Headache Impact Survey
outperformed the paper-and-pencil counterpart in
reducing patient burden, tracking change and in
reliability and validity (Ware et al., 2003)
Applications
 Test Equating
Participants that have taken different tests
measuring the same construct (e.g. Beck
depression vs. CESD), but both have items
with known IRFs, can be placed on the
same scale and compared or scored
equivalently
Equating across grades on math ability
Equating across years for placement or
admissions tests

 
03. ElasticSearch : Data In, Data Out
03. ElasticSearch : Data In, Data Out03. ElasticSearch : Data In, Data Out
03. ElasticSearch : Data In, Data Out
 
01 ElasticSearch : Getting Started
01 ElasticSearch : Getting Started01 ElasticSearch : Getting Started
01 ElasticSearch : Getting Started
 
A Simple Guide to the Item Response Theory (IRT) and Rasch Modeling
A Simple Guide to the Item Response Theory (IRT) and Rasch ModelingA Simple Guide to the Item Response Theory (IRT) and Rasch Modeling
A Simple Guide to the Item Response Theory (IRT) and Rasch Modeling
 
A Non-Technical Approach for Illustrating Item Response Theory
A Non-Technical Approach for Illustrating Item Response TheoryA Non-Technical Approach for Illustrating Item Response Theory
A Non-Technical Approach for Illustrating Item Response Theory
 
Strategi Pembuatan Karya Ilmiah Bagi Anggota KIR (Kelompok Ilmiah Remaja)
Strategi Pembuatan Karya Ilmiah Bagi Anggota KIR (Kelompok Ilmiah Remaja)Strategi Pembuatan Karya Ilmiah Bagi Anggota KIR (Kelompok Ilmiah Remaja)
Strategi Pembuatan Karya Ilmiah Bagi Anggota KIR (Kelompok Ilmiah Remaja)
 
Software Development : Template Dokumen Uji Terima Aplikasi (User Acceptance ...
Software Development : Template Dokumen Uji Terima Aplikasi (User Acceptance ...Software Development : Template Dokumen Uji Terima Aplikasi (User Acceptance ...
Software Development : Template Dokumen Uji Terima Aplikasi (User Acceptance ...
 
Software Development : Change Request Template (2)
Software Development : Change Request Template (2)Software Development : Change Request Template (2)
Software Development : Change Request Template (2)
 
Software Development : Minutes of Meeting Form - Template
Software Development : Minutes of Meeting Form - TemplateSoftware Development : Minutes of Meeting Form - Template
Software Development : Minutes of Meeting Form - Template
 
Software Development : Change Request Template
Software Development : Change Request TemplateSoftware Development : Change Request Template
Software Development : Change Request Template
 

Kürzlich hochgeladen

Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...Amil baba
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Pooja Bhuva
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 

Kürzlich hochgeladen (20)

Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 

Introduction to Item Response Theory

  • 1. Psy 427 Cal State Northridge Andrew Ainsworth, PhD
  • 2. Contents
      • Item Analysis in General
      • Classical Test Theory
      • Item Response Theory Basics
          • Item Response Functions
          • Item Information Functions
          • Invariance
      • IRT Assumptions
      • Parameter Estimation in IRT
      • Scoring
      • Applications
  • 3. What is item analysis in general?
      • Item analysis provides a way of measuring the quality of questions: seeing how appropriate they were for the respondents and how well they measured the respondents’ ability or trait.
      • It also makes it possible to reuse items in different tests with prior knowledge of how they will perform, creating a population of questions with known properties (e.g. a test bank).
  • 6. Classical Test Theory
      • Classical Test Theory (CTT) analyses are the easiest and most widely used form of analysis. The statistics can be computed with readily available statistical packages (or even by hand).
      • Classical analyses are performed on the test as a whole rather than on the item. Although item statistics can be generated, they apply only to that group of students on that collection of items.
  • 7. Classical Test Theory
      • CTT is based on the true score model, $X = T + E$, in which the observed score $X$ is the sum of a true score $T$ and an error $E$.
      • In CTT we assume that the error:
          • is normally distributed
          • is uncorrelated with the true score
          • has a mean of zero
  • 8. Classical Test Theory Statistics
      • Difficulty (item-level statistic)
      • Discrimination (item-level statistic)
      • Reliability (test-level statistic)
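
As a rough illustration of the three CTT statistics listed above, the sketch below computes them for a small made-up response matrix. The slides do not say which formulas are intended, so the choices here are assumptions: difficulty is the proportion endorsing the item, discrimination is the corrected item-total correlation, and reliability is Cronbach's alpha. The data and function names are hypothetical.

```python
from statistics import mean, pvariance

# Hypothetical data: rows are respondents, columns are dichotomous items
# (1 = correct/endorsed, 0 = not).
responses = [
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 0, 1],
]

def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def item_difficulty(data, j):
    """Proportion of respondents endorsing item j (assumed definition)."""
    return mean(row[j] for row in data)

def item_discrimination(data, j):
    """Correlation of item j with the total score on the remaining items."""
    item = [row[j] for row in data]
    rest = [sum(row) - row[j] for row in data]
    return pearson(item, rest)

def cronbach_alpha(data):
    """Cronbach's alpha, used here as one common reliability coefficient."""
    k = len(data[0])
    item_vars = [pvariance([row[j] for row in data]) for j in range(k)]
    total_var = pvariance([sum(row) for row in data])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

for j in range(len(responses[0])):
    print(j, item_difficulty(responses, j), item_discrimination(responses, j))
print("alpha:", cronbach_alpha(responses))
```

All three numbers change if a different group of respondents answers the same items, which is the sample dependence the slides attribute to CTT.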
  • 9. Classical Test Theory vs. Latent Trait Models
      • Classical analysis has the test (not the item) as its basis. Although the statistics generated are often generalised to similar students taking a similar test, they only really apply to those students taking that test.
      • Latent trait models aim to look beyond that, at the underlying traits that produce the test performance. They are measured at the item level and provide sample-free measurement.
  • 10. Latent Trait Models
      • Latent trait models have been around since the 1940s but were not widely used until the 1960s. Although theoretically possible, it is practically infeasible to use them without specialized software.
      • They aim to measure the underlying ability (or trait) that produces the test performance rather than measuring performance per se.
      • This makes them sample-free: because the statistics do not depend on the test situation that generated them, they can be used more flexibly.
  • 12. Item Response Theory
      • Item Response Theory (IRT) refers to a family of latent trait models used to establish the psychometric properties of items and scales.
      • It is sometimes referred to as modern psychometrics because, in large-scale educational assessment, testing programs and professional testing firms, IRT has almost completely replaced CTT as the method of choice.
      • IRT has many advantages over CTT that have brought it into more frequent use.
  • 13. Three Basic Components of IRT
      • Item Response Function (IRF): a mathematical function that relates the latent trait to the probability of endorsing an item.
      • Item Information Function: an indication of item quality; an item’s ability to differentiate among respondents.
      • Invariance: position on the latent trait can be estimated from any items with known IRFs, and item characteristics are population independent within a linear transformation.
  • 15. IRT - Item Response Function
      • The Item Response Function (IRF) characterizes the relation between a latent variable (i.e., individual differences on a construct) and the probability of endorsing an item.
      • The IRF models the relationship between examinee trait level, item properties and the probability of endorsing the item.
      • Examinee trait level is signified by the Greek letter theta (θ) and is typically scaled to have a mean of 0 and a standard deviation of 1.
  • 16. IRT - Item Characteristic Curves
      • IRFs can be plotted as Item Characteristic Curves (ICCs), which show the probability of endorsing the item as a function of the respondent’s ability.
  • 17. IRF – Item Parameters: Location (b)
      • An item’s location is defined as the amount of the latent trait needed to have a .5 probability of endorsing the item.
      • The higher the “b” parameter, the higher on the trait a respondent needs to be in order to endorse the item.
      • Analogous to difficulty in CTT.
      • Like z scores, the values of b typically range from -3 to +3.
  • 18. IRF – Item Parameters: Discrimination (a)
      • Indicates the steepness of the IRF at the item’s location.
      • An item’s discrimination indicates how strongly the item is related to the latent trait, like loadings in a factor analysis.
      • Items with high discrimination are better at differentiating respondents around the location point; small changes in the latent trait lead to large changes in probability.
      • The reverse holds for items with low discrimination.
  • 19. [Figure: item characteristic curve with its location and discrimination marked; x-axis: latent trait (-3 to 3), y-axis: probability of endorsing (0 to 1)]
  • 20. [Figure: item characteristic curves for items 1 and 2 with the same discrimination and different locations; x-axis: latent trait (-3 to 3), y-axis: probability of endorsing (0 to 1)]
  • 21. [Figure: item characteristic curves for items 1 and 2 with different discriminations and different locations; x-axis: latent trait (-3 to 3), y-axis: probability of endorsing (0 to 1)]
  • 22. IRF – Item Parameters: Guessing (c)
      • The inclusion of a “c” parameter suggests that respondents very low on the trait may still choose the correct answer.
      • In other words, respondents with low trait levels may still have a small probability of endorsing an item.
      • This is mostly used with multiple-choice testing, and the value should not vary excessively from the reciprocal of the number of choices.
  • 23. [Figure: item characteristic curves for items 1 and 2 with different locations, different discriminations and a lower asymptote; x-axis: latent trait (-3 to 3), y-axis: probability of endorsing (0 to 1)]
  • 24. IRF – Item Parameters: Upper asymptote (d)
      • The inclusion of a “d” parameter suggests that respondents very high on the latent trait are not guaranteed to endorse the item (i.e., their probability of endorsing it is less than 1).
      • Often used for an item that is difficult to endorse (e.g., suicidal ideation as an indicator of depression).
  • 25. [Figure: IRFs for the 2PLM, 3PLM and 4PLM; x-axis: trait level (-3 to 3), y-axis: probability (0 to 1)]
  • 26. IRT - Item Response Function
      • The 4-parameter logistic model:
        $$P(X = 1 \mid \theta, a, b, c, d) = c + (d - c)\,\frac{e^{a(\theta - b)}}{1 + e^{a(\theta - b)}}$$
      • where
          • θ represents examinee trait level
          • b is the item difficulty, which determines the location of the IRF
          • a is the item’s discrimination, which determines the steepness of the IRF
          • c is a lower asymptote parameter for the IRF
          • d is an upper asymptote parameter for the IRF
  • 27. IRT - Item Response Function
      • The 3-parameter logistic model:
        $$P(X = 1 \mid \theta, a, b, c) = c + (1 - c)\,\frac{e^{a(\theta - b)}}{1 + e^{a(\theta - b)}}$$
      • If the upper asymptote parameter is set to 1.0, then the model is termed a 3PL.
      • In this model, individuals at low trait levels have a non-zero probability of endorsing the item.
  • 28. IRT - Item Response Function
      • The 2-parameter logistic model:
        $$P(X = 1 \mid \theta, a, b) = \frac{e^{a(\theta - b)}}{1 + e^{a(\theta - b)}}$$
      • If in addition the lower asymptote parameter is constrained to zero, then the model is termed a 2PL.
      • In the 2PLM, IRFs vary both in their discrimination and difficulty (i.e., location) parameters.
  • 29. IRT - Item Response Function
      • The 1-parameter logistic model:
        $$P(X = 1 \mid \theta, b) = \frac{e^{(\theta - b)}}{1 + e^{(\theta - b)}}$$
      • If the item discrimination is set to 1.0 (or any constant), the result is a 1PL.
      • A 1PL assumes that all scale items relate to the latent trait equally and items vary only in difficulty (equivalent to having equal factor loadings across items).
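
The nesting of the four logistic models can be made concrete with a single function whose default arguments apply the constraints one at a time. This is a minimal sketch: the function name `irf` and the parameter values in the usage lines are illustrative, not taken from the slides.

```python
import math

def irf(theta, b, a=1.0, c=0.0, d=1.0):
    """Probability of endorsing an item under the 4PL model.

    With the defaults the function reduces step by step:
    d = 1 gives the 3PL, additionally c = 0 gives the 2PL,
    and additionally a = 1 gives the 1PL.
    """
    logistic = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return c + (d - c) * logistic

# Illustrative parameter values (hypothetical).
print(irf(0.0, b=0.0))                        # 1PL
print(irf(0.0, b=0.0, a=2.0))                 # 2PL
print(irf(0.0, b=0.0, a=2.0, c=0.2))          # 3PL
print(irf(0.0, b=0.0, a=2.0, c=0.2, d=0.9))   # 4PL
```

One consequence worth noting: at θ = b the 1PL and 2PL give a probability of exactly .5, whereas with asymptotes the value at θ = b is the midpoint between them, (c + d) / 2.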
  • 30. Quick Detour: Rasch Models vs. Item Response Theory Models
      • Mathematically, Rasch models are identical to the most basic IRT model (1PL); however, there are some important differences.
      • In the Rasch approach the model takes precedence: data that do not fit the model are discarded.
      • Rasch does not permit abilities to be estimated for extreme items and persons.
      • And other differences.
  • 31. IRT - Test Response Curve
      • Test Response Curves (TRC): item response functions are additive, so items can be combined to create a TRC.
      • A TRC gives the expected number of items endorsed as a function of the latent trait.
  • 32. [Figure: IRT test response curve; x-axis: trait (-4 to 4), y-axis: number of items (0 to 20)]
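
Because IRFs are additive, a test response curve is just the sum of the item probabilities at each trait level. The sketch below assumes 2PL items and uses three hypothetical (a, b) pairs; `expected_score` is an illustrative name.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# Hypothetical item parameters as (discrimination a, location b) pairs.
items = [(1.0, -1.0), (1.5, 0.0), (2.0, 1.0)]

def expected_score(theta, items):
    """Test response curve: expected number of items endorsed at theta."""
    return sum(irf_2pl(theta, a, b) for a, b in items)

for theta in (-3, -2, -1, 0, 1, 2, 3):
    print(theta, round(expected_score(theta, items), 3))
```

The curve rises from near 0 at very low trait levels toward the number of items at very high trait levels.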
  • 34. IRT – Item Information Function
      • Item Information Function (IIF): item reliability is replaced by item information in IRT.
      • Each IRF can be transformed into an item information function (IIF), which gives the precision the item provides at each level of the latent trait.
      • The information is an index representing the item’s ability to differentiate among individuals.
  • 35. IRT – Item Information Function
      • The error variance of the trait-level estimate is the reciprocal of information, so the standard error of measurement is the reciprocal of the square root of information; more information means less error.
      • Measurement error is expressed on the same metric as the latent trait level, so it can be used to build confidence intervals.
  • 36. IRT – Item Information Function
      • Difficulty parameter: the location of the highest information point.
      • Discrimination: the height of the information.
      • Large discrimination: tall and narrow IIFs; high precision over a narrow range.
      • Low discrimination: short and wide IIFs; low precision over a broad range.
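
For the 2PL model the item information function has a simple closed form, I(θ) = a²·P(θ)·(1 − P(θ)), which peaks at θ = b with height a²/4. The slides do not give this formula, so treat the sketch below as an illustration under the 2PL assumption, with hypothetical parameter values.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """2PL item information: a^2 * P * (1 - P)."""
    p = irf_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

# Two hypothetical items at the same location with different discriminations.
for theta in (-3, -1, 0, 1, 3):
    steep = item_information(theta, a=2.0, b=0.0)
    flat = item_information(theta, a=0.5, b=0.0)
    print(theta, round(steep, 4), round(flat, 4))
```

The highly discriminating item is far more informative near its location, while the weakly discriminating item provides more information far from it, which matches the tall-and-narrow versus short-and-wide description above.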
  • 37. [Figure: IRFs for the 2PLM, 3PLM and 4PLM; x-axis: trait level (-3 to 3), y-axis: probability (0 to 1)]
  • 38. [Figure: item information functions for the 2PLM, 3PLM and 4PLM; x-axis: trait level (-3 to 3), y-axis: information (0 to 1)]
  • 39. IRT – Test Information Function
      • Test Information Function (TIF): the IIFs are also additive, so we can judge the test as a whole and see in which part of the trait range it works best.
  • 40. [Figure: test information function; x-axis: trait level (-3 to 3), y-axis: information (0 to 1.4)]
  • 41. [Figure: test information (0 to 30) and standard error (0 to 2.4) plotted against the latent trait (-3 to 3)]
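
A short sketch of how test information and standard error relate, assuming 2PL items with hypothetical parameters: test information is the sum of the item information values, and the standard error of the trait estimate is one over the square root of that sum. The 1.96 multiplier for an approximate 95% interval assumes a normal sampling distribution.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """2PL item information: a^2 * P * (1 - P)."""
    p = irf_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

# Hypothetical item parameters as (a, b) pairs.
items = [(1.0, -1.0), (1.5, 0.0), (2.0, 1.0)]

def tif(theta, items):
    """Test information: the sum of the item information functions."""
    return sum(item_information(theta, a, b) for a, b in items)

def standard_error(theta, items):
    """Standard error of the trait estimate: 1 / sqrt(test information)."""
    return 1.0 / math.sqrt(tif(theta, items))

theta_hat = 0.0
se = standard_error(theta_hat, items)
interval = (theta_hat - 1.96 * se, theta_hat + 1.96 * se)
print(tif(theta_hat, items), se, interval)
```

Evaluating `standard_error` across the trait range shows where the test measures precisely and where it does not.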
  • 42. Item Response Theory Example
      • The same 24 items from the MMPI-2 that assess Social Discomfort.
      • Dichotomous items; 1 represents an endorsement of the item in the direction of discomfort.
      • Fit a 2PL IRT model to the data to look at the difficulty, discrimination and information for each item.
  • 44. IRT - Invariance
      • IRT model parameters have an invariance property: examinee trait level estimates do not depend on which items are administered and, in turn, item parameters do not depend on a particular sample of examinees (within a linear transformation).
      • Invariance allows researchers to:
          1. efficiently “link” different scales that measure the same construct,
          2. compare examinees even if they responded to different items, and
          3. implement computerized adaptive testing.
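
The phrase "within a linear transformation" can be checked numerically. If the trait scale is rescaled as θ* = Aθ + B, then setting b* = Ab + B and a* = a / A leaves every response probability unchanged, because a*(θ* − b*) = a(θ − b). The sketch below assumes a 2PL item; the constants A and B and the parameter values are hypothetical.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def rescale(theta, a, b, A, B):
    """Move trait and item parameters to the metric theta* = A * theta + B."""
    return A * theta + B, a / A, A * b + B

# Hypothetical person and item on the original (mean 0, SD 1) metric.
theta, a, b = 0.4, 1.3, -0.2
# Hypothetical new metric with mean 100 and SD 15.
A, B = 15.0, 100.0

theta2, a2, b2 = rescale(theta, a, b, A, B)
p_original = irf_2pl(theta, a, b)
p_rescaled = irf_2pl(theta2, a2, b2)
print(p_original, p_rescaled)
```

Linking two calibrations of the same items amounts to finding the A and B that connect their metrics.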
  • 46. IRT - Assumptions
      • Monotonicity: logistic IRT models assume a monotonically increasing function (as trait level increases, so does the probability of endorsing an item).
      • If this is violated, it makes no sense to apply logistic models to characterize item response data.
  • 48. IRT - Assumptions
      • Unidimensionality: in the IRT models described above, individual differences are characterized by a single parameter, theta.
          • Multidimensional IRT models exist but are not as commonly applied.
          • Commonly applied IRT models assume that a single common factor (i.e., the latent trait) accounts for the item covariance.
          • Often assessed using specialized factor-analytic models for dichotomous items.
  • 49. IRT - Assumptions
      • Local independence: the local independence (LI) assumption requires that item responses are uncorrelated after controlling for the latent trait.
          • When LI is violated, this is called local dependence (LD).
          • LI and unidimensionality are related.
          • Highly univocal scales can still have violations of local independence (e.g., due to item content).
  • 50. IRT - Assumptions
      • Local dependence:
          1. distorts item parameter estimates (i.e., can cause item slopes to be larger than they should be),
          2. causes scales to look more precise than they really are, and
          3. can allow a large correlation between two or more items to essentially define or dominate the latent trait, causing the scale to lack construct validity.
  • 51. IRT - Assumptions
      • Once LD is identified, the next step is to address it:
          • Form testlets (Wainer & Kiely, 1987) by combining locally dependent items.
          • Delete one or more of the LD items from the scale so that local independence is achieved.
  • 52. IRT - Assumptions
      • Qualitatively homogeneous population: IRT models assume that the same IRF applies to all members of the population.
          • Differential item functioning (DIF) is a violation of this assumption and means that the invariance property is violated.
          • DIF occurs when an item has a different IRF for two or more groups; examinees who are equal on the latent trait therefore have different probabilities (expected scores) of endorsing the item.
          • In that case no single IRF can be applied to the population.
  • 54. Applications
      • Ordered Polytomous Items
          • IRT models exist for data that are not dichotomously scored.
          • With dichotomous items there is a single difficulty (location) that indicates the threshold at which the probability switches from favoring one choice to favoring the other.
          • With polytomous items a separate difficulty exists as the threshold between each pair of adjacent ordered categories.
  • 55. [Figure: category response curves with category thresholds marked; x-axis: latent trait (-3 to 3), y-axis: probability of endorsing (0 to 1)]
  • 56. [Figure: category thresholds for item 1; x-axis: latent trait (-3 to 3), y-axis: probability of endorsing (0 to 1)]
  • 57. [Figure: category thresholds for item 2; x-axis: latent trait (-3 to 3), y-axis: probability of endorsing (0 to 1)]
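
The slides do not name a specific polytomous model. As one possible illustration, the sketch below assumes Samejima's graded response model, in which each threshold b_k defines a 2PL-style curve for responding in category k or higher, and category probabilities are differences between adjacent cumulative curves. The discrimination and threshold values are hypothetical.

```python
import math

def logistic(x):
    return 1.0 / (1.0 + math.exp(-x))

def grm_category_probs(theta, a, thresholds):
    """Category probabilities under the graded response model (assumed model).

    thresholds must be in increasing order; an item with m thresholds
    has m + 1 ordered response categories.
    """
    # Cumulative probabilities P(X >= k): 1 for the lowest category,
    # 0 beyond the highest one.
    cumulative = [1.0] + [logistic(a * (theta - b)) for b in thresholds] + [0.0]
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]

# Hypothetical 4-category item with three ordered thresholds.
a = 1.5
thresholds = [-1.0, 0.0, 1.2]
for theta in (-2.0, 0.0, 2.0):
    probs = grm_category_probs(theta, a, thresholds)
    print(theta, [round(p, 3) for p in probs])
```

At each threshold the cumulative probability of responding in that category or higher is exactly .5, which mirrors the role of the single location parameter for a dichotomous item.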
  • 58. Applications
      • Differential Item Functioning
          • How can age groups, genders, cultures, ethnic groups and socioeconomic backgrounds be meaningfully compared?
          • DIF can be a research goal in its own right, as opposed to just a test of an assumption:
              • Test the equivalence of test items translated into multiple languages.
              • Test for items influenced by cultural differences.
              • Test for intelligence items that are gender biased.
              • Test for age differences in responses to personality items.
  • 60. Applications
      • Scaling individuals for further analysis
          • We often collect data in multifaceted forms (e.g., multi-item surveys) and then collapse them into a single raw score.
          • IRT-based scores represent an optimal scaling of individuals on the trait.
          • Most sophisticated analyses require at least interval-level measurement, and IRT scores are closer to interval level than raw scores.
          • Using scaled scores as opposed to raw scores has been shown to reduce spurious results.
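
To show how an IRT-based score differs from a raw score, the sketch below estimates θ from a response pattern. The slides do not specify an estimator; this example assumes expected a posteriori (EAP) scoring with a standard normal prior evaluated on a fixed grid, 2PL items, and hypothetical item parameters.

```python
import math

def irf_2pl(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def eap_score(responses, items, grid_points=81):
    """Posterior mean of theta on a grid from -4 to 4 (assumed N(0, 1) prior)."""
    grid = [-4.0 + 8.0 * i / (grid_points - 1) for i in range(grid_points)]
    weights = []
    for theta in grid:
        weight = math.exp(-0.5 * theta * theta)  # unnormalized normal prior
        for x, (a, b) in zip(responses, items):
            p = irf_2pl(theta, a, b)
            weight *= p if x == 1 else 1.0 - p   # likelihood of this response
        weights.append(weight)
    return sum(t * w for t, w in zip(grid, weights)) / sum(weights)

# Hypothetical item parameters as (a, b) pairs.
items = [(0.8, -1.0), (1.2, 0.0), (2.0, 1.0)]

# Two response patterns with the same raw score of 2.
score_a = eap_score([1, 1, 0], items)
score_b = eap_score([0, 1, 1], items)
print(score_a, score_b)
```

The two patterns have identical raw scores but different trait estimates, because under the 2PL endorsing a more discriminating item carries more weight.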
  • 61. Applications
      • Scale Construction and Modification
          • The focus is changing from creating fixed-length, paper-and-pencil tests to creating a “universe” of items with known IRFs that can be used interchangeably.
          • Scales are being designed around IRT properties.
          • Pre-existing scales that were developed using CTT are being “revamped” using IRT.
  • 62. Applications
      • Computer Adaptive Testing (CAT)
          • As an extension of the previous slide, once a “universe” (i.e., test bank) of items with known IRFs is created, the items can be used to measure traits in a computer adaptive form.
          • An item (usually of easy to moderate difficulty) is given to the participant, and the answer allows the participant’s trait score to be estimated, so that the next item is chosen to target that trait level.
          • After the second item is answered the trait score is re-estimated, and so on.
  • 63. Applications
      • Computer Adaptive Testing (CAT)
          • Computer adaptive tests are at least twice as efficient as their paper-and-pencil counterparts, with no loss of precision.
          • Primary testing approach used by ETS.
          • An adaptive form of the Headache Impact Survey outperformed its paper-and-pencil counterpart in reducing patient burden, tracking change, and in reliability and validity (Ware et al., 2003).
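
The adaptive loop described in the CAT slides can be sketched in a few lines. Everything specific here is an assumption made for illustration: a hypothetical 2PL item bank, selection of the unused item with maximum information at the current estimate, EAP re-estimation after every answer, a fixed test length as the stopping rule, and a deterministic simulated respondent who endorses any item located below a true trait level of 1.2.

```python
import math

def irf_2pl(theta, a, b):
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    p = irf_2pl(theta, a, b)
    return a * a * p * (1.0 - p)

def eap(responses, items):
    """Posterior mean of theta on a grid, assuming a N(0, 1) prior."""
    grid = [-4.0 + 0.1 * i for i in range(81)]
    posterior = []
    for t in grid:
        w = math.exp(-0.5 * t * t)
        for x, (a, b) in zip(responses, items):
            p = irf_2pl(t, a, b)
            w *= p if x == 1 else 1.0 - p
        posterior.append(w)
    return sum(t * w for t, w in zip(grid, posterior)) / sum(posterior)

def run_cat(bank, answer, n_items):
    """Administer n_items adaptively; returns the estimate and items used."""
    theta = 0.0                      # start at the population mean
    remaining = list(bank)
    given, responses = [], []
    for _ in range(n_items):
        # Pick the unused item that is most informative at the current estimate.
        nxt = max(remaining, key=lambda item: item_information(theta, *item))
        remaining.remove(nxt)
        given.append(nxt)
        responses.append(answer(nxt))
        theta = eap(responses, given)  # re-estimate after every answer
    return theta, given

# Hypothetical bank: equal discriminations, locations from -3 to 3.
bank = [(1.5, b / 2.0) for b in range(-6, 7)]
true_theta = 1.2  # simulated respondent (hypothetical)

def answer(item):
    # Deterministic stand-in for a real respondent.
    return 1 if true_theta > item[1] else 0

theta_hat, administered = run_cat(bank, answer, n_items=6)
print(theta_hat, administered)
```

Operational CAT systems typically add exposure control, content balancing and a precision-based stopping rule; none of that is modelled here.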
  • 64. Applications
      • Test Equating
          • Participants who have taken different tests measuring the same construct (e.g., Beck depression vs. CESD), where both tests have items with known IRFs, can be placed on the same scale and compared or scored equivalently.
          • Equating across grades on math ability.
          • Equating across years for placement or admissions tests.