Criteria for Good Measurement
Sushant Kumar Sinha
Sushovan Bej
Criteria for good measurements?
“The use of better instrument will ensure more
accuracy in results, which in turn, will enhance the
scientific quality of the research”
There are three major criteria for evaluating a
measurement tool.
1. Validity
2. Reliability
3. Sensitivity
Validity
It is the ability of an instrument to measure what it is
supposed to measure.
That is, when we ask a set of questions in the hope of
tapping a concept, how can we be reasonably certain
that we are indeed measuring the concept we set out
to measure and not something else?
Establishing Validity
Researchers have attempted to assess validity in many ways.
They attempt to provide some evidence of a measure’s
degree of validity by answering a variety of questions.
The four basic approaches to establishing validity are
widely classified as follows:
1. Face Validity
2. Content Validity
3. Criterion-Related Validity
4. Construct Validity
Face Validity
It is considered a basic, minimum index of content validity.
It indicates that the items intended to measure a concept do,
on the face of it, look like they measure the concept.
For example, few people would accept a measure of college students' math
ability that asked: 2 + 2 = ? On the face of it, this is not a valid
measure of college-level math ability.
Face validity is a subjective agreement among professionals that a
scale logically appears to reflect accurately what it is supposed to
measure. When it appears evident to experts that the measure provides
adequate coverage of the concept, the measure has face validity.
Continued…
Clear, understandable questions such as “How many children do you
have?” generally are agreed to have face validity. But it becomes more
difficult to assess face validity in regard to more complicated
business phenomena.
For instance, consider the concept of customer loyalty. Does the
statement “I prefer to purchase my groceries at Delavan Fine Foods”
appear to capture loyalty? How about “I am very satisfied with my
purchases from Delavan Fine Foods”? What about “Delavan Fine
Foods offers very good value”? While the first statement appears to
capture loyalty, it can be argued that the second statement captures
not loyalty but satisfaction. What does the third statement reflect?
Do we think it looks like a loyalty statement?
Content Validity
The content validity of a measuring instrument is the
extent to which it provides adequate coverage of the
investigative questions guiding the study. If the
instrument contains a representative sample of the
universe of subject matter of interest, then the
content validity is good.
To put it differently, content validity is a function of
how well the dimensions and elements of a concept
have been delineated
Continued…
Look at the concept of feminism which implies a person's
commitment to a set of beliefs creating full equality between men and
women in areas of the arts, intellectual pursuits, family, work, politics,
and authority relations. Does this definition provide adequate coverage
of the different dimensions of the concept?
Then we have the following two questions to measure feminism:
1. Should men and women get equal pay for equal work?
2. Should men and women share household tasks?
These two questions do not cover all the dimensions
delineated earlier. They definitely fall short of adequate content validity
for measuring feminism.
Criterion-Related Validity
Criterion validity uses some standard or criterion to
indicate a construct accurately. The validity of an
indicator is verified by comparing it with another
measure of the same construct in which researchers have
confidence.
There are two subtypes of this kind of validity
1. Concurrent Validity
2. Predictive Validity
Concurrent Validity
To have concurrent validity, an indicator must be associated with
a preexisting indicator that is judged to be valid.
For example, suppose we create a new test to measure intelligence.
For it to be concurrently valid, it should be highly associated with
existing IQ tests (assuming the same definition of intelligence is
used). This means that most people who score high on the old measure
should also score high on the new one, and vice versa.
The two measures may not be perfectly associated, but if they
measure the same or a similar construct, it is logical for them to
yield similar results.
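The association described above is commonly checked with a correlation coefficient. The following is a minimal sketch, assuming Pearson's r as the measure of association (the slides do not name a statistic); the scores and the `pearson` helper are hypothetical and for illustration only.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    return sum(a * b for a, b in zip(dx, dy)) / sqrt(
        sum(a * a for a in dx) * sum(b * b for b in dy)
    )

# Hypothetical scores for the same eight people (illustrative data only).
existing_iq_test = [95, 110, 102, 130, 88, 121, 105, 99]
new_test = [48, 57, 52, 66, 43, 61, 55, 49]

r = pearson(existing_iq_test, new_test)
print(f"correlation between old and new measure: {r:.2f}")
```

A coefficient close to 1 would be read as evidence of concurrent validity; how high is "high enough" is a judgment the researcher has to justify.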
Predictive Validity
Criterion validity whereby an indicator predicts future events that are
logically related to a construct is called predictive validity. It cannot be
used for all measures. The measure and the action predicted must be
distinct from each other but indicate the same construct. Predictive
measurement validity should not be confused with prediction in hypothesis
testing, where one variable predicts a different variable in the future.
For example, consider the scholastic assessment tests given to candidates
seeking admission in different subjects. These are supposed to measure the
scholastic aptitude of the candidates, that is, their ability to perform in
the institution as well as in the subject. If such a test has high predictive
validity, then candidates who get high test scores will subsequently do well
in their subjects. If students with high scores perform the same as students
with average or low scores, then the test has low predictive validity.
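The comparison in the admission-test example can be sketched as follows. This is an illustration under stated assumptions: the score and grade pairs are invented, and splitting candidates at the median test score is one simple choice among many.

```python
from statistics import mean, median

# Hypothetical (admission_test_score, later_course_grade) pairs; illustrative only.
candidates = [(520, 61), (700, 84), (610, 70), (450, 55),
              (680, 79), (560, 66), (740, 88), (490, 58)]

# Assumed rule: candidates above the median test score count as "high scorers".
cutoff = median(score for score, _ in candidates)
high = [grade for score, grade in candidates if score > cutoff]
low = [grade for score, grade in candidates if score <= cutoff]

gap = mean(high) - mean(low)
print(f"mean later grade, high scorers:  {mean(high):.2f}")
print(f"mean later grade, other scorers: {mean(low):.2f}")
print(f"gap: {gap:.2f}")
```

A gap near zero would correspond to the low predictive validity case in the text, where high scorers later perform the same as everyone else.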
Construct Validity
Construct validity is for measures with multiple indicators.
It addresses the question: If the measure is valid, do the
various indicators operate in a consistent manner? It
requires a definition with clearly specified conceptual
boundaries. To evaluate construct validity, we
consider both the theory and the measuring instrument being
used.
There are two subtypes of this kind of validity
1. Convergent Validity
2. Discriminant Validity
Convergent Validity
Convergent validity means that multiple measures of the
same construct hang together or operate in similar ways.
For example, we measure "education" by asking people how
much education they have completed, looking at their
institutional records, and asking them to complete a test
of school-level knowledge. If the measures do not converge
(e.g. people who claim to have a college degree have no
record of attending college, or those with a college degree
perform no better than high school dropouts on the test),
then our measure has weak convergent validity and we should
not combine all three indicators into one measure.
Discriminant Validity
Also called divergent validity, discriminant validity is the
opposite of convergent validity. It means that the indicators of
one construct hang together or converge, but also diverge from or are
negatively associated with opposing constructs. In other words, if two
constructs A and B are very different, then measures of A and B
should not be associated.
For example, we have 10 items that measure political
conservatism. People answer all 10 in similar ways. But we have
also put 5 questions in the same questionnaire that measure
political liberalism. Our measure of conservatism has
discriminant validity if the 10 conservatism items hang together
and are negatively associated with the 5 liberalism items.
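The conservatism and liberalism example can be sketched numerically. This is a minimal illustration, assuming each respondent's answers have already been averaged into one scale score per construct and that Pearson's r is the measure of association; the data and the `pearson` helper are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    return sum(a * b for a, b in zip(dx, dy)) / sqrt(
        sum(a * a for a in dx) * sum(b * b for b in dy)
    )

# Hypothetical scale scores for six respondents (1 = low, 5 = high).
conservatism = [4.6, 1.8, 3.9, 2.2, 4.1, 1.5]  # mean of the 10 conservatism items
liberalism = [1.4, 4.4, 2.0, 3.8, 2.2, 4.6]    # mean of the 5 liberalism items

r_opposing = pearson(conservatism, liberalism)
print(f"conservatism vs liberalism: {r_opposing:.2f}")
```

A clearly negative coefficient is what the text describes for opposing constructs; a coefficient near zero would be expected for constructs that are merely unrelated.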
Reliability
The reliability of a measure indicates the extent to
which it is without bias (error free) and hence ensures
consistent measurement across time and across the
various items in the instrument.
In other words, the reliability of a measure is an
indication of the stability and consistency with
which the instrument measures the concept, and it helps
to assess the “goodness” of a measure.
Stability
The ability of the measure to remain the same over
time despite uncontrollable testing conditions or the
state of the respondents themselves is indicative of its
stability and low vulnerability to changes in the
situation.
This attests to its "goodness" because the concept is
stably measured, no matter when it is done.
Two tests of stability are test-retest reliability and
parallel-form reliability.
Test-Retest Reliability
The test-retest method of determining reliability involves administering the same
scale to the same respondents at two separate times to test for stability. If the
measure is stable over time, the test, administered under the same conditions
each time, should obtain similar results.
For example, suppose a researcher measures job satisfaction and finds that 64
percent of the population is satisfied with their jobs. If the study is repeated a
few weeks later under similar conditions, and the researcher again finds that 64
percent of the population is satisfied with their jobs, it appears that the
measure has repeatability.
A high stability correlation, or consistency between the two measures at
time 1 and at time 2, indicates a high degree of reliability. The example above
was at the aggregate level; the same exercise can be applied at the individual
level. When the measuring instrument produces unpredictable results from one
testing to the next, the results are said to be unreliable because of error in
measurement.
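Both levels of comparison can be sketched in a few lines. This is a minimal illustration, assuming 1-5 satisfaction ratings, a cut-off of 4 for "satisfied", and Pearson's r as the stability correlation; the ratings, the cut-off, and the helper names are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    return sum(a * b for a, b in zip(dx, dy)) / sqrt(
        sum(a * a for a in dx) * sum(b * b for b in dy)
    )

def percent_satisfied(ratings, threshold=4):
    """Share of respondents rating at or above the (assumed) cut-off."""
    return 100 * sum(r >= threshold for r in ratings) / len(ratings)

# Hypothetical ratings from the same ten respondents, a few weeks apart.
time_1 = [4, 5, 2, 4, 3, 5, 1, 4, 2, 4]
time_2 = [4, 5, 2, 5, 3, 4, 1, 4, 2, 4]

aggregate_1 = percent_satisfied(time_1)
aggregate_2 = percent_satisfied(time_2)
stability = pearson(time_1, time_2)

print(f"satisfied at time 1: {aggregate_1:.0f}%, at time 2: {aggregate_2:.0f}%")
print(f"individual-level stability correlation: {stability:.2f}")
```

In this made-up data the aggregate percentage is identical at both times even though two respondents changed their ratings, which is why the individual-level correlation is worth checking as well.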
Parallel-Form Reliability
When responses on two comparable sets of measures
tapping the same construct are highly correlated, we have
parallel-form reliability. It is also called equivalent-form
reliability.
Both forms have similar items and the same response format,
the only changes being the wording and the order or
sequence of the questions. What we try to establish here is
the error variability resulting from the wording and ordering of
the questions. If two such comparable forms are highly
correlated, we may be fairly certain that the measures are
reasonably reliable, with minimal error variance caused by
wording, ordering, or other factors.
Internal Consistency of Measure
Internal consistency of measures is indicative of the
homogeneity of the items in the measure that tap the
construct.
In other words, the items should "hang together as a set"
and be capable of independently measuring the same
concept so that the respondents attach the same overall
meaning to each of the items. This can be seen by
examining whether the items and the subsets of items in the
measuring instrument are highly correlated. Consistency
can be examined through inter-item consistency
reliability and split-half reliability.
Inter-Item Consistency Reliability
This is a test of the consistency of respondents’
answers to all the items in a measure. To the degree
that items are independent measures of the same
concept, they will be correlated with one another.
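One widely used summary of inter-item consistency is Cronbach's alpha. The slides do not name a statistic, so the sketch below is an assumption about how the check might be carried out; the ratings and the function name `cronbach_alpha` are hypothetical.

```python
from statistics import variance

def cronbach_alpha(items):
    """Cronbach's alpha for a list of items.

    items: one inner list of scores per item, with the same respondents
    in the same order in every inner list.
    """
    k = len(items)
    # Total score per respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))

# Hypothetical 1-5 ratings: three items answered by five respondents.
items = [
    [4, 2, 5, 3, 1],
    [5, 2, 4, 3, 1],
    [4, 1, 5, 3, 2],
]

alpha = cronbach_alpha(items)
print(f"Cronbach's alpha: {alpha:.2f}")
```

Alpha rises toward 1 as the items move together across respondents, which matches the idea that items measuring the same concept will be correlated with one another.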
Split-Half Reliability
It reflects the correlation between two halves of an
instrument. The estimates could vary depending on how
the items in the measure are split into two halves.
The technique of splitting halves is the most basic method
for checking internal consistency when measures contain a
large number of items. In the split-half method the
researcher may take the results obtained from one half of
the scale items (e.g. odd-numbered items) and check them
against the results from the other half of the items (e.g.
even-numbered items). A high correlation tells us there
is similarity (or homogeneity) among the items.
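The odd/even split can be sketched as follows. This is a minimal illustration, assuming Pearson's r for the correlation between halves; the Spearman-Brown step-up at the end is a commonly applied adjustment that the slides do not mention, and the responses and helper names are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    return sum(a * b for a, b in zip(dx, dy)) / sqrt(
        sum(a * a for a in dx) * sum(b * b for b in dy)
    )

def split_half(responses):
    """responses: one list of item scores per respondent."""
    odd_half = [sum(r[0::2]) for r in responses]   # items 1, 3, 5, ...
    even_half = [sum(r[1::2]) for r in responses]  # items 2, 4, 6, ...
    r_half = pearson(odd_half, even_half)
    # Spearman-Brown step-up: estimates reliability of the full-length scale
    # from the correlation between two half-length scales (an added assumption).
    r_full = 2 * r_half / (1 + r_half)
    return r_half, r_full

# Hypothetical 1-5 ratings: five respondents, six items each.
responses = [
    [4, 5, 4, 4, 5, 4],
    [2, 1, 2, 2, 1, 2],
    [3, 3, 4, 3, 3, 3],
    [5, 4, 5, 5, 5, 4],
    [1, 2, 1, 2, 1, 1],
]

r_half, r_full = split_half(responses)
print(f"correlation between halves: {r_half:.2f}")
print(f"Spearman-Brown estimate:    {r_full:.2f}")
```

Splitting the items differently (for example first half against second half) would generally give a different estimate, which is the dependence on the split that the text points out.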
Reliability vs Validity
Reliability is a necessary but not sufficient condition
for validity. A reliable scale may not be valid.
For example, a purchase intention measurement
technique may consistently indicate that 20 percent of
those sampled are willing to purchase a new product.
Whether the measure is valid depends on whether 20
percent of the population indeed purchases the
product. A reliable but invalid instrument will yield
consistently inaccurate results
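The purchase-intention example can be put in numbers. This is a sketch under stated assumptions: the repeated survey estimates and the actual purchase rate are invented, spread across repetitions stands in for reliability, and the gap to the actual rate stands in for validity.

```python
from statistics import mean, pstdev

# Hypothetical results from five repeated surveys (percent willing to purchase).
survey_estimates = [20.1, 19.8, 20.0, 20.3, 19.8]
# Assumed actual purchase rate in the population, for illustration only.
actual_purchase_rate = 8.0

spread = pstdev(survey_estimates)                     # small spread: consistent
bias = mean(survey_estimates) - actual_purchase_rate  # large gap: inaccurate

print(f"spread across repetitions: {spread:.2f} percentage points")
print(f"gap to actual purchase rate: {bias:.2f} percentage points")
```

A small spread together with a large gap is the "reliable but invalid" case: the instrument agrees with itself every time while consistently missing the true value.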
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 

Reliability and Validity

  • 1. Criteria for Good Measurement Sushant Kumar Sinha Sushovan Bej
  • 2. Criteria for good measurements? “The use of better instruments will ensure more accuracy in results, which in turn, will enhance the scientific quality of the research.” There are three major criteria for evaluating a measurement tool: 1. Validity 2. Reliability 3. Sensitivity
  • 3. Validity It is the ability of an instrument to measure what it is supposed to measure. That is, when we ask questions with the hope that we are tapping the concept, how can we be reasonably certain that we are indeed measuring the concept we set out to measure and not something else?
  • 4. Establishing Validity Researchers have attempted to assess validity in many ways. They attempt to provide some evidence of a measure’s degree of validity by answering a variety of questions. The four basic approaches to establishing validity are widely classified as follows: 1. Face Validity 2. Content Validity 3. Criterion-Related Validity 4. Construct Validity
  • 5. Face Validity It is considered a basic and very minimum index of content validity. It indicates that the items that are intended to measure a concept do, on the face of it, look like they measure the concept. For example, few people would accept a measure of college students' math ability that asked students: 2 + 2 = ? This is not a valid measure of college-level math ability on the face of it. Face validity is a subjective agreement among professionals that a scale logically appears to reflect accurately what it is supposed to measure. When it appears evident to experts that the measure provides adequate coverage of the concept, a measure has face validity.
  • 6. Continued… Clear, understandable questions such as “How many children do you have?” generally are agreed to have face validity. But it becomes more difficult to assess face validity in regard to more complicated business phenomena. For instance, consider the concept of customer loyalty. Does the statement “I prefer to purchase my groceries at Delavan Fine Foods” appear to capture loyalty? How about “I am very satisfied with my purchases from Delavan Fine Foods”? What about “Delavan Fine Foods offers very good value”? While the first statement appears to capture loyalty, it can be argued that the second statement reflects not loyalty but rather satisfaction. What does the third statement reflect? Do we think it looks like a loyalty statement?
  • 7. Content Validity The content validity of a measuring instrument is the extent to which it provides adequate coverage of the investigative questions guiding the study. If the instrument contains a representative sample of the universe of subject matter of interest, then the content validity is good. To put it differently, content validity is a function of how well the dimensions and elements of a concept have been delineated.
  • 8. Continued… Look at the concept of feminism, which implies a person's commitment to a set of beliefs creating full equality between men and women in areas of the arts, intellectual pursuits, family, work, politics, and authority relations. Does this definition provide adequate coverage of the different dimensions of the concept? Suppose we then have the following two questions to measure feminism: 1. Should men and women get equal pay for equal work? 2. Should men and women share household tasks? These two questions do not cover all the dimensions delineated earlier. The measure definitely falls short of adequate content validity for measuring feminism.
  • 9. Criterion-Related Validity Criterion validity uses some standard or criterion to indicate a construct accurately. The validity of an indicator is verified by comparing it with another measure of the same construct in which researchers have confidence. There are two subtypes of this kind of validity: 1. Concurrent Validity 2. Predictive Validity
  • 10. Concurrent Validity To have concurrent validity, an indicator must be associated with a preexisting indicator that is judged to be valid. For example, suppose we create a new test to measure intelligence. For it to be concurrently valid, it should be highly associated with existing IQ tests (assuming the same definition of intelligence is used). This means that most people who score high on the old measure should also score high on the new one, and vice versa. The two measures may not be perfectly associated, but if they measure the same or a similar construct, it is logical for them to yield similar results.
  • 11. Predictive Validity Criterion validity whereby an indicator predicts future events that are logically related to a construct is called predictive validity. It cannot be used for all measures. The measure and the action predicted must be distinct from but indicate the same construct. Predictive measurement validity should not be confused with prediction in hypothesis testing, where one variable predicts a different variable in the future. For example, consider the scholastic assessment tests given to candidates seeking admission in different subjects. These are supposed to measure the scholastic aptitude of the candidates: the ability to perform in the institution as well as in the subject. If such a test has high predictive validity, then candidates who get high test scores will subsequently do well in their subjects. If students with high scores perform the same as students with average or low scores, then the test has low predictive validity.
  • 12. Construct Validity Construct validity is for measures with multiple indicators. It addresses the question: If the measure is valid, do the various indicators operate in a consistent manner? It requires a definition with clearly specified conceptual boundaries. In order to evaluate construct validity, we consider both theory and the measuring instrument being used. There are two subtypes of this kind of validity: 1. Convergent Validity 2. Discriminant Validity
  • 13. Convergent Validity Convergent validity means that multiple measures of the same construct hang together or operate in similar ways. For example, we measure "education" by asking people how much education they have completed, looking at their institutional records, and asking people to complete a test of school-level knowledge. If the measures do not converge (i.e. people claim to have a college degree but have no record of attending college, or those with college degrees perform no better than high school dropouts on the test), then our measure has weak convergent validity and we should not combine all three indicators into one measure.
  • 14. Discriminant Validity Also called divergent validity, discriminant validity is the opposite of convergent validity. It means that the indicators of one construct hang together or converge, but also diverge or are negatively associated with opposing constructs. It says that if two constructs A and B are very different, then measures of A and B should not be associated. For example, we have 10 items that measure political conservatism. People answer all 10 in similar ways. But we have also put 5 questions in the same questionnaire that measure political liberalism. Our measure of conservatism has discriminant validity if the 10 conservatism items hang together and are negatively associated with the 5 liberalism items.
  • 15. Reliability The reliability of a measure indicates the extent to which it is without bias (error free) and hence ensures consistent measurement across time and across the various items in the instrument. In other words, the reliability of a measure is an indication of the stability and consistency with which the instrument measures the concept and helps to assess the “goodness” of a measure.
  • 16. Stability The ability of the measure to remain the same over time despite uncontrollable testing conditions or the state of the respondents themselves is indicative of its stability and low vulnerability to changes in the situation. This attests to its "goodness" because the concept is stably measured, no matter when it is done. Two tests of stability are test-retest reliability and parallel-form reliability.
  • 17. Test-Retest Reliability The test-retest method of determining reliability involves administering the same scale to the same respondents at two separate times to test for stability. If the measure is stable over time, the test, administered under the same conditions each time, should obtain similar results. For example, suppose a researcher measures job satisfaction and finds that 64 percent of the population is satisfied with their jobs. If the study is repeated a few weeks later under similar conditions, and the researcher again finds that 64 percent of the population is satisfied with their jobs, it appears that the measure has repeatability. A high stability correlation or consistency between the two measures at time 1 and at time 2 indicates a high degree of reliability. This was at the aggregate level; the same exercise can be applied at the individual level. When the measuring instrument produces unpredictable results from one testing to the next, the results are said to be unreliable because of error in measurement.
  • 18. Parallel-Form Reliability When responses on two comparable sets of measures tapping the same construct are highly correlated, we have parallel-form reliability. It is also called equivalent-form reliability. Both forms have similar items and the same response format, the only changes being the wording and the order or sequence of the questions. What we try to establish here is the error variability resulting from wording and ordering of the questions. If two such comparable forms are highly correlated, we may be fairly certain that the measures are reasonably reliable, with minimal error variance caused by wording, ordering, or other factors.
  • 19. Internal Consistency of Measure Internal consistency of measures is indicative of the homogeneity of the items in the measure that tap the construct. In other words, the items should “hang together as a set” and be capable of independently measuring the same concept so that the respondents attach the same overall meaning to each of the items. This can be seen by examining if the items and the subsets of items in the measuring instrument are highly correlated. Consistency can be examined through inter-item consistency reliability and split-half reliability.
  • 20. Inter-item Consistency Reliability This is a test of the consistency of respondents’ answers to all the items in a measure. To the degree that items are independent measures of the same concept, they will be correlated with one another.
  • 21. Split-Half Reliability It reflects the correlation between two halves of an instrument. The estimates could vary depending on how the items in the measure are split into two halves. The technique of splitting halves is the most basic method for checking internal consistency when measures contain a large number of items. In the split-half method the researcher may take the results obtained from one half of the scale items (e.g. odd-numbered items) and check them against the results from the other half of the items (e.g. even-numbered items). A high correlation tells us there is similarity (or homogeneity) among the items.
  • 22. Reliability vs Validity Reliability is a necessary but not sufficient condition for validity. A reliable scale may not be valid. For example, a purchase intention measurement technique may consistently indicate that 20 percent of those sampled are willing to purchase a new product. Whether the measure is valid depends on whether 20 percent of the population indeed purchases the product. A reliable but invalid instrument will yield consistently inaccurate results.
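
The test-retest and split-half checks described in slides 17 and 21 both come down to correlating two sets of scores. The following is a minimal Python sketch of that idea, not a method prescribed by the slides. It assumes the Pearson correlation coefficient as the measure of association, and the function names (`pearson`, `retest_reliability`, `split_half_reliability`) and all scores are hypothetical, made up for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length score lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when all scores are identical")
    return cov / sqrt(var_x * var_y)


def retest_reliability(time1_scores, time2_scores):
    """Test-retest: correlate the same respondents' scores at time 1 and time 2."""
    return pearson(time1_scores, time2_scores)


def split_half_reliability(responses):
    """Split-half: correlate odd-numbered-item totals with even-numbered-item totals.

    `responses` holds one row per respondent, each row a list of item scores.
    """
    odd_totals = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, ...
    even_totals = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    return pearson(odd_totals, even_totals)


# Hypothetical data: 5 respondents, 4 items each, scored 1-5 (not from the slides).
responses = [
    [4, 5, 4, 4],
    [2, 1, 2, 2],
    [3, 3, 4, 3],
    [5, 5, 5, 4],
    [1, 2, 1, 2],
]
time1 = [sum(row) for row in responses]
time2 = [16, 8, 13, 18, 7]  # hypothetical retest totals a few weeks later

print(f"test-retest r = {retest_reliability(time1, time2):.2f}")
print(f"split-half  r = {split_half_reliability(responses):.2f}")
```

A correlation close to 1 in either check would be read, in the slides' terms, as evidence of stability over time or of homogeneity among the items. As slide 21 notes, the split-half estimate can change if the items are divided differently, so the odd/even split here is only one possible choice.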

Editor's Notes

  1. For example, consider the controversy about highway patrol officers using radar guns to clock speeders. A driver is clocked at 83 mph in a 55 mph zone, but the same radar gun aimed at a house registers 28 mph. The error occurred because the radar gun had picked up impulses from the electrical system of the squad car’s idling engine.