Standardization of tests is a very critical & time-consuming task. It can be done for Ph.D.-level research. The complete procedure of standardization is described in the given presentation.
3. DEFINITION OF A STANDARDIZED TEST
A standardized test is any form of test
that requires all test takers to answer the same
questions, or a selection of questions from common
bank of questions, in the same way,
and
that is scored in a standard or consistent manner,
which makes it possible to compare the relative
performance of individual students or groups of
students.
4. A standardized test is a test that is administered
and scored in a consistent or "standard" manner.
Standardized tests are designed in such a way that
the questions, conditions for administration,
scoring procedures, and interpretations are
consistent and are administered and scored in a
predetermined, standard manner.
DEFINITION OF A STANDARDIZED TEST
5. -
One has to read textbooks
reference books, journals, research
reports, tests etc. to study the
content related with the variable for
which he is planning to prepare a
test.
6. • Researcher has to do
comprehensive search for the
key term for which he/she is
going to prepare the test/tool.
• In other words he/she has to
analyse critically the purpose
of the preparation of the tool
• Researcher has to study the
variable thoroughly for which
he/she is going to prepare the
tool.
7. The items may be of different type multiple
choices or any other.
Items/questions should be objective, simple, non
ambiguous and their answering should be easy.
8. • General & question wise specific instructions should be
given. Instructions should be clear & non ambiguous.
• What is to be done, how much is the time given to solve
the test, how to write answers –all these things are to
be made clear.
9. • After taking experts’ advice & making rectifications, answer key,
model answer paper, marking scheme are to be prepared carefully.
• First administration is to be done on a small group.
• Advantages of the first administration—problems regarding
understanding the question, adequacy of the allotted time & other
problems in the actual administration can be found out. Faulty
questions, ambiguous questions can be identified. Sentence
construction & language of the question can be improved.
Comprehensiveness & adequacy of the instructions can be
checked.
10. • The test is revised & administered on a preliminary
group.
• Curve is drawn & compared with NPC. From that
nature of the group is understood.
11. • Upper 27% & lower 27% are considered.
• Percentage of students solving each item correctly is found out.
• Using statistical formula item analysis is done.
Index of Difficulty
Ru-RL
P = ---------------* 100
T
Where
Ru= Number of the students in the upper group
who answered correctly.
RL=Number of the students in the lower group
who answered correctly.
T==total number of the students who tried the
item
Index of item discriminating power
Ru+RL
D=------------------* 100
½ T
Where
Ru= Number of the students in the upper
group who answered correctly.
RL=Number of the students in the lower group
who answered correctly.
T==total number of the students who tried
the item
D= discriminating index.
12. The questions with required level of discriminating
power & difficulty level are selected. Others are
discarded. Number of questions is made limited.
Time, title ,revised instructions & the expected way
of writing answers ---all these things are described
in detail.
13. • The actual process of standardization is done at
this step.
• Reliability, validity, objectivity are found out.
• The test is administered on a representative group
of 2000 to 5000 respondents . Evaluation is done
as per the marking scheme. Different statistical
measures & other criteria are found out- M,
Standard deviation, PR etc.
16. The term validity refers to whether or not the test measures what it claims to measure.
Reliability tells us how consistently the test scores measure something..
Objectivity means absence of subjectivity. A test is considered to be objective when it makes for
the elimination of the scorer's personal opinion and bias judgment.
Discriminating power (DP) of a test's item can be measured by comparing the number of people with
high test scores who answered that item correctly with the number of people with low scores who
answered the same item correctly.
Adequacy is comprehensiveness or totality of the measurement.
Usability is the degree to which something is able or fit to be used.
17. The complete procedure of preparation of the
test, its characteristics, its administration,
analysis, interpretation, norms---- all these things
are described in the manual.
So manual is the important part of any
standardized test. This is the last step of the
standardization process. Now the test is ready to
use. Obviously researcher has to give the
references used.