Chi square test

CHI SQUARE TEST
CONTENTS:
•IMPORTANT TERMS

•INTRODUCTION

•CHARACTERISTICS OF THE TEST

•CHI SQUARE DISTRIBUTION

•APPLICATIONS OF CHI SQUARE TEST

•CALCULATION OF THE CHI SQUARE

•CONDITION FOR THE APPLICATION OF THE TEST

•EXAMPLE

•YATE’S CORRECTION FOR CONTINUITY

•LIMITATIONS OF THE TEST.
IMPORTANT TERMS
1) PARAMETRIC TEST: The test in which, the population
   constants like mean,std deviation, std error, correlation
   coefficient, proportion etc. and data tend to follow one
   assumed or established distribution such as normal,
   binomial, poisson etc.

2) NON PARAMETRIC TEST: the test in which no constant of a
   population is used. Data do not follow any specific
   distribution and no assumption are made in these tests. E.g.
   to classify good, better and best we just allocate arbitrary
   numbers or marks to each category.

3) HYPOTHESIS: It is a definite statement about the population
   parameters.
4) NULL HYPOTHESIS: (H0) states that no association exists
   between the two cross-tabulated variables in the population,
   and therefore the variables are statistically independent. E.g.
   if we want to compare 2 methods method A and method B for
   its superiority, and if the assumption is that both methods are
   equally good, then this assumption is called as NULL
   HYPOTHESIS.

5) ALTERNATIVE HYPOTHESIS: (H1) proposes that the two
   variables are related in the population. If we assume that
   from 2 methods, method A is superior than method B, then
   this assumption is called as ALTERNATIVE HYPOTHESIS.
6) DEGREE OF FREEDOM: It denotes the extent of
   independence (freedom) enjoyed by a given set of observed
   frequencies Suppose we are given a set of n observed
   frequencies which are subjected to k independent
   constraints(restrictions) then,

  d.f. = (number of frequencies) – (number of independent
                                      constraints on them)
  In other terms,
                        df = (r – 1)(c – 1)
  where
               r = the number of rows
               c = the number of columns

7) CONTINGENCY TABLE: When the table is prepared by
   enumeration of qualitative data by entering the actual
   frequencies, and if that table represents occurance of two
   sets of events, that table is called the contingency table.
   (Latin, con- together, tangere- to touch). It is also called as
   an association table.
INTRODUCTION
The chi-square test is an important test amongst the several
 tests of significance developed by statisticians.

Is was developed by Karl Pearson in1900.

CHI SQUARE TEST is a non parametric test not based on any
 assumption or distribution of any variable.

This statistical test follows a specific distribution known as chi
 square distribution.

In general The test we use to measure the differences between
 what is observed and what is expected according to an
 assumed hypothesis is called the chi-square test.
IMPORTANT CHARACTERISTICS OF A CHI
           SQUARE TEST

 This test (as a non-parametric test) is based on
  frequencies and not on the parameters like mean and
  standard deviation.

 The test is used for testing the hypothesis and is not
  useful for estimation.

 This test can also be applied to a complex contingency
   table with several classes and as such is a very useful
  test in research work.

 This test is an important non-parametric test as no rigid
   assumptions are necessary in regard to the type of
   population, no need of parameter values and relatively
  less mathematical details are involved.
CHI SQUARE DISTRIBUTION:
If X1, X2,….Xn are independent normal variates and each is
distributed normally with mean zero and standard deviation
unity, then X12+X22+……+Xn2= ∑ Xi2 is distributed as chi square (c2
)with n degrees of freedom (d.f.) where n is large. The chi square
curve for d.f. N=1,5 and 9 is as follows.
If degree of freedom > 2 : Distribution is bell shaped

If degree of freedom = 2 : Distribution is L shaped with
                           maximum ordinate at zero

If degree of freedom <2 (>0) : Distribution L shaped with
                               infinite ordinate at the origin.
APPLICATIONS OF A CHI SQUARE TEST.
This test can be used in

1) Goodness of fit of distributions

2) test of independence of attributes

3) test of homogenity.
1) TEST OF GOODNESS OF FIT OF DISTRIBUTIONS:

 This test enables us to see how well does the assumed
  theoretical distribution (such as Binomial distribution,
  Poisson distribution or Normal distribution) fit to the
  observed data.

The c2 test formula for goodness of fit is:

                                    (o  e)
                                               2

                              
                        2

                                        e
 Where,
 o = observed frequency
 e = expected frequency

If c2 (calculated) > c2 (tabulated), with (n-1) d.f, then null
 hypothesis is rejected otherwise accepted.

And if null hypothesis is accepted, then it can be concluded
that the given distribution follows theoretical distribution.
2) TEST OF INDEPENDENCE OF ATTRIBUTES

Test enables us to explain whether or not two attributes are
 associated.

For instance, we may be interested in knowing whether a new
 medicine is effective in controlling fever or not, c2 test is
 useful.

In such a situation, we proceed with the null hypothesis that
 the two attributes (viz., new medicine and control of fever) are
 independent which means that new medicine is not effective
 in controlling fever.

c2 (calculated) > c2 (tabulated) at a certain level of
 significancfe for given degrees of freedom, the null hypothesis
 is rejected, i.e. two variables are dependent.(i.e., the new
 medicine is effective in controlling the fever) and if, c2
 (calculated) <c2 (tabulated) ,the null hypothesis is accepted,
 i.e. 2 variables are independent.(i.e., the new medicine is not
 effective in controlling the fever).

when null hypothesis is rejected, it can be concluded that there is
a significant association between two attributes.
3) TEST OF HOMOGENITY

 This test can also be used to test whether the occurance of
  events follow uniformity or not e.g. the admission of
  patients in government hospital in all days of week is
  uniform or not can be tested with the help of chi square
  test.

c2 (calculated) < c2 (tabulated), then null hypothesis is
 accepted, and it can be concluded that there is a uniformity
 in the occurance of the events. (uniformity in the admission
 of patients through out the week)
CALCULATION OF CHI SQUARE

                       (o  e)
                                 2

                
            2

                            e
Where,
O = observed frequency
E = expected frequency
If two distributions (observed and theoretical) are exactly alike,
c2 = 0; (but generally due to sampling errors, c2 is not equal to
zero)
STEPS INVOLVED IN CALCULATING c2

1) Calculate the expected frequencies and the observed
   frequencies:

   Expected frequencies fe : the cell frequencies that would be
  expected in a contingency table if the two variables were
  statistically independent.

  Observed frequencies fo: the cell frequencies actually
  observed in a contingency table.

          fe = (column total)(row total)
                                N
 To obtain the expected frequencies for any cell in any cross-
 tabulation in which the two variables are assumed
 independent, multiply the row and column totals for that cell
 and divide the product by the total number of cases in the
 table.
2) Then c2 is calculated as follows:



                           ( fe  fo )
                                            2

     c           
          2

                                       fe
CONDITIONS FOR THE APPLICATION OF c2
                TEST

The following conditions should be satisfied before X2 test can
be applied:

1) The data must be in the form of frequencies
2) The frequency data must have a precise numerical value and
   must be organised into categories or groups.
3) Observations recorded and used are collected on a random
   basis.
4) All the itmes in the sample must be independent.
5) No group should contain very few items, say less than 10. In
   case where the frequencies are less than 10, regrouping is
   done by combining the frequencies of adjoining groups so
   that the new frequencies become greater than 10. (Some
   statisticians take this number as 5, but 10 is regarded as
   better by most of the statisticians.)
6) The overall number of items must also be reasonably large.
   It should normally be at least 50.
EXAMPLE
Chi square test
YATE’S CORRECTION
If in the 2*2 contingency table, the expected frequencies are small
say less than 5, then c2 test can’t be used. In that case, the direct
formula of the chi square test is modified and given by Yate’s
correction for continuity




                        R1R2C1C2
LIMITATIONS OF A CHI SQUARE TEST

1) The data is from a random sample.

2) This test applied in a four fould table, will not give a reliable
   result with one degree of freedom if the expected value in any
   cell is less than 5.
   in such case, Yate’s correction is necessry. i.e. reduction of
   the mode of (o – e) by half.

3) Even if Yate’s correction, the test may be misleading if any
   expected frequency is much below 5. in that case another
   appropriate test should be applied.

4) In contingency tables larger than 2*2, Yate’s correction
   cannot be applied.

5) Interprit this test with caution if sample total or total of
   values in all the cells is less than 50.
6) This test tells the presence or absence of an association
   between the events but doesn’t measure the strength of
   association.

7) This test doesn’t indicate the cause and effect, it only tells
   the probability of occurance of association by chance.

8) the test is to be applied only when the individual
   observations of sample are independent which means that
   the occurrence of one individual observation
   (event) has no effect upon the occurrence of any other
   observation (event) in the sample under consideration.
THANK YOU
1 von 23

Recomendados

Chi -square test von
Chi -square testChi -square test
Chi -square testVIVEK KUMAR SINGH
48.9K views18 Folien
Chi squared test von
Chi squared testChi squared test
Chi squared testRamakanth Gadepalli
64.7K views38 Folien
Measures of Central Tendency - Biostatstics von
Measures of Central Tendency - BiostatsticsMeasures of Central Tendency - Biostatstics
Measures of Central Tendency - BiostatsticsHarshit Jadav
183K views60 Folien
Analysis of variance (anova) von
Analysis of variance (anova)Analysis of variance (anova)
Analysis of variance (anova)Sadhana Singh
95.4K views30 Folien
Measures of Dispersion von
Measures of DispersionMeasures of Dispersion
Measures of DispersionBirinder Singh Gulati
102.5K views32 Folien

Más contenido relacionado

Was ist angesagt?

T test von
T testT test
T testsai precious
151K views14 Folien
Anova ppt von
Anova pptAnova ppt
Anova pptSravani Ganti
137.6K views25 Folien
Regression Analysis von
Regression AnalysisRegression Analysis
Regression AnalysisBirinder Singh Gulati
73.1K views26 Folien
Testing of hypothesis von
Testing of hypothesisTesting of hypothesis
Testing of hypothesisJags Jagdish
81K views19 Folien
Correlation and Regression von
Correlation and RegressionCorrelation and Regression
Correlation and RegressionRam Kumar Shah "Struggler"
59.6K views54 Folien
Test of significance in Statistics von
Test of significance in StatisticsTest of significance in Statistics
Test of significance in StatisticsVikash Keshri
79.3K views54 Folien

Was ist angesagt?(20)

Test of significance in Statistics von Vikash Keshri
Test of significance in StatisticsTest of significance in Statistics
Test of significance in Statistics
Vikash Keshri79.3K views
Sampling techniques von Bharat Paul
Sampling techniquesSampling techniques
Sampling techniques
Bharat Paul132.3K views
SAMPLING AND SAMPLING ERRORS von rambhu21
SAMPLING AND SAMPLING ERRORSSAMPLING AND SAMPLING ERRORS
SAMPLING AND SAMPLING ERRORS
rambhu21130.5K views
non parametric statistics von Anchal Garg
non parametric statisticsnon parametric statistics
non parametric statistics
Anchal Garg47.9K views
The Binomial, Poisson, and Normal Distributions von SCE.Surat
The Binomial, Poisson, and Normal DistributionsThe Binomial, Poisson, and Normal Distributions
The Binomial, Poisson, and Normal Distributions
SCE.Surat18.9K views
Type i and type ii errors von p24ssp
Type i and type ii errorsType i and type ii errors
Type i and type ii errors
p24ssp104.4K views

Destacado

Chi square test von
Chi square testChi square test
Chi square testAnandapadmanabhan Kottiyam
59.3K views39 Folien
Chi Square Worked Example von
Chi Square Worked ExampleChi Square Worked Example
Chi Square Worked ExampleJohn Barlow
184.4K views15 Folien
Null hypothesis for a chi-square goodness of fit test von
Null hypothesis for a chi-square goodness of fit testNull hypothesis for a chi-square goodness of fit test
Null hypothesis for a chi-square goodness of fit testKen Plummer
15.2K views23 Folien
Reporting chi square goodness of fit test of independence in apa von
Reporting chi square goodness of fit test of independence in apaReporting chi square goodness of fit test of independence in apa
Reporting chi square goodness of fit test of independence in apaKen Plummer
23.6K views14 Folien
Reporting Chi Square Test of Independence in APA von
Reporting Chi Square Test of Independence in APAReporting Chi Square Test of Independence in APA
Reporting Chi Square Test of Independence in APAKen Plummer
174.3K views13 Folien
Student t-test von
Student t-testStudent t-test
Student t-testSteve Bishop
51.3K views65 Folien

Destacado(20)

Chi Square Worked Example von John Barlow
Chi Square Worked ExampleChi Square Worked Example
Chi Square Worked Example
John Barlow184.4K views
Null hypothesis for a chi-square goodness of fit test von Ken Plummer
Null hypothesis for a chi-square goodness of fit testNull hypothesis for a chi-square goodness of fit test
Null hypothesis for a chi-square goodness of fit test
Ken Plummer15.2K views
Reporting chi square goodness of fit test of independence in apa von Ken Plummer
Reporting chi square goodness of fit test of independence in apaReporting chi square goodness of fit test of independence in apa
Reporting chi square goodness of fit test of independence in apa
Ken Plummer23.6K views
Reporting Chi Square Test of Independence in APA von Ken Plummer
Reporting Chi Square Test of Independence in APAReporting Chi Square Test of Independence in APA
Reporting Chi Square Test of Independence in APA
Ken Plummer174.3K views
Analysis of variance (ANOVA) von Sneh Kumari
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
Sneh Kumari156.7K views
Functional matrix revisited /certified fixed orthodontic courses by Indian... von Indian dental academy
Functional matrix revisited    /certified fixed orthodontic courses by Indian...Functional matrix revisited    /certified fixed orthodontic courses by Indian...
Functional matrix revisited /certified fixed orthodontic courses by Indian...
Standard Deviation von JRisi
Standard DeviationStandard Deviation
Standard Deviation
JRisi11.8K views
Null hypothesis for a Chi-Square Test of Independence von Ken Plummer
Null hypothesis for a Chi-Square Test of IndependenceNull hypothesis for a Chi-Square Test of Independence
Null hypothesis for a Chi-Square Test of Independence
Ken Plummer5.4K views
Chi-Square Test of Independence von Ken Plummer
Chi-Square Test of IndependenceChi-Square Test of Independence
Chi-Square Test of Independence
Ken Plummer5.4K views
The chi – square test von Majesty Ortiz
The chi – square testThe chi – square test
The chi – square test
Majesty Ortiz2.2K views
Chi square test final von Har Jindal
Chi square test finalChi square test final
Chi square test final
Har Jindal20.7K views
regression and correlation von Priya Sharma
regression and correlationregression and correlation
regression and correlation
Priya Sharma4.4K views

Similar a Chi square test

Chisquare von
ChisquareChisquare
Chisquarekeerthi samuel
409 views24 Folien
Parametric & non parametric von
Parametric & non parametricParametric & non parametric
Parametric & non parametricANCYBS
3.5K views48 Folien
Chi square von
Chi square Chi square
Chi square HemamaliniSakthivel
290 views19 Folien
Busniess stats von
Busniess statsBusniess stats
Busniess statsVishalHotchandani2
93 views14 Folien
Chi square test von
Chi square testChi square test
Chi square testAmanRathore54
2.1K views15 Folien
Contingency tables von
Contingency tables  Contingency tables
Contingency tables Long Beach City College
1.7K views17 Folien

Similar a Chi square test(20)

Parametric & non parametric von ANCYBS
Parametric & non parametricParametric & non parametric
Parametric & non parametric
ANCYBS3.5K views
Chi square test von Nayna Azad
Chi square testChi square test
Chi square test
Nayna Azad724 views
Inferential Statistics: Chi Square (X2) - DAY 6 - B.ED - 8614 - AIOU von EqraBaig
Inferential Statistics: Chi Square (X2) - DAY 6 - B.ED - 8614 - AIOUInferential Statistics: Chi Square (X2) - DAY 6 - B.ED - 8614 - AIOU
Inferential Statistics: Chi Square (X2) - DAY 6 - B.ED - 8614 - AIOU
EqraBaig312 views
Lect w7 t_test_amp_chi_test von Rione Drevale
Lect w7 t_test_amp_chi_testLect w7 t_test_amp_chi_test
Lect w7 t_test_amp_chi_test
Rione Drevale506 views
chi-Square. test- von shifanaz9
chi-Square. test-chi-Square. test-
chi-Square. test-
shifanaz983 views
CHI SQUARE TEST by Katha Sanyal von Katha Sanyal
CHI SQUARE TEST by Katha SanyalCHI SQUARE TEST by Katha Sanyal
CHI SQUARE TEST by Katha Sanyal
Katha Sanyal105 views
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx von rhetttrevannion
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx
36086 Topic Discussion3Number of Pages 2 (Double Spaced).docx
rhetttrevannion4 views
Statistical techniques used in measurement von ShivamKhajuria3
Statistical techniques used in measurementStatistical techniques used in measurement
Statistical techniques used in measurement
ShivamKhajuria399 views
Categorical data final von Monika
Categorical data finalCategorical data final
Categorical data final
Monika 140 views

Más de Patel Parth

Automated analysis by yatin sankharva copy von
Automated analysis by yatin sankharva   copyAutomated analysis by yatin sankharva   copy
Automated analysis by yatin sankharva copyPatel Parth
13.6K views38 Folien
Automated analysis 112070804013 von
Automated analysis 112070804013Automated analysis 112070804013
Automated analysis 112070804013Patel Parth
23.3K views27 Folien
Anda registration in us eu von
Anda registration in us  euAnda registration in us  eu
Anda registration in us euPatel Parth
5.5K views21 Folien
Anda ppt von
Anda pptAnda ppt
Anda pptPatel Parth
28.8K views25 Folien
Analytical tech in pre formulation 112070804009 von
Analytical tech in pre formulation 112070804009Analytical tech in pre formulation 112070804009
Analytical tech in pre formulation 112070804009Patel Parth
6.4K views38 Folien
Analysis of solid oral von
Analysis of solid oralAnalysis of solid oral
Analysis of solid oralPatel Parth
2.3K views25 Folien

Más de Patel Parth(20)

Automated analysis by yatin sankharva copy von Patel Parth
Automated analysis by yatin sankharva   copyAutomated analysis by yatin sankharva   copy
Automated analysis by yatin sankharva copy
Patel Parth13.6K views
Automated analysis 112070804013 von Patel Parth
Automated analysis 112070804013Automated analysis 112070804013
Automated analysis 112070804013
Patel Parth23.3K views
Anda registration in us eu von Patel Parth
Anda registration in us  euAnda registration in us  eu
Anda registration in us eu
Patel Parth5.5K views
Analytical tech in pre formulation 112070804009 von Patel Parth
Analytical tech in pre formulation 112070804009Analytical tech in pre formulation 112070804009
Analytical tech in pre formulation 112070804009
Patel Parth6.4K views
Analysis of solid oral von Patel Parth
Analysis of solid oralAnalysis of solid oral
Analysis of solid oral
Patel Parth2.3K views
Analysis of solid oral dosage forms 112070804010 von Patel Parth
Analysis of solid oral dosage forms 112070804010Analysis of solid oral dosage forms 112070804010
Analysis of solid oral dosage forms 112070804010
Patel Parth6.4K views
Analysis of parenteral dosage forms bjl final seminar von Patel Parth
Analysis of parenteral dosage forms bjl final seminarAnalysis of parenteral dosage forms bjl final seminar
Analysis of parenteral dosage forms bjl final seminar
Patel Parth9.4K views
Analysis of data (pratik) von Patel Parth
Analysis of data (pratik)Analysis of data (pratik)
Analysis of data (pratik)
Patel Parth1K views
Analysis of cosmetics 112070804018 von Patel Parth
Analysis of cosmetics 112070804018Analysis of cosmetics 112070804018
Analysis of cosmetics 112070804018
Patel Parth64.3K views
Agencies dhwani von Patel Parth
Agencies dhwaniAgencies dhwani
Agencies dhwani
Patel Parth1.5K views
A seminar on applications of various analytical technique von Patel Parth
A seminar on applications of various analytical techniqueA seminar on applications of various analytical technique
A seminar on applications of various analytical technique
Patel Parth4.7K views
23117 copy of oral solid dosage forms von Patel Parth
23117 copy of oral solid dosage forms23117 copy of oral solid dosage forms
23117 copy of oral solid dosage forms
Patel Parth4.4K views
4016 solid state analysis von Patel Parth
4016 solid state analysis4016 solid state analysis
4016 solid state analysis
Patel Parth6K views
4003 regulatory aspect_of_bulk,pharmaceutical,biotech von Patel Parth
4003 regulatory aspect_of_bulk,pharmaceutical,biotech4003 regulatory aspect_of_bulk,pharmaceutical,biotech
4003 regulatory aspect_of_bulk,pharmaceutical,biotech
Patel Parth1.5K views
A.a sequence analysis 112070804002 von Patel Parth
A.a sequence analysis 112070804002A.a sequence analysis 112070804002
A.a sequence analysis 112070804002
Patel Parth4.5K views
Sterility testing 112070804014 von Patel Parth
Sterility testing 112070804014Sterility testing 112070804014
Sterility testing 112070804014
Patel Parth2.8K views
Ria 112070804007 von Patel Parth
Ria 112070804007Ria 112070804007
Ria 112070804007
Patel Parth7.5K views
Qc lab 112070804001 von Patel Parth
Qc lab 112070804001Qc lab 112070804001
Qc lab 112070804001
Patel Parth6.5K views
Pyrogen testing 112070804005 von Patel Parth
Pyrogen testing  112070804005Pyrogen testing  112070804005
Pyrogen testing 112070804005
Patel Parth50.3K views

Chi square test

  • 2. CONTENTS: •IMPORTANT TERMS •INTRODUCTION •CHARACTERISTICS OF THE TEST •CHI SQUARE DISTRIBUTION •APPLICATIONS OF CHI SQUARE TEST •CALCULATION OF THE CHI SQUARE •CONDITION FOR THE APPLICATION OF THE TEST •EXAMPLE •YATE’S CORRECTION FOR CONTINUITY •LIMITATIONS OF THE TEST.
  • 3. IMPORTANT TERMS 1) PARAMETRIC TEST: The test in which, the population constants like mean,std deviation, std error, correlation coefficient, proportion etc. and data tend to follow one assumed or established distribution such as normal, binomial, poisson etc. 2) NON PARAMETRIC TEST: the test in which no constant of a population is used. Data do not follow any specific distribution and no assumption are made in these tests. E.g. to classify good, better and best we just allocate arbitrary numbers or marks to each category. 3) HYPOTHESIS: It is a definite statement about the population parameters.
  • 4. 4) NULL HYPOTHESIS: (H0) states that no association exists between the two cross-tabulated variables in the population, and therefore the variables are statistically independent. E.g. if we want to compare 2 methods method A and method B for its superiority, and if the assumption is that both methods are equally good, then this assumption is called as NULL HYPOTHESIS. 5) ALTERNATIVE HYPOTHESIS: (H1) proposes that the two variables are related in the population. If we assume that from 2 methods, method A is superior than method B, then this assumption is called as ALTERNATIVE HYPOTHESIS.
  • 5. 6) DEGREE OF FREEDOM: It denotes the extent of independence (freedom) enjoyed by a given set of observed frequencies Suppose we are given a set of n observed frequencies which are subjected to k independent constraints(restrictions) then, d.f. = (number of frequencies) – (number of independent constraints on them) In other terms, df = (r – 1)(c – 1) where r = the number of rows c = the number of columns 7) CONTINGENCY TABLE: When the table is prepared by enumeration of qualitative data by entering the actual frequencies, and if that table represents occurance of two sets of events, that table is called the contingency table. (Latin, con- together, tangere- to touch). It is also called as an association table.
  • 6. INTRODUCTION The chi-square test is an important test amongst the several tests of significance developed by statisticians. Is was developed by Karl Pearson in1900. CHI SQUARE TEST is a non parametric test not based on any assumption or distribution of any variable. This statistical test follows a specific distribution known as chi square distribution. In general The test we use to measure the differences between what is observed and what is expected according to an assumed hypothesis is called the chi-square test.
  • 7. IMPORTANT CHARACTERISTICS OF A CHI SQUARE TEST  This test (as a non-parametric test) is based on frequencies and not on the parameters like mean and standard deviation.  The test is used for testing the hypothesis and is not useful for estimation.  This test can also be applied to a complex contingency table with several classes and as such is a very useful test in research work.  This test is an important non-parametric test as no rigid assumptions are necessary in regard to the type of population, no need of parameter values and relatively less mathematical details are involved.
  • 8. CHI SQUARE DISTRIBUTION: If X1, X2,….Xn are independent normal variates and each is distributed normally with mean zero and standard deviation unity, then X12+X22+……+Xn2= ∑ Xi2 is distributed as chi square (c2 )with n degrees of freedom (d.f.) where n is large. The chi square curve for d.f. N=1,5 and 9 is as follows.
  • 9. If degree of freedom > 2 : Distribution is bell shaped If degree of freedom = 2 : Distribution is L shaped with maximum ordinate at zero If degree of freedom <2 (>0) : Distribution L shaped with infinite ordinate at the origin.
  • 10. APPLICATIONS OF A CHI SQUARE TEST. This test can be used in 1) Goodness of fit of distributions 2) test of independence of attributes 3) test of homogenity.
  • 11. 1) TEST OF GOODNESS OF FIT OF DISTRIBUTIONS:  This test enables us to see how well does the assumed theoretical distribution (such as Binomial distribution, Poisson distribution or Normal distribution) fit to the observed data. The c2 test formula for goodness of fit is: (o  e) 2    2 e Where, o = observed frequency e = expected frequency If c2 (calculated) > c2 (tabulated), with (n-1) d.f, then null hypothesis is rejected otherwise accepted. And if null hypothesis is accepted, then it can be concluded that the given distribution follows theoretical distribution.
  • 12. 2) TEST OF INDEPENDENCE OF ATTRIBUTES Test enables us to explain whether or not two attributes are associated. For instance, we may be interested in knowing whether a new medicine is effective in controlling fever or not, c2 test is useful. In such a situation, we proceed with the null hypothesis that the two attributes (viz., new medicine and control of fever) are independent which means that new medicine is not effective in controlling fever. c2 (calculated) > c2 (tabulated) at a certain level of significancfe for given degrees of freedom, the null hypothesis is rejected, i.e. two variables are dependent.(i.e., the new medicine is effective in controlling the fever) and if, c2 (calculated) <c2 (tabulated) ,the null hypothesis is accepted, i.e. 2 variables are independent.(i.e., the new medicine is not effective in controlling the fever). when null hypothesis is rejected, it can be concluded that there is a significant association between two attributes.
  • 13. 3) TEST OF HOMOGENITY  This test can also be used to test whether the occurance of events follow uniformity or not e.g. the admission of patients in government hospital in all days of week is uniform or not can be tested with the help of chi square test. c2 (calculated) < c2 (tabulated), then null hypothesis is accepted, and it can be concluded that there is a uniformity in the occurance of the events. (uniformity in the admission of patients through out the week)
  • 14. CALCULATION OF CHI SQUARE (o  e) 2    2 e Where, O = observed frequency E = expected frequency If two distributions (observed and theoretical) are exactly alike, c2 = 0; (but generally due to sampling errors, c2 is not equal to zero)
  • 15. STEPS INVOLVED IN CALCULATING c2 1) Calculate the expected frequencies and the observed frequencies: Expected frequencies fe : the cell frequencies that would be expected in a contingency table if the two variables were statistically independent. Observed frequencies fo: the cell frequencies actually observed in a contingency table. fe = (column total)(row total) N To obtain the expected frequencies for any cell in any cross- tabulation in which the two variables are assumed independent, multiply the row and column totals for that cell and divide the product by the total number of cases in the table.
  • 16. 2) Then c2 is calculated as follows: ( fe  fo ) 2 c   2 fe
  • 17. CONDITIONS FOR THE APPLICATION OF c2 TEST The following conditions should be satisfied before X2 test can be applied: 1) The data must be in the form of frequencies 2) The frequency data must have a precise numerical value and must be organised into categories or groups. 3) Observations recorded and used are collected on a random basis. 4) All the itmes in the sample must be independent. 5) No group should contain very few items, say less than 10. In case where the frequencies are less than 10, regrouping is done by combining the frequencies of adjoining groups so that the new frequencies become greater than 10. (Some statisticians take this number as 5, but 10 is regarded as better by most of the statisticians.) 6) The overall number of items must also be reasonably large. It should normally be at least 50.
  • 20. YATE’S CORRECTION If in the 2*2 contingency table, the expected frequencies are small say less than 5, then c2 test can’t be used. In that case, the direct formula of the chi square test is modified and given by Yate’s correction for continuity R1R2C1C2
  • 21. LIMITATIONS OF A CHI SQUARE TEST 1) The data is from a random sample. 2) This test applied in a four fould table, will not give a reliable result with one degree of freedom if the expected value in any cell is less than 5. in such case, Yate’s correction is necessry. i.e. reduction of the mode of (o – e) by half. 3) Even if Yate’s correction, the test may be misleading if any expected frequency is much below 5. in that case another appropriate test should be applied. 4) In contingency tables larger than 2*2, Yate’s correction cannot be applied. 5) Interprit this test with caution if sample total or total of values in all the cells is less than 50.
  • 22. 6) This test tells the presence or absence of an association between the events but doesn’t measure the strength of association. 7) This test doesn’t indicate the cause and effect, it only tells the probability of occurance of association by chance. 8) the test is to be applied only when the individual observations of sample are independent which means that the occurrence of one individual observation (event) has no effect upon the occurrence of any other observation (event) in the sample under consideration.