SlideShare ist ein Scribd-Unternehmen logo
1 von 147
Chi Square Test of Independence 
Conceptual
Questions of Independence
Questions of independence are actually 
the flip side of questions of relationship. If 
a variable is independent of another variable, 
then functions in one will not be accompanied 
by functions in the other.
Like the higher the temperature the stronger 
people are.
Like the higher the temperature the stronger 
people are.
Questions of Independence are often posed as 
questions of bias.
For example, the question, “Are admissions 
decisions at a local community college fair?” can 
reasonably be interpreted as a question of 
independence (or bias).
If fairness is taken to mean that there is 
proportional representation of minority and 
majority students that mirrors the local 
proportions, then a test of independence can 
estimate whether admissions are “fair”.
The question becomes “Are admissions 
decisions independent of majority/minority 
status?”
Assuming that majority students are similar in 
their preparation and motivation as minority 
students and they apply to the community 
college in proportionally similar numbers as 
minority students, then a fair admissions 
process should be independent of majority 
status and render proportions of admissions 
that are similar to proportions of majority and 
minority students in the local populations
INDEPENDENT EXAMPLE: If you are a minority 
you are neither more likely nor less likely to be 
admitted.
Failure to be independent would indicate bias.
Failure to be independent would indicate bias. 
BIAS EXAMPLE: If you are a minority you are 
more likely to be admitted.
Failure to be independent would indicate bias. 
BIAS EXAMPLE: If you are a minority you are 
more likely to be admitted. 
BIAS EXAMPLE: If you are a minority you less 
likely to be admitted
Failure to be independent would indicate bias. 
BIAS EXAMPLE: If you are a minority you are 
more likely to be admitted. 
BIAS EXAMPLE: If you are a minority you less 
likely to be admitted. 
You will use certain statistical methods (like the 
chi square test of independence) to determine if 
independence is significant or not.
Here is an example taken from 
http://omega.albany.edu:8008/mat108dir/chi2i 
ndependence/chi2in-m2h.html:
Here is an example taken from 
http://omega.albany.edu:8008/mat108dir/chi2i 
ndependence/chi2in-m2h.html: 
In a certain town, there are about one million 
eligible voters. A simple random sample of 
10,000 eligible voters was chosen to study the 
relationship between gender and participation 
in the last election.
Here is an example taken from 
http://omega.albany.edu:8008/mat108dir/chi2i 
ndependence/chi2in-m2h.html: 
In a certain town, there are about one million 
eligible voters. A simple random sample of 
10,000 eligible voters was chosen to study the 
relationship between gender and participation 
in the last election. The results 
are summarized in the following 
2X2 (read two by two) 
contingency table:
In a certain town, there are about one million 
eligible voters. A simple random sample of 
10,000 eligible voters was chosen to study the 
relationship between gender and participation 
in the last election. The results are summarized 
in the following 2X2 (read two by two) 
contingency table: 
Men Women 
__________________________ 
Voted 2792 3591 
Didn't vote 1486 2131
We want to check whether being a man or a 
woman (columns) is independent of having 
voted in the last election (rows). In other words 
is “gender and voting independent”? 
Men Women 
__________________________ 
Voted 2792 3591 
Didn't vote 1486 2131
Solution:
Solution: 
In order to answer the question we need to 
build a test of hypothesis. We have
Solution: 
In order to answer the question we need to 
build a test of hypothesis. We have 
Null Hypothesis = ‘Gender is independent of 
Voting’
Solution: 
In order to answer the question we need to 
build a test of hypothesis. We have 
Null Hypothesis = ‘Gender is independent of 
Voting’ 
Alternative Hypothesis = ‘Gender and Voting 
are dependent’
Solution: 
In order to answer the question we need to 
build a test of hypothesis. We have 
Null Hypothesis = ‘Gender is independent of 
Voting’ 
Alternative Hypothesis = ‘Gender and Voting 
are dependent’ 
After specifying the Null Hypothesis, we need to 
compute the expected table under the 
assumption that rows and columns are in fact 
independent.
As you can see we have the observed table 
below:
As you can see we have the observed table 
below: 
Men Women 
__________________________ 
Voted 2792 3591 
Didn't vote 1486 2131 
We need to create an expected table and then 
determine if the difference between the 
observed and expected are significant:
As you can see we have the observed table 
below: 
Men Women 
__________________________ 
Voted 2792 3591 
Didn't vote 1486 2131 
We need to create an expected table and then 
determine if the difference between the 
observed and expected are significant:
As you can see we have the observed table 
below: 
Men Women 
__________________________ 
Voted 2792 3591 
Didn't vote 1486 2131 
We need to create an expected table and then 
determine if the difference between the 
observed and expected are significant: 
Observed Numbers Expected Numbers Difference
Remember that the smaller the DIFFERENCE, 
the better the fit which in this case would favor 
INDEPENDENCE between gender and voting 
tendencies.
Remember that the smaller the DIFFERENCE, 
the better the fit which in this case would favor 
INDEPENDENCE between gender and voting 
tendencies. 
Observed Numbers Expected Numbers Difference
Inversely, the larger the DIFFERENCE the worse 
the fit which in this case would indicate that 
gender and voting tendencies are dependent 
upon one another.
Inversely, the larger the DIFFERENCE the worse 
the fit which in this case would indicate that 
gender and voting tendencies are dependent 
upon one another. 
Observed Numbers Expected Numbers Difference
We use Chi-Square distribution to determine if 
that difference is significant or not.
We use Chi-Square distribution to determine if 
that difference is significant or not. 
We will now show you how to compute the chi-square 
statistic for a test of independence.
We use Chi-Square distribution to determine if 
that difference is significant or not. 
We will now show you how to compute the chi-square 
statistic for a test of independence. 
First, we compute the row and column totals 
along with the grand total.
Men Women 
________________________________________ 
Voted 2792 3591 
Didn't vote 1486 2131
Total Who 
Voted 
Men Women 
________________________________________ 
Voted 2792 + 3591 = 6386 
Didn't vote 1486 2131
Men Women 
________________________________________ 
Voted 2792 3591 6386 
Didn't vote 1486 + 2131 = 3617 
Total Who 
Did Not 
Vote
Men Women 
________________________________________ 
Voted 2792 3591 6386 
Didn't vote + 1486 2131 3617 
= 4278 
Total Men
Men Women 
________________________________________ 
Voted 2792 3591 6386 
Didn't vote 1486 + 2131 3617 
4278 = 5722 
Total 
Women
Total Men & 
Women or Total 
Voted/Not Voted 
Men Women 
________________________________________ 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000
Now we have the information we need to create 
an expected table. Here is the equation for 
calculating the expected value for the cell “Men 
who Voted”:
Now we have the information we need to create 
an expected table. Here is the equation for 
calculating the expected value for the cell “Men 
who Voted”: 
Expected Value(Men who voted) = 
(Number (all who voted) * Number (all men)) 
Number(total number)
Observed Men Women 
_ 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Men Who 
Voted
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who voted) = 
(6386 (all who voted) * Number (all men)) 
Number (total number)
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who voted) = 
(6386 (all who voted) * 4278 (all men) ) 
Number (total number)
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who voted) = 
(6386 (all who voted) * 4278 (all men) ) 
10000 (total number)
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who voted) = 
(27306474 (all who voted * all men)) 
10000 (total number)
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who voted) = 
2730.6474 ((all who voted * all men)/total number)
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who voted) = 
2731 ((all who voted * all men)/total number)
EXPECTED Men Women 
_ TABLE 
Voted 2731 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who voted) = 
2731 ((all who voted * all men)/total number)
EXPECTED Men Women 
_ TABLE 
Voted 2731 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
What is the expected 
value for Women who 
Voted?
Women who voted:
Women who voted: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000
Women who voted: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who voted) = 
(6386 (all who voted) * 5722 (all women) ) 
10000 (total number)
Women who voted: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who voted) = 
(6386 (all who voted) * 5722 (all women) ) 
10000 (total number)
Women who voted: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who voted) = 
(6386 (all who voted) * 5722 (all women) ) 
10000 (total number)
Women who voted: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who voted) = 
(36523526 ((all who voted) * (all women)) ) 
10000 (total number)
Women who voted: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who voted) = 
(3652.3526 ((all who voted) * (all women)))/total number
Women who voted: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who voted) = 
(3652 ((all who voted) * (all women)))/total number
Women who voted: 
EXPECTED Men Women 
_ TABLE 
Voted 2731 3652 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who voted) = 
(3652 ((all who voted) * (all women)))/total number
Women who voted: 
EXPECTED Men Women 
_ TABLE 
Voted 2731 3652 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
What is the expected 
value for Men who 
Didn’t Vote?
Men who didn’t vote:
Men who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000
Men who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who didn’t vote) = 
(3617 (all who didn’t vote) * 4278 (all men) ) 
10000 (total number)
Men who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who didn’t vote) = 
(3617 (all who didn’t vote) * 4278 (all men) ) 
10000 (total number)
Men who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who didn’t vote) = 
(3617 (all who didn’t vote) * 4278 (all men) ) 
10000 (total number)
Men who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who didn’t vote) = 
(15473526 ((all who didn’t vote) * (all men)) ) 
10000 (total number)
Men who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Men who didn’t vote) = 
(1547.3526 ((all who didn’t vote) * (all men)) / (total number))
Men who didn’t vote: 
EXPECTED Men Women 
_ TABLE 
Voted 2731 3652 6386 
Didn't vote 1547 2131 3617 
4278 5722 10000 
Expected Value (Men who didn’t vote) = 
(1547 ((all who didn’t vote) * (all men)) / (total number))
Men who didn’t vote: 
EXPECTED Men Women 
_ TABLE 
Voted 2731 3652 6386 
Didn't vote 1547 2131 3617 
4278 5722 10000 
What is the expected 
value for Women who 
Didn’t Vote?
Women who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000
Women who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who didn’t vote) = 
(3617 (all who didn’t vote) * 5722 (all women) ) 
10000 (total number)
Women who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who didn’t vote) = 
(3617 (all who didn’t vote) * 5722 (all women) ) 
10000 (total number)
Women who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who didn’t vote) = 
(3617 (all who didn’t vote) * 5722 (all women) ) 
10000 (total number)
Women who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who didn’t vote) = 
(20696474 (all who didn’t vote) * (all women) ) 
10000 (total number)
Women who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who didn’t vote) = 
(2069.6474 (all who didn’t vote) * (all women)) /(total number)
Women who didn’t vote: 
OBSERVED Men Women 
_ TABLE 
Voted 2792 3591 6386 
Didn't vote 1486 2131 3617 
4278 5722 10000 
Expected Value (Women who didn’t vote) = 
(2070 (all who didn’t vote) * (all women)) /(total number)
Men who didn’t vote: 
EXPECTED Men Women 
_ TABLE 
Voted 2731 3652 6386 
Didn't vote 1547 2070 3617 
4278 5722 10000
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
4278 5722 
10000 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
10000
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
4278 5722 
10000 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
10000 
With the information above, we can now plug in 
the numbers using the Chi-square independence 
test.
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
4278 5722 
10000 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
10000 
With the information above, we can now plug in 
the numbers using the Chi-square independence 
test. 
Note – this is the same equation that is used 
with the Chi-square goodness of fit test:
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
4278 5722 
10000 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
10000 
With the information above, we can now plug in 
the numbers using the Chi-square independence 
test. 
Note – this is the same equation that is used 
with the Chi-square goodness of fit test: 
푥2 = Σ 
(푂 − 퐸)2 
퐸
푥2 = 횺 
(푂 − 퐸)2 
퐸
푥2 = 횺 
(푂 − 퐸)2 
퐸
푥2 = 횺 
(푂 − 퐸)2 
퐸 
Or in this case:
푥2 = 횺 
(푂 − 퐸)2 
퐸 
Or in this case: 
푥2 = 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸
푥2 = 횺 
(푂 − 퐸)2 
퐸 
Or in this case: 
푥2 = 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
4278 5722 
10000 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
10000
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
4278 5722 
Voting Men 
10000 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
10000 
푥2 = 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
Voting Men
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
푥2 = 
(2792 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
Voting Men
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
푥2 = 
(2792 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
Voting Men
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
4278 5722 
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
- = Difference 
푥2 = 
(2792 − 2731)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
Voting Men
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
Voting Men
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
4278 5722 
Voting Men 
- = Difference 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
Voting Women
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
4278 5722 
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
- = Difference 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
Voting Women
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
4278 5722 
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
- = Difference 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
Voting Women
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
Voting Women 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
Voting Women 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
Non-Voting Men 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(푂 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
Non-Voting Men 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
4278 5722 
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
- = Difference 
Non-Voting Men 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 퐸)2 
퐸 
+ 
(푂 − 퐸)2 
퐸
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
4278 5722 
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
- = Difference 
Non-Voting Men 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(푂 − 퐸)2 
퐸
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
Non-Voting 
Women 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(푂 − 퐸)2 
퐸
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
- = Difference 
4278 5722 
Non-Voting 
Women 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(2131 − 퐸)2 
퐸
EXPECTED Men Women 
TABLE 
Voted 2731 3652 
Didn't vote 1547 2070 
4278 5722 
OBSERVED Men Women 
TABLE 
Voted 2792 3591 
Didn't vote 1486 2131 
Voting Men 
- = Difference 
Non-Voting 
Women 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate:
Time to Calculate: 
푥2 = 
(2792 − 2731)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 
(61)2 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 
3721 
2731 
+ 
(3591 − 3652)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 1.4 + 
(3591 − 3652)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 1.4 + 
(−61)2 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 1.4 + 
3721 
3652 
+ 
(1486 − 1547)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 1.4 + 1.0 + 
(1486 − 1547)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 1.4 + 1.0 + 
(61)2 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 1.4 + 1.0 + 
3721 
1547 
+ 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 1.4 + 1.0 + 2.4 + 
(2131 − 2070)2 
2070
Time to Calculate: 
푥2 = 1.4 + 1.0 + 2.4 + 
(61)2 
2070
Time to Calculate: 
푥2 = 1.4 + 1.0 + 2.4 + 
3721 
2070
Time to Calculate: 
푥2 = 1.4 + 1.0 + 2.4 + 1.8
Time to Calculate: 
푥2 = 6.6
Now we determine if a 푥2of 6.6 exceeds the 
critical 푥2 for terms.
To calculate the 푥2 critical we first must 
determine the degrees of freedom as well as set 
the probability level.
To calculate the 푥2 critical we first must 
determine the degrees of freedom as well as set 
the probability level. 
The probability or alpha level means the 
probability of a type 1 error we are willing to live 
with (i.e., this is the probability of being wrong 
when we reject the null hypothesis). Generally 
this value is .05 which is like saying we are 
willing to be wrong 5 out of 100 times (.05) 
before we will reject the null-hypothesis.
Degrees of Freedom are calculated by taking the 
number rows and subtracting them by 1 and 
then multiplying the result by taking the number 
of columns and subtracting them by 1.
Degrees of Freedom are calculated by taking the 
number rows and subtracting them by 1 and 
then multiplying the result by taking the number 
of columns and subtracting them by 1. (Two 
rows -1) or (2-1) X (2-1) or 1X1=1. Degrees of 
Freedom = 1.
We now have all of the information we need to 
determine the critical 푥2.
We now have all of the information we need to 
determine the critical 푥2. 
We go to the Chi-Square Distribution Table and 
locate the degrees of freedom:
We now have all of the information we need to 
determine the critical 푥2. 
We go to the Chi-Square Distribution Table and 
locate the degrees of freedom: 
df 0.100 0.050 0.025 
1 2.71 3.84 5.02 
2 4.61 5.99 7.38 
3 6.25 7.82 9.35 
4 7.78 9.49 11.14 
5 9.24 11.07 12.83 
6 10.64 12.59 14.45 
7 12.02 14.07 16.10 
8 13.36 15.51 17.54 
9 14.68 16.92 19.20 
… … … …
We now have all of the information we need to 
determine the critical 푥2. 
We go to the Chi-Square Distribution Table and 
locate the degrees of freedom: 
df 0.100 0.050 0.025 
1 2.71 3.84 5.02 
2 4.61 5.99 7.38 
3 6.25 7.82 9.35 
4 7.78 9.49 11.14 
5 9.24 11.07 12.83 
6 10.64 12.59 14.45 
7 12.02 14.07 16.10 
8 13.36 15.51 17.54 
9 14.68 16.92 19.20 
… … … … 
And then we locate the 
probability or alpha level:
We now have all of the information we need to 
determine the critical 푥2. 
We go to the Chi-Square Distribution Table and 
locate the degrees of freedom: 
df 0.100 0.050 0.025 
1 2.71 3.84 5.02 
2 4.61 5.99 7.38 
3 6.25 7.82 9.35 
4 7.78 9.49 11.14 
5 9.24 11.07 12.83 
6 10.64 12.59 14.45 
7 12.02 14.07 16.10 
8 13.36 15.51 17.54 
9 14.68 16.92 19.20 
… … … … 
And then we locate the 
probability or alpha level: 
Where these two values 
intersect in the table we find 
the critical 푥2.
Since the chi-square goodness of fit value (6.6) 
exceeds the critical 푥2 (3.84) we will reject the 
null-hypothesis.
Since the chi-square goodness of fit value (6.6) 
exceeds the critical 푥2 (3.84) we will reject the 
null-hypothesis. 
Voting patterns and gender status are not 
statistically significantly dependent on one 
another.
Since the chi-square goodness of fit value (6.6) 
exceeds the critical 푥2 (3.84) we will reject the 
null-hypothesis. 
Voting patterns and gender status are not 
statistically significantly dependent on one 
another.
Since the chi-square goodness of fit value (6.6) 
exceeds the critical 푥2 (3.84) we will reject the 
null-hypothesis. 
Voting patterns and gender status are not 
statistically significantly dependent on one 
another. 
There actually is a significant difference.
So what is the difference between chi-square 
test of goodness of fit and test of 
independence?
A goodness-of-fit test is a one variable Chi-square 
test.
A goodness-of-fit test is a one variable Chi-square 
test. 
In this example, a department chair wants to 
know if the enrollments across three professors 
are equally distributed.
A goodness-of-fit test is a one variable Chi-square 
test. 
In this example, a department chair wants to 
know if the enrollments across three professors 
are equally distributed. 
Here is the actual, or observed, data:
A goodness-of-fit test is a one variable Chi-square 
test. 
In this example, a department chair wants to 
know if the enrollments across three professors 
are equally distributed. 
Here is the actual, or observed, data: 
OBSERVED 
TABLE 
Prof A’s 
Class 
Prof B’s 
Class 
Prof C’s 
Class 
Students enrolled 31 25 10
A goodness-of-fit test is a one variable Chi-square 
test. 
OBSERVED 
TABLE 
Prof A’s 
Class 
Prof B’s 
Class 
Prof C’s 
Class 
Students enrolled 31 25 10
A goodness-of-fit test is a one variable Chi-square 
test. 
OBSERVED 
TABLE 
Prof A’s 
Class 
Prof B’s 
Class 
Prof C’s 
Class 
Students enrolled 31 25 10
A test of independence is a two variable Chi-square 
test.
A test of independence is a two variable Chi-square 
test. 
For example, a department chair wants to know 
if women and men enrollments are equally 
distributed across three professor classes.
A test of independence is a two variable Chi-square 
test. 
For example, a department chair wants to know 
if women and men enrollments are equally 
distributed across three professor classes. 
OBSERVED 
TABLE 
Prof A’s 
Class 
Prof B’s 
Class 
Prof C’s 
Class 
Men 21 7 7 
Women 10 18 3
A test of independence is a two variable 
(gender) Chi-square test. 
OBSERVED 
TABLE 
Prof A’s 
Class 
Prof B’s 
Class 
Prof C’s 
Class 
Men 21 7 7 
Women 10 18 3

Weitere ähnliche Inhalte

Was ist angesagt?

The chi square test of indep of categorical variables
The chi square test of indep of categorical variablesThe chi square test of indep of categorical variables
The chi square test of indep of categorical variablesRegent University
 
Inferential statistics (2)
Inferential statistics (2)Inferential statistics (2)
Inferential statistics (2)rajnulada
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Dalia El-Shafei
 
Measures of dispersion or variation
Measures of dispersion or variationMeasures of dispersion or variation
Measures of dispersion or variationRaj Teotia
 
Advance Statistics - Wilcoxon Signed Rank Test
Advance Statistics - Wilcoxon Signed Rank TestAdvance Statistics - Wilcoxon Signed Rank Test
Advance Statistics - Wilcoxon Signed Rank TestJoshua Batalla
 
Normality evaluation in a data
Normality evaluation in a dataNormality evaluation in a data
Normality evaluation in a dataWaqar Akram
 
Two way analysis of variance (anova)
Two way analysis of variance (anova)Two way analysis of variance (anova)
Two way analysis of variance (anova)Randel Roy Raluto
 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: EstimationParag Shah
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsAiden Yeh
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentationcae_021
 

Was ist angesagt? (20)

The chi square test of indep of categorical variables
The chi square test of indep of categorical variablesThe chi square test of indep of categorical variables
The chi square test of indep of categorical variables
 
Inferential statistics (2)
Inferential statistics (2)Inferential statistics (2)
Inferential statistics (2)
 
Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"Statistics "Descriptive & Inferential"
Statistics "Descriptive & Inferential"
 
Measures of dispersion or variation
Measures of dispersion or variationMeasures of dispersion or variation
Measures of dispersion or variation
 
Advance Statistics - Wilcoxon Signed Rank Test
Advance Statistics - Wilcoxon Signed Rank TestAdvance Statistics - Wilcoxon Signed Rank Test
Advance Statistics - Wilcoxon Signed Rank Test
 
Mann Whitney U test
Mann Whitney U testMann Whitney U test
Mann Whitney U test
 
Normality evaluation in a data
Normality evaluation in a dataNormality evaluation in a data
Normality evaluation in a data
 
Statistical tests
Statistical tests Statistical tests
Statistical tests
 
T test and types of t-test
T test and types of t-testT test and types of t-test
T test and types of t-test
 
Two way analysis of variance (anova)
Two way analysis of variance (anova)Two way analysis of variance (anova)
Two way analysis of variance (anova)
 
Chi square
Chi square Chi square
Chi square
 
Goodness of fit (ppt)
Goodness of fit (ppt)Goodness of fit (ppt)
Goodness of fit (ppt)
 
Non parametric-tests
Non parametric-testsNon parametric-tests
Non parametric-tests
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variability
 
In Anova
In  AnovaIn  Anova
In Anova
 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: Estimation
 
Chi square test
Chi square test Chi square test
Chi square test
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentation
 
Chi square mahmoud
Chi square mahmoudChi square mahmoud
Chi square mahmoud
 

Ähnlich wie Chi Square Test of Independence Explained

Frequency Tables - Statistics
Frequency Tables - StatisticsFrequency Tables - Statistics
Frequency Tables - Statisticsmscartersmaths
 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTsStephen Senn
 
STAT200 Introduction to StatisticsDataset for Written Assignment.docx
STAT200 Introduction to StatisticsDataset for Written Assignment.docxSTAT200 Introduction to StatisticsDataset for Written Assignment.docx
STAT200 Introduction to StatisticsDataset for Written Assignment.docxsusanschei
 
e-Portfolio for Lab-Based Statistics (PSYC 3100) part 1
e-Portfolio for Lab-Based Statistics (PSYC 3100) part 1e-Portfolio for Lab-Based Statistics (PSYC 3100) part 1
e-Portfolio for Lab-Based Statistics (PSYC 3100) part 1Ella Anwar
 
Standard error and sample size
Standard error and sample sizeStandard error and sample size
Standard error and sample sizedrplayfoot
 
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docxHomework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docxpooleavelina
 
Data Analysis for Graduate Studies Summary
Data Analysis for Graduate Studies SummaryData Analysis for Graduate Studies Summary
Data Analysis for Graduate Studies SummaryKelvinNMhina
 
Dataset for Written AssignmentsDescription of DatasetThe .docx
Dataset for Written AssignmentsDescription of DatasetThe .docxDataset for Written AssignmentsDescription of DatasetThe .docx
Dataset for Written AssignmentsDescription of DatasetThe .docxrandyburney60861
 
Question1The Tri-City School District has instituted a zero-tol.docx
Question1The Tri-City School District has instituted a zero-tol.docxQuestion1The Tri-City School District has instituted a zero-tol.docx
Question1The Tri-City School District has instituted a zero-tol.docxmakdul
 
Advanced business mathematics and statistics for entrepreneurs
Advanced business mathematics and statistics for entrepreneursAdvanced business mathematics and statistics for entrepreneurs
Advanced business mathematics and statistics for entrepreneursDr. Trilok Kumar Jain
 
Lecture 3 - Descriptive statistics Spring 2023.pptx
Lecture 3 - Descriptive statistics Spring 2023.pptxLecture 3 - Descriptive statistics Spring 2023.pptx
Lecture 3 - Descriptive statistics Spring 2023.pptxaazmeerrahman
 
Creating effective election materials and websites
Creating effective election materials and websitesCreating effective election materials and websites
Creating effective election materials and websitesCenter for Civic Design
 
Basic Statistics in 1 hour.pptx
Basic Statistics in 1 hour.pptxBasic Statistics in 1 hour.pptx
Basic Statistics in 1 hour.pptxParag Shah
 
ECO491 Group Project
ECO491 Group ProjectECO491 Group Project
ECO491 Group Projecttrijay1802
 
Statistics project on comparison of two batsmen
Statistics project on comparison of two batsmenStatistics project on comparison of two batsmen
Statistics project on comparison of two batsmenVIsva Bharati University
 
Reporting chi square goodness of fit test of independence in apa
Reporting chi square goodness of fit test of independence in apaReporting chi square goodness of fit test of independence in apa
Reporting chi square goodness of fit test of independence in apaKen Plummer
 
2023 Week 1 Lesson Powerpoint.pptx
2023 Week 1 Lesson Powerpoint.pptx2023 Week 1 Lesson Powerpoint.pptx
2023 Week 1 Lesson Powerpoint.pptxMaiThanh458453
 

Ähnlich wie Chi Square Test of Independence Explained (20)

Chi sq explanation
Chi sq explanationChi sq explanation
Chi sq explanation
 
Frequency Tables - Statistics
Frequency Tables - StatisticsFrequency Tables - Statistics
Frequency Tables - Statistics
 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTs
 
STAT200 Introduction to StatisticsDataset for Written Assignment.docx
STAT200 Introduction to StatisticsDataset for Written Assignment.docxSTAT200 Introduction to StatisticsDataset for Written Assignment.docx
STAT200 Introduction to StatisticsDataset for Written Assignment.docx
 
e-Portfolio for Lab-Based Statistics (PSYC 3100) part 1
e-Portfolio for Lab-Based Statistics (PSYC 3100) part 1e-Portfolio for Lab-Based Statistics (PSYC 3100) part 1
e-Portfolio for Lab-Based Statistics (PSYC 3100) part 1
 
Standard error and sample size
Standard error and sample sizeStandard error and sample size
Standard error and sample size
 
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docxHomework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
Homework #1SOCY 3115Spring 20Read the Syllabus and FAQ on ho.docx
 
Data Analysis for Graduate Studies Summary
Data Analysis for Graduate Studies SummaryData Analysis for Graduate Studies Summary
Data Analysis for Graduate Studies Summary
 
Dataset for Written AssignmentsDescription of DatasetThe .docx
Dataset for Written AssignmentsDescription of DatasetThe .docxDataset for Written AssignmentsDescription of DatasetThe .docx
Dataset for Written AssignmentsDescription of DatasetThe .docx
 
Question1The Tri-City School District has instituted a zero-tol.docx
Question1The Tri-City School District has instituted a zero-tol.docxQuestion1The Tri-City School District has instituted a zero-tol.docx
Question1The Tri-City School District has instituted a zero-tol.docx
 
Advanced business mathematics and statistics for entrepreneurs
Advanced business mathematics and statistics for entrepreneursAdvanced business mathematics and statistics for entrepreneurs
Advanced business mathematics and statistics for entrepreneurs
 
Lecture 3 - Descriptive statistics Spring 2023.pptx
Lecture 3 - Descriptive statistics Spring 2023.pptxLecture 3 - Descriptive statistics Spring 2023.pptx
Lecture 3 - Descriptive statistics Spring 2023.pptx
 
Creating effective election materials and websites
Creating effective election materials and websitesCreating effective election materials and websites
Creating effective election materials and websites
 
Basic Statistics in 1 hour.pptx
Basic Statistics in 1 hour.pptxBasic Statistics in 1 hour.pptx
Basic Statistics in 1 hour.pptx
 
ECO491 Group Project
ECO491 Group ProjectECO491 Group Project
ECO491 Group Project
 
Statistics project on comparison of two batsmen
Statistics project on comparison of two batsmenStatistics project on comparison of two batsmen
Statistics project on comparison of two batsmen
 
Reporting chi square goodness of fit test of independence in apa
Reporting chi square goodness of fit test of independence in apaReporting chi square goodness of fit test of independence in apa
Reporting chi square goodness of fit test of independence in apa
 
2023 Week 1 Lesson Powerpoint.pptx
2023 Week 1 Lesson Powerpoint.pptx2023 Week 1 Lesson Powerpoint.pptx
2023 Week 1 Lesson Powerpoint.pptx
 
1628 statistics
1628 statistics1628 statistics
1628 statistics
 
Statistics for ess
Statistics for essStatistics for ess
Statistics for ess
 

Mehr von Ken Plummer

Diff rel gof-fit - jejit - practice (5)
Diff rel gof-fit - jejit - practice (5)Diff rel gof-fit - jejit - practice (5)
Diff rel gof-fit - jejit - practice (5)Ken Plummer
 
Learn About Range - Copyright updated
Learn About Range - Copyright updatedLearn About Range - Copyright updated
Learn About Range - Copyright updatedKen Plummer
 
Inferential vs descriptive tutorial of when to use - Copyright Updated
Inferential vs descriptive tutorial of when to use - Copyright UpdatedInferential vs descriptive tutorial of when to use - Copyright Updated
Inferential vs descriptive tutorial of when to use - Copyright UpdatedKen Plummer
 
Diff rel ind-fit practice - Copyright Updated
Diff rel ind-fit practice - Copyright UpdatedDiff rel ind-fit practice - Copyright Updated
Diff rel ind-fit practice - Copyright UpdatedKen Plummer
 
Normal or skewed distributions (inferential) - Copyright updated
Normal or skewed distributions (inferential) - Copyright updatedNormal or skewed distributions (inferential) - Copyright updated
Normal or skewed distributions (inferential) - Copyright updatedKen Plummer
 
Normal or skewed distributions (descriptive both2) - Copyright updated
Normal or skewed distributions (descriptive both2) - Copyright updatedNormal or skewed distributions (descriptive both2) - Copyright updated
Normal or skewed distributions (descriptive both2) - Copyright updatedKen Plummer
 
Nature of the data practice - Copyright updated
Nature of the data practice - Copyright updatedNature of the data practice - Copyright updated
Nature of the data practice - Copyright updatedKen Plummer
 
Nature of the data (spread) - Copyright updated
Nature of the data (spread) - Copyright updatedNature of the data (spread) - Copyright updated
Nature of the data (spread) - Copyright updatedKen Plummer
 
Mode practice 1 - Copyright updated
Mode practice 1 - Copyright updatedMode practice 1 - Copyright updated
Mode practice 1 - Copyright updatedKen Plummer
 
Nature of the data (descriptive) - Copyright updated
Nature of the data (descriptive) - Copyright updatedNature of the data (descriptive) - Copyright updated
Nature of the data (descriptive) - Copyright updatedKen Plummer
 
Dichotomous or scaled
Dichotomous or scaledDichotomous or scaled
Dichotomous or scaledKen Plummer
 
Skewed less than 30 (ties)
Skewed less than 30 (ties)Skewed less than 30 (ties)
Skewed less than 30 (ties)Ken Plummer
 
Skewed sample size less than 30
Skewed sample size less than 30Skewed sample size less than 30
Skewed sample size less than 30Ken Plummer
 
Ordinal and nominal
Ordinal and nominalOrdinal and nominal
Ordinal and nominalKen Plummer
 
Relationship covariates
Relationship   covariatesRelationship   covariates
Relationship covariatesKen Plummer
 
Relationship nature of data
Relationship nature of dataRelationship nature of data
Relationship nature of dataKen Plummer
 
Number of variables (predictive)
Number of variables (predictive)Number of variables (predictive)
Number of variables (predictive)Ken Plummer
 
Levels of the iv
Levels of the ivLevels of the iv
Levels of the ivKen Plummer
 
Independent variables (2)
Independent variables (2)Independent variables (2)
Independent variables (2)Ken Plummer
 

Mehr von Ken Plummer (20)

Diff rel gof-fit - jejit - practice (5)
Diff rel gof-fit - jejit - practice (5)Diff rel gof-fit - jejit - practice (5)
Diff rel gof-fit - jejit - practice (5)
 
Learn About Range - Copyright updated
Learn About Range - Copyright updatedLearn About Range - Copyright updated
Learn About Range - Copyright updated
 
Inferential vs descriptive tutorial of when to use - Copyright Updated
Inferential vs descriptive tutorial of when to use - Copyright UpdatedInferential vs descriptive tutorial of when to use - Copyright Updated
Inferential vs descriptive tutorial of when to use - Copyright Updated
 
Diff rel ind-fit practice - Copyright Updated
Diff rel ind-fit practice - Copyright UpdatedDiff rel ind-fit practice - Copyright Updated
Diff rel ind-fit practice - Copyright Updated
 
Normal or skewed distributions (inferential) - Copyright updated
Normal or skewed distributions (inferential) - Copyright updatedNormal or skewed distributions (inferential) - Copyright updated
Normal or skewed distributions (inferential) - Copyright updated
 
Normal or skewed distributions (descriptive both2) - Copyright updated
Normal or skewed distributions (descriptive both2) - Copyright updatedNormal or skewed distributions (descriptive both2) - Copyright updated
Normal or skewed distributions (descriptive both2) - Copyright updated
 
Nature of the data practice - Copyright updated
Nature of the data practice - Copyright updatedNature of the data practice - Copyright updated
Nature of the data practice - Copyright updated
 
Nature of the data (spread) - Copyright updated
Nature of the data (spread) - Copyright updatedNature of the data (spread) - Copyright updated
Nature of the data (spread) - Copyright updated
 
Mode practice 1 - Copyright updated
Mode practice 1 - Copyright updatedMode practice 1 - Copyright updated
Mode practice 1 - Copyright updated
 
Nature of the data (descriptive) - Copyright updated
Nature of the data (descriptive) - Copyright updatedNature of the data (descriptive) - Copyright updated
Nature of the data (descriptive) - Copyright updated
 
Dichotomous or scaled
Dichotomous or scaledDichotomous or scaled
Dichotomous or scaled
 
Skewed less than 30 (ties)
Skewed less than 30 (ties)Skewed less than 30 (ties)
Skewed less than 30 (ties)
 
Skewed sample size less than 30
Skewed sample size less than 30Skewed sample size less than 30
Skewed sample size less than 30
 
Ordinal (ties)
Ordinal (ties)Ordinal (ties)
Ordinal (ties)
 
Ordinal and nominal
Ordinal and nominalOrdinal and nominal
Ordinal and nominal
 
Relationship covariates
Relationship   covariatesRelationship   covariates
Relationship covariates
 
Relationship nature of data
Relationship nature of dataRelationship nature of data
Relationship nature of data
 
Number of variables (predictive)
Number of variables (predictive)Number of variables (predictive)
Number of variables (predictive)
 
Levels of the iv
Levels of the ivLevels of the iv
Levels of the iv
 
Independent variables (2)
Independent variables (2)Independent variables (2)
Independent variables (2)
 

Kürzlich hochgeladen

Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 

Kürzlich hochgeladen (20)

Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 

Chi Square Test of Independence Explained

  • 1. Chi Square Test of Independence Conceptual
  • 3. Questions of independence are actually the flip side of questions of relationship. If a variable is independent of another variable, then functions in one will not be accompanied by functions in the other.
  • 4. Like the higher the temperature the stronger people are.
  • 5. Like the higher the temperature the stronger people are.
  • 6. Questions of Independence are often posed as questions of bias.
  • 7. For example, the question, “Are admissions decisions at a local community college fair?” can reasonably be interpreted as a question of independence (or bias).
  • 8. If fairness is taken to mean that there is proportional representation of minority and majority students that mirrors the local proportions, then a test of independence can estimate whether admissions are “fair”.
  • 9. The question becomes “Are admissions decisions independent of majority/minority status?”
  • 10. Assuming that majority students are similar in their preparation and motivation as minority students and they apply to the community college in proportionally similar numbers as minority students, then a fair admissions process should be independent of majority status and render proportions of admissions that are similar to proportions of majority and minority students in the local populations
  • 11. INDEPENDENT EXAMPLE: If you are a minority you are neither more likely nor less likely to be admitted.
  • 12. Failure to be independent would indicate bias.
  • 13. Failure to be independent would indicate bias. BIAS EXAMPLE: If you are a minority you are more likely to be admitted.
  • 14. Failure to be independent would indicate bias. BIAS EXAMPLE: If you are a minority you are more likely to be admitted. BIAS EXAMPLE: If you are a minority you less likely to be admitted
  • 15. Failure to be independent would indicate bias. BIAS EXAMPLE: If you are a minority you are more likely to be admitted. BIAS EXAMPLE: If you are a minority you less likely to be admitted. You will use certain statistical methods (like the chi square test of independence) to determine if independence is significant or not.
  • 16. Here is an example taken from http://omega.albany.edu:8008/mat108dir/chi2i ndependence/chi2in-m2h.html:
  • 17. Here is an example taken from http://omega.albany.edu:8008/mat108dir/chi2i ndependence/chi2in-m2h.html: In a certain town, there are about one million eligible voters. A simple random sample of 10,000 eligible voters was chosen to study the relationship between gender and participation in the last election.
  • 18. Here is an example taken from http://omega.albany.edu:8008/mat108dir/chi2i ndependence/chi2in-m2h.html: In a certain town, there are about one million eligible voters. A simple random sample of 10,000 eligible voters was chosen to study the relationship between gender and participation in the last election. The results are summarized in the following 2X2 (read two by two) contingency table:
  • 19. In a certain town, there are about one million eligible voters. A simple random sample of 10,000 eligible voters was chosen to study the relationship between gender and participation in the last election. The results are summarized in the following 2X2 (read two by two) contingency table: Men Women __________________________ Voted 2792 3591 Didn't vote 1486 2131
  • 20. We want to check whether being a man or a woman (columns) is independent of having voted in the last election (rows). In other words is “gender and voting independent”? Men Women __________________________ Voted 2792 3591 Didn't vote 1486 2131
  • 22. Solution: In order to answer the question we need to build a test of hypothesis. We have
  • 23. Solution: In order to answer the question we need to build a test of hypothesis. We have Null Hypothesis = ‘Gender is independent of Voting’
  • 24. Solution: In order to answer the question we need to build a test of hypothesis. We have Null Hypothesis = ‘Gender is independent of Voting’ Alternative Hypothesis = ‘Gender and Voting are dependent’
  • 25. Solution: In order to answer the question we need to build a test of hypothesis. We have Null Hypothesis = ‘Gender is independent of Voting’ Alternative Hypothesis = ‘Gender and Voting are dependent’ After specifying the Null Hypothesis, we need to compute the expected table under the assumption that rows and columns are in fact independent.
  • 26. As you can see we have the observed table below:
  • 27. As you can see we have the observed table below: Men Women __________________________ Voted 2792 3591 Didn't vote 1486 2131 We need to create an expected table and then determine if the difference between the observed and expected are significant:
  • 28. As you can see we have the observed table below: Men Women __________________________ Voted 2792 3591 Didn't vote 1486 2131 We need to create an expected table and then determine if the difference between the observed and expected are significant:
  • 29. As you can see we have the observed table below: Men Women __________________________ Voted 2792 3591 Didn't vote 1486 2131 We need to create an expected table and then determine if the difference between the observed and expected are significant: Observed Numbers Expected Numbers Difference
  • 30. Remember that the smaller the DIFFERENCE, the better the fit which in this case would favor INDEPENDENCE between gender and voting tendencies.
  • 31. Remember that the smaller the DIFFERENCE, the better the fit which in this case would favor INDEPENDENCE between gender and voting tendencies. Observed Numbers Expected Numbers Difference
  • 32. Inversely, the larger the DIFFERENCE the worse the fit which in this case would indicate that gender and voting tendencies are dependent upon one another.
  • 33. Inversely, the larger the DIFFERENCE the worse the fit which in this case would indicate that gender and voting tendencies are dependent upon one another. Observed Numbers Expected Numbers Difference
  • 34. We use Chi-Square distribution to determine if that difference is significant or not.
  • 35. We use Chi-Square distribution to determine if that difference is significant or not. We will now show you how to compute the chi-square statistic for a test of independence.
  • 36. We use Chi-Square distribution to determine if that difference is significant or not. We will now show you how to compute the chi-square statistic for a test of independence. First, we compute the row and column totals along with the grand total.
  • 37. Men Women ________________________________________ Voted 2792 3591 Didn't vote 1486 2131
  • 38. Total Who Voted Men Women ________________________________________ Voted 2792 + 3591 = 6386 Didn't vote 1486 2131
  • 39. Men Women ________________________________________ Voted 2792 3591 6386 Didn't vote 1486 + 2131 = 3617 Total Who Did Not Vote
  • 40. Men Women ________________________________________ Voted 2792 3591 6386 Didn't vote + 1486 2131 3617 = 4278 Total Men
  • 41. Men Women ________________________________________ Voted 2792 3591 6386 Didn't vote 1486 + 2131 3617 4278 = 5722 Total Women
  • 42. Total Men & Women or Total Voted/Not Voted Men Women ________________________________________ Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000
  • 43. Now we have the information we need to create an expected table. Here is the equation for calculating the expected value for the cell “Men who Voted”:
  • 44. Now we have the information we need to create an expected table. Here is the equation for calculating the expected value for the cell “Men who Voted”: Expected Value(Men who voted) = (Number (all who voted) * Number (all men)) Number(total number)
  • 45. Observed Men Women _ Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Men Who Voted
  • 46. OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who voted) = (6386 (all who voted) * Number (all men)) Number (total number)
  • 47. OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who voted) = (6386 (all who voted) * 4278 (all men) ) Number (total number)
  • 48. OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who voted) = (6386 (all who voted) * 4278 (all men) ) 10000 (total number)
  • 49. OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who voted) = (27306474 (all who voted * all men)) 10000 (total number)
  • 50. OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who voted) = 2730.6474 ((all who voted * all men)/total number)
  • 51. OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who voted) = 2731 ((all who voted * all men)/total number)
  • 52. EXPECTED Men Women _ TABLE Voted 2731 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who voted) = 2731 ((all who voted * all men)/total number)
  • 53. EXPECTED Men Women _ TABLE Voted 2731 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 What is the expected value for Women who Voted?
  • 55. Women who voted: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000
  • 56. Women who voted: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who voted) = (6386 (all who voted) * 5722 (all women) ) 10000 (total number)
  • 57. Women who voted: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who voted) = (6386 (all who voted) * 5722 (all women) ) 10000 (total number)
  • 58. Women who voted: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who voted) = (6386 (all who voted) * 5722 (all women) ) 10000 (total number)
  • 59. Women who voted: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who voted) = (36523526 ((all who voted) * (all women)) ) 10000 (total number)
  • 60. Women who voted: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who voted) = (3652.3526 ((all who voted) * (all women)))/total number
  • 61. Women who voted: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who voted) = (3652 ((all who voted) * (all women)))/total number
  • 62. Women who voted: EXPECTED Men Women _ TABLE Voted 2731 3652 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who voted) = (3652 ((all who voted) * (all women)))/total number
  • 63. Women who voted: EXPECTED Men Women _ TABLE Voted 2731 3652 6386 Didn't vote 1486 2131 3617 4278 5722 10000 What is the expected value for Men who Didn’t Vote?
  • 65. Men who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000
  • 66. Men who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who didn’t vote) = (3617 (all who didn’t vote) * 4278 (all men) ) 10000 (total number)
  • 67. Men who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who didn’t vote) = (3617 (all who didn’t vote) * 4278 (all men) ) 10000 (total number)
  • 68. Men who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who didn’t vote) = (3617 (all who didn’t vote) * 4278 (all men) ) 10000 (total number)
  • 69. Men who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who didn’t vote) = (15473526 ((all who didn’t vote) * (all men)) ) 10000 (total number)
  • 70. Men who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Men who didn’t vote) = (1547.3526 ((all who didn’t vote) * (all men)) / (total number))
  • 71. Men who didn’t vote: EXPECTED Men Women _ TABLE Voted 2731 3652 6386 Didn't vote 1547 2131 3617 4278 5722 10000 Expected Value (Men who didn’t vote) = (1547 ((all who didn’t vote) * (all men)) / (total number))
  • 72. Men who didn’t vote: EXPECTED Men Women _ TABLE Voted 2731 3652 6386 Didn't vote 1547 2131 3617 4278 5722 10000 What is the expected value for Women who Didn’t Vote?
  • 73. Women who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000
  • 74. Women who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who didn’t vote) = (3617 (all who didn’t vote) * 5722 (all women) ) 10000 (total number)
  • 75. Women who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who didn’t vote) = (3617 (all who didn’t vote) * 5722 (all women) ) 10000 (total number)
  • 76. Women who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who didn’t vote) = (3617 (all who didn’t vote) * 5722 (all women) ) 10000 (total number)
  • 77. Women who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who didn’t vote) = (20696474 (all who didn’t vote) * (all women) ) 10000 (total number)
  • 78. Women who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who didn’t vote) = (2069.6474 (all who didn’t vote) * (all women)) /(total number)
  • 79. Women who didn’t vote: OBSERVED Men Women _ TABLE Voted 2792 3591 6386 Didn't vote 1486 2131 3617 4278 5722 10000 Expected Value (Women who didn’t vote) = (2070 (all who didn’t vote) * (all women)) /(total number)
  • 80. Men who didn’t vote: EXPECTED Men Women _ TABLE Voted 2731 3652 6386 Didn't vote 1547 2070 3617 4278 5722 10000
  • 81. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 4278 5722 10000 EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 10000
  • 82. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 4278 5722 10000 EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 10000 With the information above, we can now plug in the numbers using the Chi-square independence test.
  • 83. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 4278 5722 10000 EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 10000 With the information above, we can now plug in the numbers using the Chi-square independence test. Note – this is the same equation that is used with the Chi-square goodness of fit test:
  • 84. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 4278 5722 10000 EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 10000 With the information above, we can now plug in the numbers using the Chi-square independence test. Note – this is the same equation that is used with the Chi-square goodness of fit test: 푥2 = Σ (푂 − 퐸)2 퐸
  • 85. 푥2 = 횺 (푂 − 퐸)2 퐸
  • 86. 푥2 = 횺 (푂 − 퐸)2 퐸
  • 87. 푥2 = 횺 (푂 − 퐸)2 퐸 Or in this case:
  • 88. 푥2 = 횺 (푂 − 퐸)2 퐸 Or in this case: 푥2 = (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸
  • 89. 푥2 = 횺 (푂 − 퐸)2 퐸 Or in this case: 푥2 = (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸
  • 90. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 4278 5722 10000 EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 10000
  • 91. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 4278 5722 Voting Men 10000 EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 10000 푥2 = (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 Voting Men
  • 92. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 푥2 = (2792 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 Voting Men
  • 93. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 푥2 = (2792 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 Voting Men
  • 94. EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 4278 5722 OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men - = Difference 푥2 = (2792 − 2731)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 Voting Men
  • 95. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 푥2 = (2792 − 2731)2 2731 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 Voting Men
  • 96. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 4278 5722 Voting Men - = Difference 푥2 = (2792 − 2731)2 2731 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 Voting Women
  • 97. EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 4278 5722 OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men - = Difference 푥2 = (2792 − 2731)2 2731 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 Voting Women
  • 98. EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 4278 5722 OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men - = Difference 푥2 = (2792 − 2731)2 2731 + (3591 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 Voting Women
  • 99. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 Voting Women 푥2 = (2792 − 2731)2 2731 + (3591 − 퐸)2 퐸 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸
  • 100. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 Voting Women 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸
  • 101. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 Non-Voting Men 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (푂 − 퐸)2 퐸 + (푂 − 퐸)2 퐸
  • 102. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 Non-Voting Men 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (1486 − 퐸)2 퐸 + (푂 − 퐸)2 퐸
  • 103. EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 4278 5722 OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men - = Difference Non-Voting Men 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (1486 − 퐸)2 퐸 + (푂 − 퐸)2 퐸
  • 104. EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 4278 5722 OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men - = Difference Non-Voting Men 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (1486 − 1547)2 1547 + (푂 − 퐸)2 퐸
  • 105. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 Non-Voting Women 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (1486 − 1547)2 1547 + (푂 − 퐸)2 퐸
  • 106. OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 - = Difference 4278 5722 Non-Voting Women 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (1486 − 1547)2 1547 + (2131 − 퐸)2 퐸
  • 107. EXPECTED Men Women TABLE Voted 2731 3652 Didn't vote 1547 2070 4278 5722 OBSERVED Men Women TABLE Voted 2792 3591 Didn't vote 1486 2131 Voting Men - = Difference Non-Voting Women 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (1486 − 1547)2 1547 + (2131 − 2070)2 2070
  • 109. Time to Calculate: 푥2 = (2792 − 2731)2 2731 + (3591 − 3652)2 3652 + (1486 − 1547)2 1547 + (2131 − 2070)2 2070
  • 110. Time to Calculate: 푥2 = (61)2 2731 + (3591 − 3652)2 3652 + (1486 − 1547)2 1547 + (2131 − 2070)2 2070
  • 111. Time to Calculate: 푥2 = 3721 2731 + (3591 − 3652)2 3652 + (1486 − 1547)2 1547 + (2131 − 2070)2 2070
  • 112. Time to Calculate: 푥2 = 1.4 + (3591 − 3652)2 3652 + (1486 − 1547)2 1547 + (2131 − 2070)2 2070
  • 113. Time to Calculate: 푥2 = 1.4 + (−61)2 3652 + (1486 − 1547)2 1547 + (2131 − 2070)2 2070
  • 114. Time to Calculate: 푥2 = 1.4 + 3721 3652 + (1486 − 1547)2 1547 + (2131 − 2070)2 2070
  • 115. Time to Calculate: 푥2 = 1.4 + 1.0 + (1486 − 1547)2 1547 + (2131 − 2070)2 2070
  • 116. Time to Calculate: 푥2 = 1.4 + 1.0 + (61)2 1547 + (2131 − 2070)2 2070
  • 117. Time to Calculate: 푥2 = 1.4 + 1.0 + 3721 1547 + (2131 − 2070)2 2070
  • 118. Time to Calculate: 푥2 = 1.4 + 1.0 + 2.4 + (2131 − 2070)2 2070
  • 119. Time to Calculate: 푥2 = 1.4 + 1.0 + 2.4 + (61)2 2070
  • 120. Time to Calculate: 푥2 = 1.4 + 1.0 + 2.4 + 3721 2070
  • 121. Time to Calculate: 푥2 = 1.4 + 1.0 + 2.4 + 1.8
  • 122. Time to Calculate: 푥2 = 6.6
  • 123. Now we determine if a 푥2of 6.6 exceeds the critical 푥2 for terms.
  • 124. To calculate the 푥2 critical we first must determine the degrees of freedom as well as set the probability level.
  • 125. To calculate the 푥2 critical we first must determine the degrees of freedom as well as set the probability level. The probability or alpha level means the probability of a type 1 error we are willing to live with (i.e., this is the probability of being wrong when we reject the null hypothesis). Generally this value is .05 which is like saying we are willing to be wrong 5 out of 100 times (.05) before we will reject the null-hypothesis.
  • 126. Degrees of Freedom are calculated by taking the number rows and subtracting them by 1 and then multiplying the result by taking the number of columns and subtracting them by 1.
  • 127. Degrees of Freedom are calculated by taking the number rows and subtracting them by 1 and then multiplying the result by taking the number of columns and subtracting them by 1. (Two rows -1) or (2-1) X (2-1) or 1X1=1. Degrees of Freedom = 1.
  • 128. We now have all of the information we need to determine the critical 푥2.
  • 129. We now have all of the information we need to determine the critical 푥2. We go to the Chi-Square Distribution Table and locate the degrees of freedom:
  • 130. We now have all of the information we need to determine the critical 푥2. We go to the Chi-Square Distribution Table and locate the degrees of freedom: df 0.100 0.050 0.025 1 2.71 3.84 5.02 2 4.61 5.99 7.38 3 6.25 7.82 9.35 4 7.78 9.49 11.14 5 9.24 11.07 12.83 6 10.64 12.59 14.45 7 12.02 14.07 16.10 8 13.36 15.51 17.54 9 14.68 16.92 19.20 … … … …
  • 131. We now have all of the information we need to determine the critical 푥2. We go to the Chi-Square Distribution Table and locate the degrees of freedom: df 0.100 0.050 0.025 1 2.71 3.84 5.02 2 4.61 5.99 7.38 3 6.25 7.82 9.35 4 7.78 9.49 11.14 5 9.24 11.07 12.83 6 10.64 12.59 14.45 7 12.02 14.07 16.10 8 13.36 15.51 17.54 9 14.68 16.92 19.20 … … … … And then we locate the probability or alpha level:
  • 132. We now have all of the information we need to determine the critical 푥2. We go to the Chi-Square Distribution Table and locate the degrees of freedom: df 0.100 0.050 0.025 1 2.71 3.84 5.02 2 4.61 5.99 7.38 3 6.25 7.82 9.35 4 7.78 9.49 11.14 5 9.24 11.07 12.83 6 10.64 12.59 14.45 7 12.02 14.07 16.10 8 13.36 15.51 17.54 9 14.68 16.92 19.20 … … … … And then we locate the probability or alpha level: Where these two values intersect in the table we find the critical 푥2.
  • 133. Since the chi-square goodness of fit value (6.6) exceeds the critical 푥2 (3.84) we will reject the null-hypothesis.
  • 134. Since the chi-square goodness of fit value (6.6) exceeds the critical 푥2 (3.84) we will reject the null-hypothesis. Voting patterns and gender status are not statistically significantly dependent on one another.
  • 135. Since the chi-square goodness of fit value (6.6) exceeds the critical 푥2 (3.84) we will reject the null-hypothesis. Voting patterns and gender status are not statistically significantly dependent on one another.
  • 136. Since the chi-square goodness of fit value (6.6) exceeds the critical 푥2 (3.84) we will reject the null-hypothesis. Voting patterns and gender status are not statistically significantly dependent on one another. There actually is a significant difference.
  • 137. So what is the difference between chi-square test of goodness of fit and test of independence?
  • 138. A goodness-of-fit test is a one variable Chi-square test.
  • 139. A goodness-of-fit test is a one variable Chi-square test. In this example, a department chair wants to know if the enrollments across three professors are equally distributed.
  • 140. A goodness-of-fit test is a one variable Chi-square test. In this example, a department chair wants to know if the enrollments across three professors are equally distributed. Here is the actual, or observed, data:
  • 141. A goodness-of-fit test is a one variable Chi-square test. In this example, a department chair wants to know if the enrollments across three professors are equally distributed. Here is the actual, or observed, data: OBSERVED TABLE Prof A’s Class Prof B’s Class Prof C’s Class Students enrolled 31 25 10
  • 142. A goodness-of-fit test is a one variable Chi-square test. OBSERVED TABLE Prof A’s Class Prof B’s Class Prof C’s Class Students enrolled 31 25 10
  • 143. A goodness-of-fit test is a one variable Chi-square test. OBSERVED TABLE Prof A’s Class Prof B’s Class Prof C’s Class Students enrolled 31 25 10
  • 144. A test of independence is a two variable Chi-square test.
  • 145. A test of independence is a two variable Chi-square test. For example, a department chair wants to know if women and men enrollments are equally distributed across three professor classes.
  • 146. A test of independence is a two variable Chi-square test. For example, a department chair wants to know if women and men enrollments are equally distributed across three professor classes. OBSERVED TABLE Prof A’s Class Prof B’s Class Prof C’s Class Men 21 7 7 Women 10 18 3
  • 147. A test of independence is a two variable (gender) Chi-square test. OBSERVED TABLE Prof A’s Class Prof B’s Class Prof C’s Class Men 21 7 7 Women 10 18 3