SlideShare ist ein Scribd-Unternehmen logo
1 von 4
Downloaden Sie, um offline zu lesen
International Journal of Mathematics and Statistics Invention (IJMSI)
E-ISSN: 2321 – 4767 || P-ISSN: 2321 - 4759
www.ijmsi.org || Volume 2 || Issue 2 || February - 2014 || PP-07-10

On The Use of Analysis of Variance under Unequal group
variances
Abidoye, A. O, 2Jolayemi, E. T, 3Sanni, O. O. M and 4Oyejola, B. A
Department of Statistics, University of Ilorin, Ilorin, Kwara State, Nigeria.

ABSTRACT: In this study, we imposed Analysis of variances test (ANOVA) under unequal variances when
2

the groups variances differ. It will be very inappropriate to use the pooled sample variance ( S P ) as a single
2

value for the variances, instead the sample harmonic mean of variances ( S H ) is proposed, examined and found
useful for unequal variances. Data set from Kwara State Ministry of Health on the incidence of diabetes diseases
for male patients was used to illustrate the relevance of our proposed test statistic.

KEYWORDS: Harmonic mean of variances, Analysis of variance (ANOVA) ,chi- square distribution,
modified t –

I.

INTRODUCTION

Analysis of Variance is one of the most popular models in statistics. In general, interest is in testing the
homogeneity of the different group means using the classical Analysis of Variance (ANOVA). However, the
standard assumption of homogeneous error variances which is crucial in ANOVA is seldomly met in statistical
practice. In such a case one has to assume a model with heteroscedastic error variances. Analysis of variance
assumes that the sample data sets have been drawn from populations that follow a normal distribution. One-way
analysis of variance for example, evaluates the effect of a single factor on a single response variable. For
example, a clinician may be interested in determining whether or not there are differences in the age distribution
of patients enrolled in different study groups. To satisfy the assumptions, the patients must be selected randomly
from each of the population groups, a value for age for the response variable is recorded for each sampled
patient, the distribution of the response variable can then be assumed to be normally distributed. See Ott, 1984
and Abidoye (2012).
The analysis of variance uses a linear regression approach and consequently supports unequal sample
sizes. This is important because designers of experiments seldom have complete control over the ultimate
sample sizes in their studies. However, two-way analysis of variance for example does not support empty cells
(factor levels with no sample data points). Each of the factors must have two or more levels and the factor
combination must have one or more sample observations.
The conventional analysis of variance (ANOVA) is also based on the assumption of normality,
independence of errors and equality of the error variances. Studies have shown that the F-test is not robust under
the violation of equal error variances, especially if the sample sizes are not equal and some authors have
developed an exact Analysis of variance for testing the means of g independent normal populations by using one
or two stage procedures. See Jonckheere (1954), Dunnett (1964), Montgomery (1981), Dunnet and Tamhane
(1997), Yahya and Jolayemi (2003).
This test procedure having a specified type one error rate of α should be powerful to determine
whether the differences among the sample means are large enough to imply that the corresponding population
means are different. See Abidoye et.al (2013a, 2013b)
II. METHODOLOGY
We are interested in developing a suitable test procedure to test the hypothesis:

H 0 : 1   2  ...   g   against non-directional alternative, H1: i   , for at least one i,
i  1,2,...,g ………………………………………………………………………………………………………
……………….….…………..(2.1)
where the error term

eij ~ N (0, i2 ) i  1,2,...,g .
www.ijmsi.org

7|Page
On The Use of Analysis of Variance under…
If we define

 i  i   ,

…………………………………………………………………………………….…………………….…….(2.2)
then,

H 0 :  i  i    0  i vs H1 :  i  0

The unbiased estimate of

i

for at least one i

is

ˆ
i  X i  X …………………………………………………………………….…………………………………
…………………………...(2.3)
Therefore

ˆ
i ~ N[i , V (i )]
where

V ( i )  V (Yi )

 V ( Xi  X )
 V ( X i )  V ( X )  2 cov(X i , X )
 V ( X i )  V ( X )  2 cov(X i ,

X )
i

g

 V ( X i )  V ( X )  2 cov(X i ,


 i2



 i2

ni
ni






2
H

n
2
H

n

Xi
)
g



2
V ( Xi )
g



2 i2
gni , see Abidoye (2012) and Abidoye et.al (2013c)

2
H

 22
 1   i
n  g  ni



………………………………………………………………………………………..…(2.4)
Therefore,

V ( i ) 

 i2
ni



2
n

……………………………………………………………………..........................................

.................(2.5)

1
2 
21
  H   (1  ) 
n n
g  …………………………………………………………………………………………
i


………………………..(2.6)
2
ˆ2
 H  SH

1  1 1
1 
   2  2  ...  2 
s g 
 g  s1 s2

 
S2

1

S2

where i is the variance of the ith group. H has r degrees of freedom, where r = 22.096 + 0.266(ng) – 0.000029(n-g)2 as defined in Abidoye et. al (2013).
2
SH

is likened to the common variance as defined in one – way Analysis of variance ANOVA. The one – way
ANOVA is presented in Table 1 below. In the table one the sum of squares between groups and adjusted sum of
squares do not necessarily sum to total sum of squares, also the degrees of freedom.

www.ijmsi.org

8|Page
On The Use of Analysis of Variance under…
ANALYSIS OF VARIANCE TABLE
Table 1: One – way analysis of variance with unequal group variances
Source variation

d.f

Treatment (between groups)

t-1

SS

MS

Y
SSt 

( Yij ) 2

2
i.

t



F

SSt/t-1 =MSSt

n

MSSt /

2
MS H

Adjusted error

Total

2
r  SH

r

2
SH

SST   Yij2 

n-1

( Yij ) 2
n

where r =  = [r+⅟2]
.
Small simulations show that without any loss of generality nearest integer values can be used to
approximate the r degrees of freedom.
III. APPLICATION
The data used in this study is applicable in Medicine; the data used were secondary data, collected
primarily by Kwara State Ministry of Health, Ilorin, Kwara State, Nigeria. They were extracts from incidence
of diabetes diseases for male patients for ten consecutive years, covering the period 2001 – 2010.
Table 2: Showing the incidence of diabetes diseases for male patients in Kwara State for ten
years (2001- 2010).
Years
Zone A
Zone B
Zone C
Zone D

1
37
14
15
11

2
80
19
18
19

3
58
12
11
10

4
48
21
19
18

5
35
23
22
23

6
46
13
14
12

7
53
15
13
14

8
39
16
15
16

9
64
11
10
12

10
76
14
13
15

We need to verify the equality of the variances between these four zones. That is, testing the
hypothesis;
2
2
2
2
H 0 :  A   B   C   D vs H1 :  i2   2 for atleast
j

(i, j )

i  A, B, C , D

, j  A, B , C , D
Table 3: Levene test for variance equality
Response

Levene Statistic
10.975

df1
3

df2
36

P-value
0.000

Since P- value < 0.05 we therefore reject H0 and therefore conclude that the variances are not equal.
Computation on incidence of diabetes diseases for male patients: From the data above the following summary
statistics were obtained:
Zone A:
Zone B:
Zone C:

2
YA  53 .6, S A  250 .04 , n A  10
2
YB  15.8, S B  15.7, nB  10

2
YC  15 .0, S C  13 .9, nC  10

www.ijmsi.org

9|Page
On The Use of Analysis of Variance under…
Zone D:

2
YD  15.0, S D  16.7, nD  10

1

4

In the above, data set,

ni  10 , g = 4 , n   ni  40
i 1

1 4 1 
S   2
4 s 
 i 1 i  ,
,
2
H

2
S H  20 .05

The main hypothesis is

H 0 :  A   B  C   D   against H1: i   , for at least one i, i.e i  A, B,...,D

SST   Yij2 

( Yij ) 2
n

=372+802 +….+122 + 152 – 7182.4
=31,209.6

Y
SSt 

2
i.

t



( Yij ) 2
n

= 35726 – 7182.4
= 28543.6
SSE = SST – SSt
= 31209.6 – 28 543.6
= 2666

Source variation

Analysis Of Variance Table
d.f
SS
MS

Between groups
Ajusted error

3
31.63

28543.6
634.18

Total

39

31209.6

9514.5
20.05

F
474.54


F3, 31.63,  F3, 32 ,  8.62
Analysis of variance shows that incidence rate of diabetes diseases in the four zones are significantly
different at 5% level of significance which support the earlier results (Abidoye et. al 2013c), even though the
sum of squares due to zones plus the adjusted error sum of square did not sum up to total sum of squares.
IV. CONCLUSION
In this work we have established that ANOVA test may suffice if the adjusted error (the Harmonic
mean of the population variances) is known. The sum of square treatments and sum of square of adjusted error
may not necessarily sum up to sum of squares; and the adjusted error degrees of freedom need not necessarily be
an integer.

REFERENCES
[1]
[2]

[3]

[4]

Abidoye, A. O (2012): Development of Hypothesis Testing Technique for Ordered Alternatives under heterogeneous variances.
Unpublished Ph.D Thesis submitted to Dept. of Statistics, University of Ilorin, Ilorin.
Abidoye, A.O, Jolayemi, E.T, Sanni, O.O.M and Oyejola, B.A (2013a): On Hypothesis Testing Under Unequal Group
Variances:The Use of the Harmonic variance.The International Journal of Institute for science, Technology and Education. Vol.
3, No 7, Page 11 - 15
Abidoye, A.O, Jolayemi, E.T, Sanni, O.O.M and Oyejola, B.A (2013b): Development of a TestStatistic for Testing Equality of
Two Means Under Unequal Population Variances..The International Journal of Institute for science, Technology and Education.
Vol. 3, No 10, page 10 -14.
Abidoye, A.O, Jolayemi, E.T, Sanni, O.O.M and Oyejola, B.A (2013c): Development of a Test Statistic for Testing Equality of
Means Under Unequal Population Variances. The International Journal of Mathematics and Statistics Invention. Vol. 1, page 38 -43.

[5]
[6]
[7]
[8]
[9]
[10]

Dunnett, C. W and Tamhane, A. C (1997): Multiple testing to establish superiority / equivalence of a new treatment compare
with k standard treatments. Statistics in Medicine 16, 2489 – 2506.
Dunnett , C. W (1964): New tables for multiple comparison with a control. Biometric, 20, 482-491.
Jonckheere, A.R (1954): “A distribution-free k-sample test against ordered alternative.” Biometrical, 41, 133-145.
Montgomery, D.C (1981): “Design and Analysis of Experiment.” Second Edition. John Wiley
and sons Inc. New York.
Ott, L (1984): “An Introduction To Statistical Methods and Data Analysis”. Second edition. P.W.S Publisher. Boston.
Yahya, W.B and Jolayemi, E.T (2003): Testing Ordered Means against a Control. JNSA,16, 40- 51.

www.ijmsi.org

10 | P a g e

Weitere ähnliche Inhalte

Was ist angesagt?

One way anova final ppt.
One way anova final ppt.One way anova final ppt.
One way anova final ppt.Aadab Mushrib
 
One Way Anova
One Way AnovaOne Way Anova
One Way Anovashoffma5
 
Medical Statistics Pt 1
Medical Statistics Pt 1Medical Statistics Pt 1
Medical Statistics Pt 1Fastbleep
 
One-way ANOVA for Randomized Complete Block Design (RCBD)
One-way ANOVA for Randomized Complete Block Design (RCBD)One-way ANOVA for Randomized Complete Block Design (RCBD)
One-way ANOVA for Randomized Complete Block Design (RCBD)Siti Nur Adila Hamzah
 
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...Lisette Giepmans
 
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATARESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATAorajjournal
 
Anova in easyest way
Anova in easyest wayAnova in easyest way
Anova in easyest wayBidyut Ghosh
 
Categorical data final
Categorical data finalCategorical data final
Categorical data finalMonika
 
Proportion test using Chi square
Proportion test using Chi squareProportion test using Chi square
Proportion test using Chi squareParag Shah
 
Matching Weights to Simultaneously Compare Three Treatment Groups: a Simulati...
Matching Weights to Simultaneously Compare Three Treatment Groups: a Simulati...Matching Weights to Simultaneously Compare Three Treatment Groups: a Simulati...
Matching Weights to Simultaneously Compare Three Treatment Groups: a Simulati...Kazuki Yoshida
 
8 2008-normative data for the letter cancellation task in school children
8 2008-normative data for the letter cancellation task in school children8 2008-normative data for the letter cancellation task in school children
8 2008-normative data for the letter cancellation task in school childrenElsa von Licy
 

Was ist angesagt? (20)

Comparing means
Comparing meansComparing means
Comparing means
 
Medical statistics2
Medical statistics2Medical statistics2
Medical statistics2
 
One way anova final ppt.
One way anova final ppt.One way anova final ppt.
One way anova final ppt.
 
One Way Anova
One Way AnovaOne Way Anova
One Way Anova
 
Medical Statistics Pt 1
Medical Statistics Pt 1Medical Statistics Pt 1
Medical Statistics Pt 1
 
Analysis of variance anova
Analysis of variance anovaAnalysis of variance anova
Analysis of variance anova
 
One-way ANOVA for Randomized Complete Block Design (RCBD)
One-way ANOVA for Randomized Complete Block Design (RCBD)One-way ANOVA for Randomized Complete Block Design (RCBD)
One-way ANOVA for Randomized Complete Block Design (RCBD)
 
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
BioSHaRE: Analysis of mixed effects models using federated data analysis appr...
 
Propensity Scores in Medical Device Trials
Propensity Scores in Medical Device TrialsPropensity Scores in Medical Device Trials
Propensity Scores in Medical Device Trials
 
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATARESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
 
Anova in easyest way
Anova in easyest wayAnova in easyest way
Anova in easyest way
 
Categorical data final
Categorical data finalCategorical data final
Categorical data final
 
Proportion test using Chi square
Proportion test using Chi squareProportion test using Chi square
Proportion test using Chi square
 
Chi squared test
Chi squared test Chi squared test
Chi squared test
 
Matching Weights to Simultaneously Compare Three Treatment Groups: a Simulati...
Matching Weights to Simultaneously Compare Three Treatment Groups: a Simulati...Matching Weights to Simultaneously Compare Three Treatment Groups: a Simulati...
Matching Weights to Simultaneously Compare Three Treatment Groups: a Simulati...
 
Crd tutorial
Crd tutorialCrd tutorial
Crd tutorial
 
Non parametric test
Non parametric testNon parametric test
Non parametric test
 
8 2008-normative data for the letter cancellation task in school children
8 2008-normative data for the letter cancellation task in school children8 2008-normative data for the letter cancellation task in school children
8 2008-normative data for the letter cancellation task in school children
 
Assumptions of ANOVA
Assumptions of ANOVAAssumptions of ANOVA
Assumptions of ANOVA
 
Nonparametric and Distribution- Free Statistics _contd
Nonparametric and Distribution- Free Statistics _contdNonparametric and Distribution- Free Statistics _contd
Nonparametric and Distribution- Free Statistics _contd
 

Andere mochten auch

Omd xmas report2012
Omd xmas report2012Omd xmas report2012
Omd xmas report2012artfull
 
Распределение
РаспределениеРаспределение
Распределениеi-fa
 
At&t 2014 eft metrics feb 2013 final
At&t 2014 eft metrics feb 2013 finalAt&t 2014 eft metrics feb 2013 final
At&t 2014 eft metrics feb 2013 finalchristinakornrich
 
デジタルサイネージ製品のご提案 英語幼児教育向け
デジタルサイネージ製品のご提案 英語幼児教育向けデジタルサイネージ製品のご提案 英語幼児教育向け
デジタルサイネージ製品のご提案 英語幼児教育向けkanefukutrading
 
Earthquake top list [autosaved]
Earthquake top list [autosaved]Earthquake top list [autosaved]
Earthquake top list [autosaved]Stiven Peter
 
Vocabulario de navidad 1
Vocabulario de navidad 1Vocabulario de navidad 1
Vocabulario de navidad 1Voyenbicicleta
 
Presentación los andes
Presentación los andesPresentación los andes
Presentación los andeskriveram22
 
Pueblos Fumigados 2 do Encuentro; San Lorenzo
Pueblos Fumigados  2 do Encuentro; San LorenzoPueblos Fumigados  2 do Encuentro; San Lorenzo
Pueblos Fumigados 2 do Encuentro; San LorenzoAnabela Tramontini
 
La Experiencia De La Rehabilitacion De La VíA Ferrea En China
La Experiencia De La Rehabilitacion De La VíA Ferrea En ChinaLa Experiencia De La Rehabilitacion De La VíA Ferrea En China
La Experiencia De La Rehabilitacion De La VíA Ferrea En ChinaGrupo Riel
 
Traballo Voluntario
Traballo VoluntarioTraballo Voluntario
Traballo Voluntarioguestb9eef
 
La familia como unidad de superviviencia de sentido y de cambio
La familia como unidad de superviviencia de sentido y de cambioLa familia como unidad de superviviencia de sentido y de cambio
La familia como unidad de superviviencia de sentido y de cambiofamiliacles
 

Andere mochten auch (20)

Omd xmas report2012
Omd xmas report2012Omd xmas report2012
Omd xmas report2012
 
Jvc
JvcJvc
Jvc
 
Распределение
РаспределениеРаспределение
Распределение
 
Fsm voluntarios
Fsm voluntariosFsm voluntarios
Fsm voluntarios
 
At&t 2014 eft metrics feb 2013 final
At&t 2014 eft metrics feb 2013 finalAt&t 2014 eft metrics feb 2013 final
At&t 2014 eft metrics feb 2013 final
 
Redes on line
Redes on lineRedes on line
Redes on line
 
デジタルサイネージ製品のご提案 英語幼児教育向け
デジタルサイネージ製品のご提案 英語幼児教育向けデジタルサイネージ製品のご提案 英語幼児教育向け
デジタルサイネージ製品のご提案 英語幼児教育向け
 
Earthquake top list [autosaved]
Earthquake top list [autosaved]Earthquake top list [autosaved]
Earthquake top list [autosaved]
 
L
LL
L
 
Qué es un blog2
Qué es un blog2Qué es un blog2
Qué es un blog2
 
Vocabulario de navidad 1
Vocabulario de navidad 1Vocabulario de navidad 1
Vocabulario de navidad 1
 
A paz
A pazA paz
A paz
 
Presentación los andes
Presentación los andesPresentación los andes
Presentación los andes
 
2011 Antal International Global PresentationespañOl
2011 Antal International  Global PresentationespañOl2011 Antal International  Global PresentationespañOl
2011 Antal International Global PresentationespañOl
 
Starbucks
StarbucksStarbucks
Starbucks
 
Pueblos Fumigados 2 do Encuentro; San Lorenzo
Pueblos Fumigados  2 do Encuentro; San LorenzoPueblos Fumigados  2 do Encuentro; San Lorenzo
Pueblos Fumigados 2 do Encuentro; San Lorenzo
 
La Experiencia De La Rehabilitacion De La VíA Ferrea En China
La Experiencia De La Rehabilitacion De La VíA Ferrea En ChinaLa Experiencia De La Rehabilitacion De La VíA Ferrea En China
La Experiencia De La Rehabilitacion De La VíA Ferrea En China
 
Seminariopresen
SeminariopresenSeminariopresen
Seminariopresen
 
Traballo Voluntario
Traballo VoluntarioTraballo Voluntario
Traballo Voluntario
 
La familia como unidad de superviviencia de sentido y de cambio
La familia como unidad de superviviencia de sentido y de cambioLa familia como unidad de superviviencia de sentido y de cambio
La familia como unidad de superviviencia de sentido y de cambio
 

Ähnlich wie International Journal of Mathematics and Statistics Invention (IJMSI)

anovappt-141025002857-conversion-gate01 (1).pdf
anovappt-141025002857-conversion-gate01 (1).pdfanovappt-141025002857-conversion-gate01 (1).pdf
anovappt-141025002857-conversion-gate01 (1).pdfGorachandChakraborty
 
anovappt-141025002857-conversion-gate01 (1)_240403_185855 (2).docx
anovappt-141025002857-conversion-gate01 (1)_240403_185855 (2).docxanovappt-141025002857-conversion-gate01 (1)_240403_185855 (2).docx
anovappt-141025002857-conversion-gate01 (1)_240403_185855 (2).docxprasad439227
 
Analyzing experimental research data
Analyzing experimental research dataAnalyzing experimental research data
Analyzing experimental research dataAtula Ahuja
 
Data analysis using spss for two sample t-test tutorial
Data analysis using spss for two sample t-test tutorialData analysis using spss for two sample t-test tutorial
Data analysis using spss for two sample t-test tutorialDaniel Sarpong
 
Epidemiological study design and it's significance
Epidemiological study design and it's significanceEpidemiological study design and it's significance
Epidemiological study design and it's significanceGurunathVhanmane1
 
25_Anderson_Biostatistics_and_Epidemiology.ppt
25_Anderson_Biostatistics_and_Epidemiology.ppt25_Anderson_Biostatistics_and_Epidemiology.ppt
25_Anderson_Biostatistics_and_Epidemiology.pptPriyankaSharma89719
 
Analyzing experimental research data
Analyzing experimental research dataAnalyzing experimental research data
Analyzing experimental research dataAtula Ahuja
 
Tugasan kumpulan anova
Tugasan kumpulan anovaTugasan kumpulan anova
Tugasan kumpulan anovapraba karan
 
2.AA.anova sesion applied biostat III (2).ppt
2.AA.anova sesion applied biostat III (2).ppt2.AA.anova sesion applied biostat III (2).ppt
2.AA.anova sesion applied biostat III (2).pptssuser504dda
 
Lecture-6 (t-test and one way ANOVA.ppt
Lecture-6 (t-test and one way ANOVA.pptLecture-6 (t-test and one way ANOVA.ppt
Lecture-6 (t-test and one way ANOVA.ppthabtamu biazin
 

Ähnlich wie International Journal of Mathematics and Statistics Invention (IJMSI) (20)

ANOVA.pdf
ANOVA.pdfANOVA.pdf
ANOVA.pdf
 
anovappt-141025002857-conversion-gate01 (1).pdf
anovappt-141025002857-conversion-gate01 (1).pdfanovappt-141025002857-conversion-gate01 (1).pdf
anovappt-141025002857-conversion-gate01 (1).pdf
 
Anova ppt
Anova pptAnova ppt
Anova ppt
 
SNM-1.pdf
SNM-1.pdfSNM-1.pdf
SNM-1.pdf
 
anovappt-141025002857-conversion-gate01 (1)_240403_185855 (2).docx
anovappt-141025002857-conversion-gate01 (1)_240403_185855 (2).docxanovappt-141025002857-conversion-gate01 (1)_240403_185855 (2).docx
anovappt-141025002857-conversion-gate01 (1)_240403_185855 (2).docx
 
Analyzing experimental research data
Analyzing experimental research dataAnalyzing experimental research data
Analyzing experimental research data
 
Data analysis using spss for two sample t-test tutorial
Data analysis using spss for two sample t-test tutorialData analysis using spss for two sample t-test tutorial
Data analysis using spss for two sample t-test tutorial
 
Epidemiological study design and it's significance
Epidemiological study design and it's significanceEpidemiological study design and it's significance
Epidemiological study design and it's significance
 
25_Anderson_Biostatistics_and_Epidemiology.ppt
25_Anderson_Biostatistics_and_Epidemiology.ppt25_Anderson_Biostatistics_and_Epidemiology.ppt
25_Anderson_Biostatistics_and_Epidemiology.ppt
 
Analyzing experimental research data
Analyzing experimental research dataAnalyzing experimental research data
Analyzing experimental research data
 
Aca 22-407
Aca 22-407Aca 22-407
Aca 22-407
 
K.A.Sindhura-t,z,f tests
K.A.Sindhura-t,z,f testsK.A.Sindhura-t,z,f tests
K.A.Sindhura-t,z,f tests
 
Tugasan kumpulan anova
Tugasan kumpulan anovaTugasan kumpulan anova
Tugasan kumpulan anova
 
Parametric tests
Parametric  testsParametric  tests
Parametric tests
 
2.AA.anova sesion applied biostat III (2).ppt
2.AA.anova sesion applied biostat III (2).ppt2.AA.anova sesion applied biostat III (2).ppt
2.AA.anova sesion applied biostat III (2).ppt
 
Parametric tests
Parametric testsParametric tests
Parametric tests
 
Lecture-6 (t-test and one way ANOVA.ppt
Lecture-6 (t-test and one way ANOVA.pptLecture-6 (t-test and one way ANOVA.ppt
Lecture-6 (t-test and one way ANOVA.ppt
 
Day2 statistical tests
Day2 statistical testsDay2 statistical tests
Day2 statistical tests
 
Survival.pptx
Survival.pptxSurvival.pptx
Survival.pptx
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 

Kürzlich hochgeladen

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 

Kürzlich hochgeladen (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 

International Journal of Mathematics and Statistics Invention (IJMSI)

  • 1. International Journal of Mathematics and Statistics Invention (IJMSI) E-ISSN: 2321 – 4767 || P-ISSN: 2321 - 4759 www.ijmsi.org || Volume 2 || Issue 2 || February - 2014 || PP-07-10 On The Use of Analysis of Variance under Unequal group variances Abidoye, A. O, 2Jolayemi, E. T, 3Sanni, O. O. M and 4Oyejola, B. A Department of Statistics, University of Ilorin, Ilorin, Kwara State, Nigeria. ABSTRACT: In this study, we imposed Analysis of variances test (ANOVA) under unequal variances when 2 the groups variances differ. It will be very inappropriate to use the pooled sample variance ( S P ) as a single 2 value for the variances, instead the sample harmonic mean of variances ( S H ) is proposed, examined and found useful for unequal variances. Data set from Kwara State Ministry of Health on the incidence of diabetes diseases for male patients was used to illustrate the relevance of our proposed test statistic. KEYWORDS: Harmonic mean of variances, Analysis of variance (ANOVA) ,chi- square distribution, modified t – I. INTRODUCTION Analysis of Variance is one of the most popular models in statistics. In general, interest is in testing the homogeneity of the different group means using the classical Analysis of Variance (ANOVA). However, the standard assumption of homogeneous error variances which is crucial in ANOVA is seldomly met in statistical practice. In such a case one has to assume a model with heteroscedastic error variances. Analysis of variance assumes that the sample data sets have been drawn from populations that follow a normal distribution. One-way analysis of variance for example, evaluates the effect of a single factor on a single response variable. For example, a clinician may be interested in determining whether or not there are differences in the age distribution of patients enrolled in different study groups. To satisfy the assumptions, the patients must be selected randomly from each of the population groups, a value for age for the response variable is recorded for each sampled patient, the distribution of the response variable can then be assumed to be normally distributed. See Ott, 1984 and Abidoye (2012). The analysis of variance uses a linear regression approach and consequently supports unequal sample sizes. This is important because designers of experiments seldom have complete control over the ultimate sample sizes in their studies. However, two-way analysis of variance for example does not support empty cells (factor levels with no sample data points). Each of the factors must have two or more levels and the factor combination must have one or more sample observations. The conventional analysis of variance (ANOVA) is also based on the assumption of normality, independence of errors and equality of the error variances. Studies have shown that the F-test is not robust under the violation of equal error variances, especially if the sample sizes are not equal and some authors have developed an exact Analysis of variance for testing the means of g independent normal populations by using one or two stage procedures. See Jonckheere (1954), Dunnett (1964), Montgomery (1981), Dunnet and Tamhane (1997), Yahya and Jolayemi (2003). This test procedure having a specified type one error rate of α should be powerful to determine whether the differences among the sample means are large enough to imply that the corresponding population means are different. See Abidoye et.al (2013a, 2013b) II. METHODOLOGY We are interested in developing a suitable test procedure to test the hypothesis: H 0 : 1   2  ...   g   against non-directional alternative, H1: i   , for at least one i, i  1,2,...,g ……………………………………………………………………………………………………… ……………….….…………..(2.1) where the error term eij ~ N (0, i2 ) i  1,2,...,g . www.ijmsi.org 7|Page
  • 2. On The Use of Analysis of Variance under… If we define  i  i   , …………………………………………………………………………………….…………………….…….(2.2) then, H 0 :  i  i    0  i vs H1 :  i  0 The unbiased estimate of i for at least one i is ˆ i  X i  X …………………………………………………………………….………………………………… …………………………...(2.3) Therefore ˆ i ~ N[i , V (i )] where V ( i )  V (Yi )  V ( Xi  X )  V ( X i )  V ( X )  2 cov(X i , X )  V ( X i )  V ( X )  2 cov(X i , X ) i g  V ( X i )  V ( X )  2 cov(X i ,   i2   i2 ni ni    2 H n 2 H n Xi ) g  2 V ( Xi ) g  2 i2 gni , see Abidoye (2012) and Abidoye et.al (2013c) 2 H  22  1   i n  g  ni   ………………………………………………………………………………………..…(2.4) Therefore, V ( i )   i2 ni  2 n …………………………………………………………………….......................................... .................(2.5) 1 2  21   H   (1  )  n n g  ………………………………………………………………………………………… i   ………………………..(2.6) 2 ˆ2  H  SH 1  1 1 1     2  2  ...  2  s g   g  s1 s2    S2 1 S2 where i is the variance of the ith group. H has r degrees of freedom, where r = 22.096 + 0.266(ng) – 0.000029(n-g)2 as defined in Abidoye et. al (2013). 2 SH is likened to the common variance as defined in one – way Analysis of variance ANOVA. The one – way ANOVA is presented in Table 1 below. In the table one the sum of squares between groups and adjusted sum of squares do not necessarily sum to total sum of squares, also the degrees of freedom. www.ijmsi.org 8|Page
  • 3. On The Use of Analysis of Variance under… ANALYSIS OF VARIANCE TABLE Table 1: One – way analysis of variance with unequal group variances Source variation d.f Treatment (between groups) t-1 SS MS Y SSt  ( Yij ) 2 2 i. t  F SSt/t-1 =MSSt n MSSt / 2 MS H Adjusted error Total 2 r  SH r 2 SH SST   Yij2  n-1 ( Yij ) 2 n where r =  = [r+⅟2] . Small simulations show that without any loss of generality nearest integer values can be used to approximate the r degrees of freedom. III. APPLICATION The data used in this study is applicable in Medicine; the data used were secondary data, collected primarily by Kwara State Ministry of Health, Ilorin, Kwara State, Nigeria. They were extracts from incidence of diabetes diseases for male patients for ten consecutive years, covering the period 2001 – 2010. Table 2: Showing the incidence of diabetes diseases for male patients in Kwara State for ten years (2001- 2010). Years Zone A Zone B Zone C Zone D 1 37 14 15 11 2 80 19 18 19 3 58 12 11 10 4 48 21 19 18 5 35 23 22 23 6 46 13 14 12 7 53 15 13 14 8 39 16 15 16 9 64 11 10 12 10 76 14 13 15 We need to verify the equality of the variances between these four zones. That is, testing the hypothesis; 2 2 2 2 H 0 :  A   B   C   D vs H1 :  i2   2 for atleast j (i, j ) i  A, B, C , D , j  A, B , C , D Table 3: Levene test for variance equality Response Levene Statistic 10.975 df1 3 df2 36 P-value 0.000 Since P- value < 0.05 we therefore reject H0 and therefore conclude that the variances are not equal. Computation on incidence of diabetes diseases for male patients: From the data above the following summary statistics were obtained: Zone A: Zone B: Zone C: 2 YA  53 .6, S A  250 .04 , n A  10 2 YB  15.8, S B  15.7, nB  10 2 YC  15 .0, S C  13 .9, nC  10 www.ijmsi.org 9|Page
  • 4. On The Use of Analysis of Variance under… Zone D: 2 YD  15.0, S D  16.7, nD  10 1 4 In the above, data set, ni  10 , g = 4 , n   ni  40 i 1 1 4 1  S   2 4 s   i 1 i  , , 2 H 2 S H  20 .05 The main hypothesis is H 0 :  A   B  C   D   against H1: i   , for at least one i, i.e i  A, B,...,D SST   Yij2  ( Yij ) 2 n =372+802 +….+122 + 152 – 7182.4 =31,209.6 Y SSt  2 i. t  ( Yij ) 2 n = 35726 – 7182.4 = 28543.6 SSE = SST – SSt = 31209.6 – 28 543.6 = 2666 Source variation Analysis Of Variance Table d.f SS MS Between groups Ajusted error 3 31.63 28543.6 634.18 Total 39 31209.6 9514.5 20.05 F 474.54  F3, 31.63,  F3, 32 ,  8.62 Analysis of variance shows that incidence rate of diabetes diseases in the four zones are significantly different at 5% level of significance which support the earlier results (Abidoye et. al 2013c), even though the sum of squares due to zones plus the adjusted error sum of square did not sum up to total sum of squares. IV. CONCLUSION In this work we have established that ANOVA test may suffice if the adjusted error (the Harmonic mean of the population variances) is known. The sum of square treatments and sum of square of adjusted error may not necessarily sum up to sum of squares; and the adjusted error degrees of freedom need not necessarily be an integer. REFERENCES [1] [2] [3] [4] Abidoye, A. O (2012): Development of Hypothesis Testing Technique for Ordered Alternatives under heterogeneous variances. Unpublished Ph.D Thesis submitted to Dept. of Statistics, University of Ilorin, Ilorin. Abidoye, A.O, Jolayemi, E.T, Sanni, O.O.M and Oyejola, B.A (2013a): On Hypothesis Testing Under Unequal Group Variances:The Use of the Harmonic variance.The International Journal of Institute for science, Technology and Education. Vol. 3, No 7, Page 11 - 15 Abidoye, A.O, Jolayemi, E.T, Sanni, O.O.M and Oyejola, B.A (2013b): Development of a TestStatistic for Testing Equality of Two Means Under Unequal Population Variances..The International Journal of Institute for science, Technology and Education. Vol. 3, No 10, page 10 -14. Abidoye, A.O, Jolayemi, E.T, Sanni, O.O.M and Oyejola, B.A (2013c): Development of a Test Statistic for Testing Equality of Means Under Unequal Population Variances. The International Journal of Mathematics and Statistics Invention. Vol. 1, page 38 -43. [5] [6] [7] [8] [9] [10] Dunnett, C. W and Tamhane, A. C (1997): Multiple testing to establish superiority / equivalence of a new treatment compare with k standard treatments. Statistics in Medicine 16, 2489 – 2506. Dunnett , C. W (1964): New tables for multiple comparison with a control. Biometric, 20, 482-491. Jonckheere, A.R (1954): “A distribution-free k-sample test against ordered alternative.” Biometrical, 41, 133-145. Montgomery, D.C (1981): “Design and Analysis of Experiment.” Second Edition. John Wiley and sons Inc. New York. Ott, L (1984): “An Introduction To Statistical Methods and Data Analysis”. Second edition. P.W.S Publisher. Boston. Yahya, W.B and Jolayemi, E.T (2003): Testing Ordered Means against a Control. JNSA,16, 40- 51. www.ijmsi.org 10 | P a g e