Sample size calculations

Dr Vinodh Kumar O.R
Division of Epidemiology
ICAR-Indian Veterinary Research Institute
Izatnagar, Bareilly-243 122

NEED FOR SAMPLE SIZE CALCULATION
• Sample-size determination is often an important step in
planning an epidemiological study
• An adequate sample size helps ensure that the study will yield
reliable information.
• Conducting a study with an inadequate sample size is not
only futile, it is also unethical.
• Different study designs need different methods of sample size
calculation; no single formula can be used in all
designs.
• Determining sample size is a very important issue because
samples that are too large may waste time, resources and
money, while samples that are too small may lead to
inaccurate results.

• Sampling frame: It is a complete
enumeration of the sampling units in
the study population, which may be a
list, directory, map or aerial
configuration.
• Sampling unit: It may be an
individual, a household or a school.
• Non-representativeness of the study population results in lowered accuracy.
• Small sample size leads to low precision.

Knowledge of the population
parameters
• By pilot surveys
• By use of results of previous surveys
• By intelligent guess

α and confidence level
• Alpha (α): The
significance level of a
test: the probability of
rejecting the null
hypothesis when it is true
(or the probability of
making a Type I error).
• Confidence level: The
probability that an
estimate of a population
parameter is within
certain specified limits of
the true value; commonly
denoted by “1- α”.
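
The link between a confidence level and the z value used in the later formulas can be sketched as below. This is a minimal illustration, assuming a two-sided interval and the standard normal distribution; the function name `z_for_confidence` is made up for this sketch and is not from the slides.

```python
from statistics import NormalDist


def z_for_confidence(confidence: float) -> float:
    """Two-sided standard normal critical value z(1-α/2) for confidence level 1-α."""
    alpha = 1.0 - confidence  # significance level
    return NormalDist().inv_cdf(1.0 - alpha / 2.0)


# Print the critical values for three commonly used confidence levels.
for level in (0.90, 0.95, 0.99):
    print(f"{level:.0%} confidence: z = {z_for_confidence(level):.3f}")
```

The 90% and 95% values (about 1.645 and 1.96) are the ones used in the worked examples in this deck.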

• Beta (β): The probability of
failing to reject the null
hypothesis when it is false (or the
probability of making a Type II
error).
• Power: The probability of
correctly rejecting the null
hypothesis when it is false;
commonly denoted by “1- β”
• Precision: A measure of how
close an estimate is to the true
value of a population parameter.
It may be expressed in absolute
terms or relative to the estimate.
• Degree of precision is the margin
of permissible error between the
estimated value and the
population value.

Basis for determining the size of sample
• Specification of a precision level.
• Specification of level of confidence.
• Power: The likelihood of rejecting the null
hypothesis when the null hypothesis is false.

Margin of error/sampling error
• The margin of error is a statistic expressing the amount of
random sampling error in a survey's results.
• The larger the margin of error, the less confidence one can have
that the survey results reflect the population.
• The difference between the sample statistic and the related
population parameter is called the sampling error.
[Figure: margin of error versus sample size]

https://www.surveymonkey.com/mp/margin-of-error-calculator/
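
How the margin of error shrinks as the sample grows can be sketched as follows. This is only an illustration, assuming a simple random sample, a proportion as the estimate, and the normal approximation; the function name and the sample sizes are hypothetical choices for this sketch.

```python
import math


def margin_of_error(p: float, n: int, z: float = 1.96) -> float:
    """Approximate margin of error for a proportion p estimated from n units."""
    return z * math.sqrt(p * (1.0 - p) / n)


# p = 0.5 is the most conservative (largest variance) assumption.
for n in (100, 400, 1600):
    print(n, round(margin_of_error(0.5, n), 4))
```

In this approximation, quadrupling the sample size halves the margin of error.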

Sample size
• The choice of sample size depends on non-statistical
and statistical considerations.
• Non-statistical: availability of manpower and
sampling frames.
• Statistical: precision of the
estimate of prevalence and the expected
prevalence of the disease.

Sample size required for estimating
population mean
• Suppose we want an interval that extends d units on either side of the
estimator:
d = (reliability coefficient) × (standard error)
• If sampling is from a sufficiently large population, the equation is:
$$d = z \frac{\sigma}{\sqrt{n}}$$
• Solving for n gives:
$$n = \frac{z^2 \sigma^2}{d^2}$$
where
d = half-width of the confidence interval (d units on either side)
z = reliability coefficient for the chosen level of confidence
σ² = population variance

Sample size for population mean
• A farm has 1000 young pigs with an initial weight of about 50 kg. The pigs are
put on a new diet for 3 weeks, and the farm wants to know how many pigs to sample
in order to estimate the average weight gain. The estimate should be
within 2 kg of the true mean at the 90% confidence level.
• We have no idea of σ (the SD), so it must first be estimated, for example
from a pilot survey or from previous results.
• For a 90% confidence level, z = 1.645.
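
A minimal sketch of the calculation n = z²σ²/d² for the pig example follows. The slide text gives d = 2 kg and z = 1.645 but no value for σ, so the 8 kg used here is purely a hypothetical placeholder, as is the function name.

```python
import math


def sample_size_mean(sigma: float, d: float, z: float) -> int:
    """Sample size for estimating a mean to within ±d, rounded up."""
    return math.ceil((z * sigma / d) ** 2)


# ASSUMPTION: sigma is not given in the slides; 8 kg is a made-up pilot estimate.
assumed_sigma_kg = 8.0
n = sample_size_mean(sigma=assumed_sigma_kg, d=2.0, z=1.645)
print(n)
```

With a different estimate of σ the answer changes with σ², which is why a pilot survey or earlier data matter so much here.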

Sample size required for estimating
proportions
• Same as for population mean.
• Assuming random sampling and approximate
normality in the distribution of p, the
formula for n, if sampling is with replacement from a
population sufficiently large to warrant ignoring the
finite population correction, is:
$$n = \frac{z^2 \, p \, q}{d^2}$$
where q = 1 – p

Sample size for a proportion: worked example
• A researcher wants to estimate the true FMD immunization coverage in the cattle
population of a village.
• According to the literature, the immunization coverage should be somewhere around 80%.
• Precision (absolute): we would like the result to be within 4% of the true value.
• Confidence level: conventional = 95% = 1 - α; therefore α = 0.05 and z(1-α/2) = 1.96, the value of
the standard normal distribution corresponding to a significance level of 0.05 (1.96 for a
2-sided test at the 0.05 level).
• d = absolute precision = 0.04
• p = expected proportion in the population = 0.80
$$n = \frac{z^2 \, p \, (1-p)}{d^2} = \frac{(1.96)^2 (0.80)(0.20)}{(0.04)^2} = 384.16$$
so about 384 animals are needed (385 if rounded up to the next whole animal).
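
The arithmetic of the FMD coverage example can be checked with a short sketch like the one below. The inputs (p = 0.80, d = 0.04, z = 1.96) come from the example; the function name is a hypothetical one chosen for illustration.

```python
import math


def sample_size_proportion(p: float, d: float, z: float) -> float:
    """Unrounded sample size for estimating a proportion p to within ±d."""
    return z ** 2 * p * (1.0 - p) / d ** 2


raw_n = sample_size_proportion(p=0.80, d=0.04, z=1.96)
print(round(raw_n, 2))    # unrounded value
print(math.ceil(raw_n))   # rounded up to whole animals
```

Rounding up rather than to the nearest integer is the cautious convention, since a fraction of an animal cannot be sampled.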

Descriptive studies
• In general, these studies can only identify
patterns or trends in disease occurrence over
time or in different geographical locations, but
cannot ascertain the causal agent or degree of
exposure.
• To calculate the required sample size in a
descriptive study, we need to know the level of
precision, level of confidence or risk and
degree of variability.

Finite population correction factor
• When the population size is less than 10 times the
estimated sample size, it is possible to use a finite
population correction factor (fpc).
• The finite population correction factor measures how
much extra precision we achieve when the sample size
becomes close to the population size.
• N is the size of the population and n is the size of
the sample.
• If fpc is close to 1, there is almost no effect.
• When fpc is much smaller than 1, sampling a
large fraction of the population has a real effect
on precision.
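
The formula itself does not appear in the slide text. The sketch below assumes the usual textbook definition, fpc = sqrt((N - n) / (N - 1)), together with the common adjustment n = n0 / (1 + (n0 - 1) / N) for shrinking an initial sample size n0. Both formulas, the function names, and the population sizes are assumptions made for illustration.

```python
import math


def fpc(N: int, n: int) -> float:
    """Finite population correction factor (assumed standard definition)."""
    return math.sqrt((N - n) / (N - 1))


def adjust_for_finite_population(n0: int, N: int) -> int:
    """Shrink an infinite-population sample size n0 for a population of size N."""
    return math.ceil(n0 / (1.0 + (n0 - 1) / N))


print(round(fpc(N=100_000, n=385), 3))   # close to 1: almost no effect
print(round(fpc(N=1_000, n=385), 3))     # well below 1: noticeable effect
print(adjust_for_finite_population(n0=385, N=1_000))
```

The hypothetical population of 1,000 is less than 10 times the initial sample of 385, which is the situation in which the slide suggests applying the correction.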

Independent case-control studies
• α = alpha (Type I error probability), β = 1 – power, ψ = odds ratio
• m = number of control subjects per case subject
• p0 = probability of exposure in controls; p0 can be estimated as the
population prevalence of exposure
• nc = continuity-corrected sample size
• Zp = standard normal deviate for probability p
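
The slide's own formula is not recoverable from the text. As a hedged sketch, the code below uses one widely used approach for unmatched case-control studies (the two-proportion formula usually attributed to Fleiss, with its continuity correction), which may differ from the formula on the original slide. The function name and the example inputs are hypothetical.

```python
import math
from statistics import NormalDist


def case_control_sample_size(p0, odds_ratio, m=1.0, alpha=0.05, power=0.80):
    """Return (n, n_c): number of cases without and with continuity correction.

    p0 is the exposure probability in controls, m the controls per case.
    """
    z_alpha = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    z_beta = NormalDist().inv_cdf(power)
    # Exposure probability in cases implied by p0 and the odds ratio.
    p1 = odds_ratio * p0 / (1.0 + p0 * (odds_ratio - 1.0))
    p_bar = (p1 + m * p0) / (1.0 + m)
    term_null = z_alpha * math.sqrt((1.0 + 1.0 / m) * p_bar * (1.0 - p_bar))
    term_alt = z_beta * math.sqrt(p1 * (1.0 - p1) + p0 * (1.0 - p0) / m)
    n = (term_null + term_alt) ** 2 / (p1 - p0) ** 2
    # Continuity correction applied to the uncorrected number of cases.
    n_c = n / 4.0 * (1.0 + math.sqrt(1.0 + 2.0 * (m + 1.0) / (n * m * abs(p1 - p0)))) ** 2
    return n, n_c


n, n_c = case_control_sample_size(p0=0.20, odds_ratio=2.0)
print(math.ceil(n), math.ceil(n_c))  # cases needed; controls = m times this
```

Results from a sketch like this are best cross-checked against one of the calculators listed at the end of the deck before being used in a protocol.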

Sample size for matched case-control
studies

Sample size for independent cohort
studies

Sample size for paired cohort studies

Sample size calculation for cross sectional
studies/surveys
For qualitative variable

Sample size calculation for cross
sectional studies/surveys
For quantitative variable

Case – control study
Qualitative variable

Sample size calculation for testing a hypothesis
(Clinical trials or clinical interventional studies)

Resource equation method
• It depends on the size of the whole experiment and
the number of treatment groups, not the individual
group sizes.
• E = total number of animals – number of treatment groups.
• If the value of E is less than 10, more animals should
be included; if it is more than 20, the sample size
should be decreased.
• The resource equation method is useful when there is
no previous estimate of the standard deviation.

Resource equation method example
• For example, if a factorial experiment is planned
with both sexes and three dose levels then there
will be six treatment groups. If it is proposed that
there should be eight animals in each treatment
group (as is common), there will be 48 animals in
total and E = 48 – 6 = 42. This experiment is
unnecessarily large.
• Redesigning it with four animals per group, E =
24 – 6 = 18, which is within the suggested limits of
10 – 20.
• A power analysis should be used in preference to
the resource equation method wherever possible.
• Unfortunately, power analysis is not so easy to use
when there are more than two groups because it
is more difficult (but not impossible) to specify
the effect size of interest.
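
The two designs in the example can be checked with a small sketch. It takes E as the total number of animals minus the number of treatment groups, as the arithmetic in the example implies; the function names and the wording of the verdicts are hypothetical.

```python
def resource_equation_e(animals_per_group: int, n_groups: int) -> int:
    """E = total number of animals minus number of treatment groups."""
    total_animals = animals_per_group * n_groups
    return total_animals - n_groups


def assess(e: int, low: int = 10, high: int = 20) -> str:
    """Compare E with the suggested limits of 10 to 20."""
    if e < low:
        return "too small"
    if e > high:
        return "too large"
    return "acceptable"


for per_group in (8, 4):
    e = resource_equation_e(per_group, n_groups=6)
    print(per_group, e, assess(e))
```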

What factors affect the power of a
test?
To increase the power of your test, you may do
any of the following:
1. Increase the effect size (the difference between
the null and alternative values) to be detected
2. Increase the sample size(s)
3. Decrease the variability in the sample(s)
4. Increase the significance level (alpha) of the test
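
The four factors can be illustrated with a rough sketch of approximate power for comparing two group means. It assumes a two-sided z-test with equal group sizes and a known common standard deviation, and it ignores the negligible probability of rejecting in the wrong direction. The function name and all numeric inputs are hypothetical.

```python
import math
from statistics import NormalDist


def approx_power(delta: float, sigma: float, n_per_group: int, alpha: float = 0.05) -> float:
    """Approximate power of a two-sided, two-sample z-test."""
    nd = NormalDist()
    z_alpha = nd.inv_cdf(1.0 - alpha / 2.0)
    standard_error = sigma * math.sqrt(2.0 / n_per_group)
    return nd.cdf(abs(delta) / standard_error - z_alpha)


base = approx_power(delta=5, sigma=10, n_per_group=30)
print("baseline           ", round(base, 2))
print("larger effect size ", round(approx_power(8, 10, 30), 2))
print("larger sample      ", round(approx_power(5, 10, 60), 2))
print("less variability   ", round(approx_power(5, 6, 30), 2))
print("larger alpha       ", round(approx_power(5, 10, 30, alpha=0.10), 2))
```

Each of the four changes raises the power relative to the baseline, although raising alpha does so at the cost of more Type I errors.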

Sample size calculation tools
Websites
http://statpages.info/
http://www.openepi.com/Menu/OE_Menu.htm

NCSS PASS
Sample size calculation for ANOVA
