SlideShare ist ein Scribd-Unternehmen logo
1 von 32
Downloaden Sie, um offline zu lesen
Power Analysis and Sample Size
        Determination




           AK Dhamija
Researchers differ
A researcher conducted a study comparing the
effect of an intervention vs placebo on reducing
body weight, and found 5 kg reduction among
the intervention group with P=0.01.

Another researcher conducted a similar study
comparing the effect of the same intervention vs
the same placebo on reducing body weight, and
found the same 5 kg reduction with the
intervention group but could not claim that the
intervention was effective because P=0.35.
Agenda
Power
Sample Size Calculations
Examples
Changes in the basic formulae
Flaws in Statements
Power is Effected by…..
Variation in the outcome (σ2)
  ↓ σ2    → power ↑
Significance level (α)
  ↑α      → power ↑
Difference (effect) to be detected (δ)
  ↑δ      → power ↑
One-tailed vs. two-tailed tests
  Power is greater in one-tailed tests than in
  comparable two-tailed tests
Power Changes
2n = 32, 2 sample test, 81% power, δ=2,
σ = 2, α = 0.05, 2-sided test
Variance/Standard deviation
  σ: 2 → 1 Power: 81% → 99.99%
  σ: 2 → 3 Power: 81% → 47%
Significance level (α)
  α : 0.05 → 0.01 Power: 81% → 69%
  α : 0.05 → 0.10 Power: 81% → 94%
Power Changes
2n = 32, 2 sample test, 81% power, δ=2, σ = 2,
α = 0.05, 2-sided test
Difference to be detected (δ)
  δ : 2 → 1 Power: 81% → 29%
  δ : 2 → 3 Power: 81% → 99%
Sample size (n)
  n: 32 → 64 Power: 81% → 98%
  n: 32 → 28 Power: 81% → 75%
One-tailed vs. two-tailed tests
  Power: 81% → 88%
Power Formula
Depends on study design
Not hard, but can be VERY algebra
intensive
May want to use a computer program or
statistician
How Big a Sample We Need?
Fundamental research question
Should be addressed after determining the primary
objective and study design
Too Few Patients in a clinical study
– May fail to detect a clinically important difference
Too Many
– Involve extra patients
– Therapy may have risks
– Cost more
How Big a Sample We Need?
Fundamentalresearch question
How Big?
 18
 180
 1,800
 18,000
 180,000
Sample Size Formula Information
Variables of interest
  type of data e.g. continuous, categorical
Desired power
Desired significance level
Effect/difference of clinical importance
Standard deviations of continuous outcome
variables
One or two-sided tests
Sample Size & Study Design
Randomized controlled trial (RCT)
Block/stratified-block randomized trial
Equivalence trial
Non-randomized intervention study
Observational study
Prevalence study
Measuring sensitivity and specificity
Sample Size & Data Structure
Paired data
Repeated measures
Groups of equal sizes
Hierarchical data
Sample Size
Non-randomized studies looking for differences or
associations
   require larger sample to allow adjustment for confounding factors
Absolute sample size is of interest
   surveys sometimes take % of population approach
Study’s primary outcome is the variable you do the sample
size calculation for
   If secondary outcome variables considered important make sure
   sample size is sufficient
Increase the ‘real’ sample size to reflect loss to follow up,
expected response rate, lack of compliance, etc.
   Make the link between the calculation and increase
Steps
Step 1. Define Primary Objective
      To see if feeding milk to 5 year old kids enhances
      growth.
Step 2. Study          Design
                        Extra Milk Diet

  5    yr olds
                       Normal Milk Diet
                   Outcome:    height (cm)
Step 3. Define clinically                 significant difference
  one wishes to detect
      Difference       (∆) of 0.5 cm
Steps
Step 4. Define degree of certainty of                    finding this
  difference
     beta (β) or type II error : The probability of      NOT detecting a
     significant       difference when there really is one.

     Risk of a false-negative finding ie Risk of declaring no significant
     difference in height between the milk diets when a difference
     really does exist.

     Set at ≤ 20%

     Power of the Test: Probability of detecting a predefined clinically
     significant difference.

     Power = (1- β) = 1 -20% = 80%
Steps
Step 5. Define significance level
  Alpha (α) or type I error: The probability of detecting a significant difference
  when the treatments are really equally effective

  Risk of a false-positive finding

  Set at 5% :
        One has a 5% chance or 1 in 20 odds of declaring a significant difference
        between the milk diets when in fact they are really equal.

        We are willing to accept that 1 time out of 20 we will produce a false
        positive finding
For the Milk Study
Type I error (α) = 0.05
Type II error (β) = 0.20
Power = (1- β) = 0.80
Clinically significant diff (∆) = 0.5cm
Measure of variation (SD) = 2.0 cm
– Exists in literature or “Guesstimate”
     Formula                                           Beta

N = 2(SD)2 x f(α, β)
                f(α                     Alpha   0.05   0.10   0.20   0.50
         ∆2                             0.10    10.8   8.6    6.2    2.7
= 2(2)2 x 7.9 / 0.52
                                        0.05    13.0   10.5   7.9    3.8

= 252.8 (each group)                    0.02    15.8   13.0   10.0   5.4

                                        0.01    17.8   14.9   11.7   6.6
Simple Method
   Nomogram
   Standardized difference
       = smallest medically relevant diff
        estimated standard deviation
       = 0.5/2.0 = 0.25
Assumptions:
   1. 2 sample comparison only
   2. Same number of subjects
      per group
   3. Variable is a continuous
       measure that is normally
      distributed


                                            500
1 sample test
    Study Objective : Study effect of new sleep aid
    Baseline to sleep time after taking the medication for one week
    Two-sided test, α = 0.05, power = 1-β = 90%
    Difference(δ) = 1 (4 hours of sleep to 5)
    Standard deviation(σ) = 2 hr
                       2   2                    2   2
    ( Z1   /2   Z1 )           (1.960 1.282) 2
n               2
                                                        42.04 43
                                      12
    Change δ from 1hr to 2 hr makes n goes from 43 to 11
                               2   2
      (1.960 1.282) 2
    n          2
                                        10.51 11
             2
1 sample test
Change power from 90% to 80% makes n goes from 11 to 8
(Small sample: start thinking about using the t distribution)
          (1.960 0.841) 2 22
        n                                 7.85 8
                 22
Change the standard deviation from 2 to 3 makes n goes from 8 to 18
                                  2 2
        (1.960 0.841) 3
      n          2
                                            17.65 18
               2
Sleep Aid Example: 2 Sample
      Original design (2-sided test, α = 0.05, 1-β = 90%, σ = 2hr, δ = 1 hr)
      Two sample randomized parallel design
      Needed 43 in the one-sample design
      In 2-sample need twice that, in each group!
      4 times as many people are needed in this design
     2( Z1   /2   Z1 ) 2   2
                               2(1.960 1.282)2 22
n                 2
                                                     84.1 85     170 total!
                                      12
             Change δ from 1hr to 2 hr makes n goes from 72 to 44
      2(1.960 1.282) 2 22
    n                                   21.02       22     44 total
              22
Sleep Aid Example: 2 Sample
    Change power from 90% to 80% makes n goes from 44 to 32
  2(1.960 0.841)2 22
n                                15.69 16           32 total
         22
Change the standard deviation from 2 to 3 makes n goes from 32 to 72

  2(1.960 0.841)2 32
n                               35.31 36          72 total
         22
Summary

   Changes in the detectable difference have
HUGE impacts on sample size
  20 point difference → 25 patients/group
  10 point difference → 100 patients/group
   5 point difference → 400 patients/group

Changes in α, β, σ, number of samples, if it is a 1-
or 2-sided test can all have a large impact on your
sample size calculation
Matched Pair Designs
Similar to 1-sample formula
Means (paired t-test)
  Mean difference from paired data
  Variance of differences
Proportions
  Based on discordant pairs
Difference in Proportion
Study Objective
To increase survival by 5% with a new cancer drug
P1 = % survival (std) = 85%
P2 = % survival (new) = 90%
Power = 90%

N = P1 (100 - P1) + P2 (100 - P2) x f (α, β) = 913.5 (each group)
             (P2 - P1)2

     = 1827 Total

A very large study has the power to demonstrate statistical
significance for very small, even clinically inconsequential
differences.
Changes in basic formulae
Unequal #s in Each Group
  Ratio of cases to controls
  Use if want λ patients randomized to the treatment arm for every patient randomized
  to the placebo arm

  Take no more than 4-5 controls/case

    n2         n1           controls for every case
                                    2      2       2
           ( Z1     /2     Z1 ) (         1        2   / )
    n1                              2
# of Covariates & # of Subjects
At least 10 subjects for every variable investigated
  In logistic regression
  No general justification
  This is stability, not power
  Peduzzi et al., (1985) biased regression coefficients and
  variance estimates

Principle component analysis (PCA) (Thorndike
1978 p 184): N≥10m+50 or even N ≥ m2 + 50
Balanced Designs: Easier
Equal numbers in two groups is the easiest
to handle
If you have more than two groups, still,
equal sample sizes easiest
Complicated design = simulations
  Done by the statistician
Multiple Comparisons
If you have 4 groups
  All 2 way comparisons of means
  6 different tests
Bonferroni: divide α by # of tests
  0.025/6 ≈ 0.0042
High-throughput laboratory tests
Flaws in Statements
"A previous study in this area recruited 150 subjects and found highly
significant results (p=0.014), and therefore a similar sample size should
be sufficient here."
   Previous studies may have been 'lucky' to find significant results, due to random
   sampling variation.
"Sample sizes are not provided because there is no prior information on
which to base them."
   Find previously published information
   Conduct small pre-study
   If a very preliminary pilot study, sample size calculations not usually necessary

No prior information on standard deviations
   Give the size of difference that may be detected in terms of number of standard
   deviations
Roadmap
1.   Do a sample size calculation before you start
     collecting data
2.   Collect data
3.   Perform statistical test : IF p value < 0.05, declare
     statistical significance
4.   Consider clinical significance by looking at the size of
     the difference
References

“Sample Size Estimation”, Phil Hahn Queen’s
University
”Sample Size and Power”, Laura Lee Johnson,
Ph.D., Statistician, National Center for
Complementary and Alternative Medicine
”Sample Size Estimation and Power Analysis”,
Ayumi Shintani, PhD, MPH Department of
Biostatistics, Vanderbilt University

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Blinding in clinical trilas
Blinding in clinical trilas Blinding in clinical trilas
Blinding in clinical trilas
 
P value, Power, Type 1 and 2 errors
P value, Power, Type 1 and 2 errorsP value, Power, Type 1 and 2 errors
P value, Power, Type 1 and 2 errors
 
Randomisation techniques
Randomisation techniquesRandomisation techniques
Randomisation techniques
 
PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,
 
The mann whitney u test
The mann whitney u testThe mann whitney u test
The mann whitney u test
 
Bias in clinical research
Bias in clinical research Bias in clinical research
Bias in clinical research
 
Non parametric test
Non parametric testNon parametric test
Non parametric test
 
Kruskal Wall Test
Kruskal Wall TestKruskal Wall Test
Kruskal Wall Test
 
Student t-test
Student t-testStudent t-test
Student t-test
 
Test of significance in Statistics
Test of significance in StatisticsTest of significance in Statistics
Test of significance in Statistics
 
Test of significance
Test of significanceTest of significance
Test of significance
 
Sample size calculation
Sample size calculationSample size calculation
Sample size calculation
 
Blinding Techniques
Blinding TechniquesBlinding Techniques
Blinding Techniques
 
Experimental study design
Experimental study designExperimental study design
Experimental study design
 
Confidence interval
Confidence intervalConfidence interval
Confidence interval
 
Odds ratio and confidence interval
Odds ratio and confidence intervalOdds ratio and confidence interval
Odds ratio and confidence interval
 
P value
P valueP value
P value
 
Parametric Test
Parametric TestParametric Test
Parametric Test
 
wilcoxon signed rank test
wilcoxon signed rank testwilcoxon signed rank test
wilcoxon signed rank test
 
Observational study design
Observational study designObservational study design
Observational study design
 

Andere mochten auch

Sample size
Sample sizeSample size
Sample sizezubis
 
determination of sample size
determination of sample sizedetermination of sample size
determination of sample sizeJijo Varghese
 
Sample size determination
Sample size determinationSample size determination
Sample size determinationGopal Kumar
 
7. sampling &amp; sample size determination ldr 280
7. sampling &amp; sample size determination ldr 2807. sampling &amp; sample size determination ldr 280
7. sampling &amp; sample size determination ldr 280shifa najam
 
Sample size calculation
Sample  size calculationSample  size calculation
Sample size calculationSwati Singh
 
Presentation on determination of size of sample (n)
Presentation on    determination of size of sample (n)Presentation on    determination of size of sample (n)
Presentation on determination of size of sample (n)Partnered Health
 
Low Power Design and Verification
Low Power Design and VerificationLow Power Design and Verification
Low Power Design and VerificationDVClub
 
Power Analysis for Beginners
Power Analysis for BeginnersPower Analysis for Beginners
Power Analysis for Beginnersgfb1
 
Statistical inference: Probability and Distribution
Statistical inference: Probability and DistributionStatistical inference: Probability and Distribution
Statistical inference: Probability and DistributionEugene Yan Ziyou
 
Statistical inference: Statistical Power, ANOVA, and Post Hoc tests
Statistical inference: Statistical Power, ANOVA, and Post Hoc testsStatistical inference: Statistical Power, ANOVA, and Post Hoc tests
Statistical inference: Statistical Power, ANOVA, and Post Hoc testsEugene Yan Ziyou
 
Sampling design, sampling errors, sample size determination
Sampling design, sampling errors, sample size determinationSampling design, sampling errors, sample size determination
Sampling design, sampling errors, sample size determinationVishnupriya T H
 
RESEARCH METHOD - SAMPLING
RESEARCH METHOD - SAMPLINGRESEARCH METHOD - SAMPLING
RESEARCH METHOD - SAMPLINGHafizah Hajimia
 
Hypothesis testing; z test, t-test. f-test
Hypothesis testing; z test, t-test. f-testHypothesis testing; z test, t-test. f-test
Hypothesis testing; z test, t-test. f-testShakehand with Life
 
Sampling and sample size determination
Sampling and sample size determinationSampling and sample size determination
Sampling and sample size determinationRemas Mohamed
 

Andere mochten auch (20)

Sample size
Sample sizeSample size
Sample size
 
determination of sample size
determination of sample sizedetermination of sample size
determination of sample size
 
Sample size calculation
Sample size calculationSample size calculation
Sample size calculation
 
Sample size determination
Sample size determinationSample size determination
Sample size determination
 
Determining the Sample Size
Determining the Sample SizeDetermining the Sample Size
Determining the Sample Size
 
7. sampling &amp; sample size determination ldr 280
7. sampling &amp; sample size determination ldr 2807. sampling &amp; sample size determination ldr 280
7. sampling &amp; sample size determination ldr 280
 
Sample size calculation
Sample  size calculationSample  size calculation
Sample size calculation
 
Presentation on determination of size of sample (n)
Presentation on    determination of size of sample (n)Presentation on    determination of size of sample (n)
Presentation on determination of size of sample (n)
 
Low Power Design and Verification
Low Power Design and VerificationLow Power Design and Verification
Low Power Design and Verification
 
Power Analysis for Beginners
Power Analysis for BeginnersPower Analysis for Beginners
Power Analysis for Beginners
 
Sample Size Determination
Sample Size Determination Sample Size Determination
Sample Size Determination
 
Statistical inference: Probability and Distribution
Statistical inference: Probability and DistributionStatistical inference: Probability and Distribution
Statistical inference: Probability and Distribution
 
Statistical inference: Statistical Power, ANOVA, and Post Hoc tests
Statistical inference: Statistical Power, ANOVA, and Post Hoc testsStatistical inference: Statistical Power, ANOVA, and Post Hoc tests
Statistical inference: Statistical Power, ANOVA, and Post Hoc tests
 
Sampling design, sampling errors, sample size determination
Sampling design, sampling errors, sample size determinationSampling design, sampling errors, sample size determination
Sampling design, sampling errors, sample size determination
 
Student's T-Test
Student's T-TestStudent's T-Test
Student's T-Test
 
RESEARCH METHOD - SAMPLING
RESEARCH METHOD - SAMPLINGRESEARCH METHOD - SAMPLING
RESEARCH METHOD - SAMPLING
 
Hypothesis testing; z test, t-test. f-test
Hypothesis testing; z test, t-test. f-testHypothesis testing; z test, t-test. f-test
Hypothesis testing; z test, t-test. f-test
 
Sampling and sample size determination
Sampling and sample size determinationSampling and sample size determination
Sampling and sample size determination
 
Validity and Screening Test
Validity and Screening TestValidity and Screening Test
Validity and Screening Test
 
Statistical analysis definitions
Statistical analysis definitionsStatistical analysis definitions
Statistical analysis definitions
 

Ähnlich wie Power Analysis and Sample Size Determination

การสุ่มตัวอย่างในงานวิจัยสาธารณสุข
การสุ่มตัวอย่างในงานวิจัยสาธารณสุขการสุ่มตัวอย่างในงานวิจัยสาธารณสุข
การสุ่มตัวอย่างในงานวิจัยสาธารณสุขUltraman Taro
 
Stat 3203 -multphase sampling
Stat 3203 -multphase samplingStat 3203 -multphase sampling
Stat 3203 -multphase samplingKhulna University
 
Determination of sample size in scientific research.pptx
Determination of sample size in scientific research.pptxDetermination of sample size in scientific research.pptx
Determination of sample size in scientific research.pptxSam Edeson
 
Sample size in general
Sample size in generalSample size in general
Sample size in generalMmedsc Hahm
 
ANOVA Lec 1 (alternate).pptx
ANOVA Lec 1 (alternate).pptxANOVA Lec 1 (alternate).pptx
ANOVA Lec 1 (alternate).pptxMohsinIqbalQazi
 
10 ch ken black solution
10 ch ken black solution10 ch ken black solution
10 ch ken black solutionKrunal Shah
 
Hypothesis Test _Two-sample t-test, Z-test, Proportion Z-test
Hypothesis Test _Two-sample t-test, Z-test, Proportion Z-testHypothesis Test _Two-sample t-test, Z-test, Proportion Z-test
Hypothesis Test _Two-sample t-test, Z-test, Proportion Z-testRavindra Nath Shukla
 
Webinar slides how to reduce sample size ethically and responsibly
Webinar slides   how to reduce sample size ethically and responsiblyWebinar slides   how to reduce sample size ethically and responsibly
Webinar slides how to reduce sample size ethically and responsiblynQuery
 
The stepped wedge cluster randomised trial workshop: session 3
The stepped wedge cluster randomised trial workshop: session 3The stepped wedge cluster randomised trial workshop: session 3
The stepped wedge cluster randomised trial workshop: session 3Karla hemming
 
Research and reporting methods for the stepped wedge cluster randomized trial...
Research and reporting methods for the stepped wedge cluster randomized trial...Research and reporting methods for the stepped wedge cluster randomized trial...
Research and reporting methods for the stepped wedge cluster randomized trial...NIHR CLAHRC West Midlands
 
Approximate ANCOVA
Approximate ANCOVAApproximate ANCOVA
Approximate ANCOVAStephen Senn
 

Ähnlich wie Power Analysis and Sample Size Determination (20)

การสุ่มตัวอย่างในงานวิจัยสาธารณสุข
การสุ่มตัวอย่างในงานวิจัยสาธารณสุขการสุ่มตัวอย่างในงานวิจัยสาธารณสุข
การสุ่มตัวอย่างในงานวิจัยสาธารณสุข
 
Stat5 the t test
Stat5 the t testStat5 the t test
Stat5 the t test
 
Stat5 the t test
Stat5 the t testStat5 the t test
Stat5 the t test
 
Student t t est
Student t t estStudent t t est
Student t t est
 
Stat 3203 -multphase sampling
Stat 3203 -multphase samplingStat 3203 -multphase sampling
Stat 3203 -multphase sampling
 
Determination of sample size in scientific research.pptx
Determination of sample size in scientific research.pptxDetermination of sample size in scientific research.pptx
Determination of sample size in scientific research.pptx
 
Sample size in general
Sample size in generalSample size in general
Sample size in general
 
Chapter07.pdf
Chapter07.pdfChapter07.pdf
Chapter07.pdf
 
Two Means, Independent Samples
Two Means, Independent SamplesTwo Means, Independent Samples
Two Means, Independent Samples
 
ANOVA Lec 1 (alternate).pptx
ANOVA Lec 1 (alternate).pptxANOVA Lec 1 (alternate).pptx
ANOVA Lec 1 (alternate).pptx
 
10 ch ken black solution
10 ch ken black solution10 ch ken black solution
10 ch ken black solution
 
Hypothesis Test _Two-sample t-test, Z-test, Proportion Z-test
Hypothesis Test _Two-sample t-test, Z-test, Proportion Z-testHypothesis Test _Two-sample t-test, Z-test, Proportion Z-test
Hypothesis Test _Two-sample t-test, Z-test, Proportion Z-test
 
Cairo 02 Stat Inference
Cairo 02 Stat InferenceCairo 02 Stat Inference
Cairo 02 Stat Inference
 
Two dependent samples (matched pairs)
Two dependent samples (matched pairs) Two dependent samples (matched pairs)
Two dependent samples (matched pairs)
 
Webinar slides how to reduce sample size ethically and responsibly
Webinar slides   how to reduce sample size ethically and responsiblyWebinar slides   how to reduce sample size ethically and responsibly
Webinar slides how to reduce sample size ethically and responsibly
 
The stepped wedge cluster randomised trial workshop: session 3
The stepped wedge cluster randomised trial workshop: session 3The stepped wedge cluster randomised trial workshop: session 3
The stepped wedge cluster randomised trial workshop: session 3
 
Research and reporting methods for the stepped wedge cluster randomized trial...
Research and reporting methods for the stepped wedge cluster randomized trial...Research and reporting methods for the stepped wedge cluster randomized trial...
Research and reporting methods for the stepped wedge cluster randomized trial...
 
T test statistics
T test statisticsT test statistics
T test statistics
 
Design of experiments(
Design of experiments(Design of experiments(
Design of experiments(
 
Approximate ANCOVA
Approximate ANCOVAApproximate ANCOVA
Approximate ANCOVA
 

Mehr von Ajay Dhamija

fm3-05-301 (copy).pdf
fm3-05-301 (copy).pdffm3-05-301 (copy).pdf
fm3-05-301 (copy).pdfAjay Dhamija
 
Ethical hacking & Information Security
Ethical hacking & Information SecurityEthical hacking & Information Security
Ethical hacking & Information SecurityAjay Dhamija
 
Karmarkar's Algorithm For Linear Programming Problem
Karmarkar's Algorithm For Linear Programming ProblemKarmarkar's Algorithm For Linear Programming Problem
Karmarkar's Algorithm For Linear Programming ProblemAjay Dhamija
 
Verizon - A Case Study
Verizon - A Case StudyVerizon - A Case Study
Verizon - A Case StudyAjay Dhamija
 
Dabur India Ltd - A Case Study
Dabur India Ltd  - A Case StudyDabur India Ltd  - A Case Study
Dabur India Ltd - A Case StudyAjay Dhamija
 
Non Banking Financial Company
Non Banking Financial CompanyNon Banking Financial Company
Non Banking Financial CompanyAjay Dhamija
 
The Financial Sector Reforms in India
The Financial Sector Reforms in IndiaThe Financial Sector Reforms in India
The Financial Sector Reforms in IndiaAjay Dhamija
 
Hosting Inviting Introduction Guest Relations
Hosting Inviting Introduction Guest RelationsHosting Inviting Introduction Guest Relations
Hosting Inviting Introduction Guest RelationsAjay Dhamija
 
Global Fiancial Meltdown of 2007
Global Fiancial Meltdown of 2007Global Fiancial Meltdown of 2007
Global Fiancial Meltdown of 2007Ajay Dhamija
 
IRT - Item response Theory
IRT - Item response TheoryIRT - Item response Theory
IRT - Item response TheoryAjay Dhamija
 
Goody Research - Research Methods Flaws
Goody Research - Research Methods FlawsGoody Research - Research Methods Flaws
Goody Research - Research Methods FlawsAjay Dhamija
 
Randomization Tests
Randomization Tests Randomization Tests
Randomization Tests Ajay Dhamija
 

Mehr von Ajay Dhamija (15)

fm3-05-301 (copy).pdf
fm3-05-301 (copy).pdffm3-05-301 (copy).pdf
fm3-05-301 (copy).pdf
 
Carbon Finance
Carbon FinanceCarbon Finance
Carbon Finance
 
Ethical hacking & Information Security
Ethical hacking & Information SecurityEthical hacking & Information Security
Ethical hacking & Information Security
 
Karmarkar's Algorithm For Linear Programming Problem
Karmarkar's Algorithm For Linear Programming ProblemKarmarkar's Algorithm For Linear Programming Problem
Karmarkar's Algorithm For Linear Programming Problem
 
Verizon - A Case Study
Verizon - A Case StudyVerizon - A Case Study
Verizon - A Case Study
 
Dabur India Ltd - A Case Study
Dabur India Ltd  - A Case StudyDabur India Ltd  - A Case Study
Dabur India Ltd - A Case Study
 
Non Banking Financial Company
Non Banking Financial CompanyNon Banking Financial Company
Non Banking Financial Company
 
The Financial Sector Reforms in India
The Financial Sector Reforms in IndiaThe Financial Sector Reforms in India
The Financial Sector Reforms in India
 
Hosting Inviting Introduction Guest Relations
Hosting Inviting Introduction Guest RelationsHosting Inviting Introduction Guest Relations
Hosting Inviting Introduction Guest Relations
 
TRIZ
TRIZ TRIZ
TRIZ
 
Global Fiancial Meltdown of 2007
Global Fiancial Meltdown of 2007Global Fiancial Meltdown of 2007
Global Fiancial Meltdown of 2007
 
IRT - Item response Theory
IRT - Item response TheoryIRT - Item response Theory
IRT - Item response Theory
 
Goody Research - Research Methods Flaws
Goody Research - Research Methods FlawsGoody Research - Research Methods Flaws
Goody Research - Research Methods Flaws
 
Randomization Tests
Randomization Tests Randomization Tests
Randomization Tests
 
Research Design
Research DesignResearch Design
Research Design
 

Kürzlich hochgeladen

Organizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessOrganizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessSeta Wicaksana
 
Memorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMMemorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMVoces Mineras
 
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu MenzaYouth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menzaictsugar
 
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCRashishs7044
 
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfIntro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfpollardmorgan
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCRashishs7044
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyotictsugar
 
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCRashishs7044
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africaictsugar
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfRbc Rbcua
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Seta Wicaksana
 
Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailAriel592675
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchirictsugar
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607dollysharma2066
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesKeppelCorporation
 
India Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportIndia Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportMintel Group
 
IoT Insurance Observatory: summary 2024
IoT Insurance Observatory:  summary 2024IoT Insurance Observatory:  summary 2024
IoT Insurance Observatory: summary 2024Matteo Carbone
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMintel Group
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdfKhaled Al Awadi
 

Kürzlich hochgeladen (20)

Organizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessOrganizational Structure Running A Successful Business
Organizational Structure Running A Successful Business
 
Memorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQMMemorándum de Entendimiento (MoU) entre Codelco y SQM
Memorándum de Entendimiento (MoU) entre Codelco y SQM
 
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu MenzaYouth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
 
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
 
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfIntro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyot
 
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
8447779800, Low rate Call girls in New Ashok Nagar Delhi NCR
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africa
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdf
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...
 
Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detail
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchir
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation Slides
 
India Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportIndia Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample Report
 
IoT Insurance Observatory: summary 2024
IoT Insurance Observatory:  summary 2024IoT Insurance Observatory:  summary 2024
IoT Insurance Observatory: summary 2024
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 Edition
 
Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdfNewBase  19 April  2024  Energy News issue - 1717 by Khaled Al Awadi.pdf
NewBase 19 April 2024 Energy News issue - 1717 by Khaled Al Awadi.pdf
 

Power Analysis and Sample Size Determination

  • 1. Power Analysis and Sample Size Determination AK Dhamija
  • 2. Researchers differ A researcher conducted a study comparing the effect of an intervention vs placebo on reducing body weight, and found 5 kg reduction among the intervention group with P=0.01. Another researcher conducted a similar study comparing the effect of the same intervention vs the same placebo on reducing body weight, and found the same 5 kg reduction with the intervention group but could not claim that the intervention was effective because P=0.35.
  • 3. Agenda Power Sample Size Calculations Examples Changes in the basic formulae Flaws in Statements
  • 4. Power is Effected by….. Variation in the outcome (σ2) ↓ σ2 → power ↑ Significance level (α) ↑α → power ↑ Difference (effect) to be detected (δ) ↑δ → power ↑ One-tailed vs. two-tailed tests Power is greater in one-tailed tests than in comparable two-tailed tests
  • 5. Power Changes 2n = 32, 2 sample test, 81% power, δ=2, σ = 2, α = 0.05, 2-sided test Variance/Standard deviation σ: 2 → 1 Power: 81% → 99.99% σ: 2 → 3 Power: 81% → 47% Significance level (α) α : 0.05 → 0.01 Power: 81% → 69% α : 0.05 → 0.10 Power: 81% → 94%
  • 6. Power Changes 2n = 32, 2 sample test, 81% power, δ=2, σ = 2, α = 0.05, 2-sided test Difference to be detected (δ) δ : 2 → 1 Power: 81% → 29% δ : 2 → 3 Power: 81% → 99% Sample size (n) n: 32 → 64 Power: 81% → 98% n: 32 → 28 Power: 81% → 75% One-tailed vs. two-tailed tests Power: 81% → 88%
  • 7. Power Formula Depends on study design Not hard, but can be VERY algebra intensive May want to use a computer program or statistician
  • 8. How Big a Sample We Need? Fundamental research question Should be addressed after determining the primary objective and study design Too Few Patients in a clinical study – May fail to detect a clinically important difference Too Many – Involve extra patients – Therapy may have risks – Cost more
  • 9. How Big a Sample We Need? Fundamentalresearch question How Big? 18 180 1,800 18,000 180,000
  • 10. Sample Size Formula Information Variables of interest type of data e.g. continuous, categorical Desired power Desired significance level Effect/difference of clinical importance Standard deviations of continuous outcome variables One or two-sided tests
  • 11. Sample Size & Study Design Randomized controlled trial (RCT) Block/stratified-block randomized trial Equivalence trial Non-randomized intervention study Observational study Prevalence study Measuring sensitivity and specificity
  • 12. Sample Size & Data Structure Paired data Repeated measures Groups of equal sizes Hierarchical data
  • 13. Sample Size Non-randomized studies looking for differences or associations require larger sample to allow adjustment for confounding factors Absolute sample size is of interest surveys sometimes take % of population approach Study’s primary outcome is the variable you do the sample size calculation for If secondary outcome variables considered important make sure sample size is sufficient Increase the ‘real’ sample size to reflect loss to follow up, expected response rate, lack of compliance, etc. Make the link between the calculation and increase
  • 14. Steps Step 1. Define Primary Objective To see if feeding milk to 5 year old kids enhances growth. Step 2. Study Design Extra Milk Diet 5 yr olds Normal Milk Diet Outcome: height (cm) Step 3. Define clinically significant difference one wishes to detect Difference (∆) of 0.5 cm
  • 15. Steps Step 4. Define degree of certainty of finding this difference beta (β) or type II error : The probability of NOT detecting a significant difference when there really is one. Risk of a false-negative finding ie Risk of declaring no significant difference in height between the milk diets when a difference really does exist. Set at ≤ 20% Power of the Test: Probability of detecting a predefined clinically significant difference. Power = (1- β) = 1 -20% = 80%
  • 16. Steps Step 5. Define significance level Alpha (α) or type I error: The probability of detecting a significant difference when the treatments are really equally effective Risk of a false-positive finding Set at 5% : One has a 5% chance or 1 in 20 odds of declaring a significant difference between the milk diets when in fact they are really equal. We are willing to accept that 1 time out of 20 we will produce a false positive finding
  • 17. For the Milk Study Type I error (α) = 0.05 Type II error (β) = 0.20 Power = (1- β) = 0.80 Clinically significant diff (∆) = 0.5cm Measure of variation (SD) = 2.0 cm – Exists in literature or “Guesstimate” Formula Beta N = 2(SD)2 x f(α, β) f(α Alpha 0.05 0.10 0.20 0.50 ∆2 0.10 10.8 8.6 6.2 2.7 = 2(2)2 x 7.9 / 0.52 0.05 13.0 10.5 7.9 3.8 = 252.8 (each group) 0.02 15.8 13.0 10.0 5.4 0.01 17.8 14.9 11.7 6.6
  • 18. Simple Method Nomogram Standardized difference = smallest medically relevant diff estimated standard deviation = 0.5/2.0 = 0.25 Assumptions: 1. 2 sample comparison only 2. Same number of subjects per group 3. Variable is a continuous measure that is normally distributed 500
  • 19. 1 sample test Study Objective : Study effect of new sleep aid Baseline to sleep time after taking the medication for one week Two-sided test, α = 0.05, power = 1-β = 90% Difference(δ) = 1 (4 hours of sleep to 5) Standard deviation(σ) = 2 hr 2 2 2 2 ( Z1 /2 Z1 ) (1.960 1.282) 2 n 2 42.04 43 12 Change δ from 1hr to 2 hr makes n goes from 43 to 11 2 2 (1.960 1.282) 2 n 2 10.51 11 2
  • 20. 1 sample test Change power from 90% to 80% makes n goes from 11 to 8 (Small sample: start thinking about using the t distribution) (1.960 0.841) 2 22 n 7.85 8 22 Change the standard deviation from 2 to 3 makes n goes from 8 to 18 2 2 (1.960 0.841) 3 n 2 17.65 18 2
  • 21. Sleep Aid Example: 2 Sample Original design (2-sided test, α = 0.05, 1-β = 90%, σ = 2hr, δ = 1 hr) Two sample randomized parallel design Needed 43 in the one-sample design In 2-sample need twice that, in each group! 4 times as many people are needed in this design 2( Z1 /2 Z1 ) 2 2 2(1.960 1.282)2 22 n 2 84.1 85 170 total! 12 Change δ from 1hr to 2 hr makes n goes from 72 to 44 2(1.960 1.282) 2 22 n 21.02 22 44 total 22
  • 22. Sleep Aid Example: 2 Sample Change power from 90% to 80% makes n goes from 44 to 32 2(1.960 0.841)2 22 n 15.69 16 32 total 22 Change the standard deviation from 2 to 3 makes n goes from 32 to 72 2(1.960 0.841)2 32 n 35.31 36 72 total 22
  • 23. Summary Changes in the detectable difference have HUGE impacts on sample size 20 point difference → 25 patients/group 10 point difference → 100 patients/group 5 point difference → 400 patients/group Changes in α, β, σ, number of samples, if it is a 1- or 2-sided test can all have a large impact on your sample size calculation
  • 24. Matched Pair Designs Similar to 1-sample formula Means (paired t-test) Mean difference from paired data Variance of differences Proportions Based on discordant pairs
  • 25. Difference in Proportion Study Objective To increase survival by 5% with a new cancer drug P1 = % survival (std) = 85% P2 = % survival (new) = 90% Power = 90% N = P1 (100 - P1) + P2 (100 - P2) x f (α, β) = 913.5 (each group) (P2 - P1)2 = 1827 Total A very large study has the power to demonstrate statistical significance for very small, even clinically inconsequential differences.
  • 26. Changes in basic formulae Unequal #s in Each Group Ratio of cases to controls Use if want λ patients randomized to the treatment arm for every patient randomized to the placebo arm Take no more than 4-5 controls/case n2 n1 controls for every case 2 2 2 ( Z1 /2 Z1 ) ( 1 2 / ) n1 2
  • 27. # of Covariates & # of Subjects At least 10 subjects for every variable investigated In logistic regression No general justification This is stability, not power Peduzzi et al., (1985) biased regression coefficients and variance estimates Principle component analysis (PCA) (Thorndike 1978 p 184): N≥10m+50 or even N ≥ m2 + 50
  • 28. Balanced Designs: Easier Equal numbers in two groups is the easiest to handle If you have more than two groups, still, equal sample sizes easiest Complicated design = simulations Done by the statistician
  • 29. Multiple Comparisons If you have 4 groups All 2 way comparisons of means 6 different tests Bonferroni: divide α by # of tests 0.025/6 ≈ 0.0042 High-throughput laboratory tests
  • 30. Flaws in Statements "A previous study in this area recruited 150 subjects and found highly significant results (p=0.014), and therefore a similar sample size should be sufficient here." Previous studies may have been 'lucky' to find significant results, due to random sampling variation. "Sample sizes are not provided because there is no prior information on which to base them." Find previously published information Conduct small pre-study If a very preliminary pilot study, sample size calculations not usually necessary No prior information on standard deviations Give the size of difference that may be detected in terms of number of standard deviations
  • 31. Roadmap 1. Do a sample size calculation before you start collecting data 2. Collect data 3. Perform statistical test : IF p value < 0.05, declare statistical significance 4. Consider clinical significance by looking at the size of the difference
  • 32. References “Sample Size Estimation”, Phil Hahn Queen’s University ”Sample Size and Power”, Laura Lee Johnson, Ph.D., Statistician, National Center for Complementary and Alternative Medicine ”Sample Size Estimation and Power Analysis”, Ayumi Shintani, PhD, MPH Department of Biostatistics, Vanderbilt University