SlideShare a Scribd company logo
1 of 19
Download to read offline
Central Limit Theorem( )
for 14.310x students.
Ryosuke ISHII
(ryouen)
About author
• Ryosuke ISHII (call me ryo / ryouen)
• From Tokyo, Japan
• Graduated from The University of Tokyo.
• Current: Researcher, Grad School of
System Design and Management, Keio Univ.
• Enjoining MITx 14.310x and learn from a lot.
• Also MITx 14.100x (Microeconomics) and
HarvardX PHP525.x (Statistics) on edX.
According to CLT,
When the population is following 𝜇(population mean) and 𝜎2(population variance),
we took some sample and the sample size = 𝑛,
This 𝑛 means how many items in the group. It is different from “the number of samples”
If we took many samples repeatedly, we can calculate each of sample’s mean (this is
sample mean ഥ𝑥𝑖) and the sample mean is also a random variable. And the sample mean
follows: ҧ𝑥 ~ N(𝜇,
𝜎2
𝑛
)
↑
𝜎
𝜇
𝑠 𝑥 =
𝜎
𝑛
Sample size is different from the number of samples.
If we compare 10 males and 15 females
The sample size of the male group is 10.
The sample size of the female group is 15.
The number of samples (or the number of groups) is 2.
The number of samples and the sample size can potentially be
confusing. Sample size is the number of items within a group. Number
of samples is the number of groups.”
*Metin Çakanyıldırım,
Computing the Standard Deviation of Sample Means
(if you wish, you can simulate with the R code below)
x <- rnorm(3300, mean=27.6,sd=sqrt(28.3))
n=10 #sample size
N=1000 #the number of trials
set.seed(1)
ys <- vector("numeric",N)
ysmean <- vector("numeric",N)
ysvar <- vector("numeric",N)
yssd <- vector("numeric",N)
yalldata <- vector("numeric",0)
for(i in 1:N){
ys <- sample(x, n)
ysmean[i] = mean(ys,na.rm = TRUE)
ysvar[i]= var(ys,na.rm = TRUE)
yssd[i] = sd(ys,na.rm = TRUE)
yalldata = c(yalldata,ys)
}
In order to understand deeper, this time assume that we know the TRUE
population parameter N(𝜇, 𝜎2
).
TRUE Parameter
mean 𝜇 = 27.6
variation 𝜎2 = 28.3
SD 𝜎 = 5.31
(This number is only for example)
↑
𝜎
𝜇
Set up
From a population following N 𝜇, 𝜎2
𝑛 = 10
Let us try sampling the first time!
And we set the sample size n=10
𝑥1 34 31 25 28 26 NA 25 20 27 25
②
𝜇
𝜎
We repeat it 6 times. It means we have 6 groups of samples and the sample
size of each group is 10
These 6 samples are different because each of sampling is an random
sampling.
But the result is not perfectly random because it is taken from a population
distribution.
So, we can say ”data is a representation of random variable gain from
sampling.”*
𝑥2 = 25.4𝑥1 = 26.8 𝑥4 = 27.6𝑥3 = 27.5 𝑥6 = 26.9𝑥5 = 27.6
And also, we can calculate each of samples’ mean.
You can see the sample mean is also a random variable.
How to calculate the sample mean? Yes, we must know.
𝑥1 34 31 25 28 26 NA 25 20 27 25 𝑥1 =26.8
𝑥2 20 NA 22 25 NA 24 21 29 39 23 𝑥2 =25.4
𝑥3 19 16 24 29 42 27 41 21 34 22 𝑥3 =27.5
𝑥4 24 35 24 25 28 20 26 38 28 28 𝑥4 =27.6
𝑥5 27 26 28 31 23 24 NA 34 30 26 𝑥5 =27.7
𝑥6 25 26 24 28 29 NA 28 26 21 35 𝑥6 =26.9
How do you think if we take more sample?
For example, we take 200 samples, and calc sample mean.
We can plot a histogram of𝑥1~𝑥200
There are 200 averages (of samples) and each of the average is random
variable.
Next, we would like to
calculate the distribution’s (this histogram’s)
-mean of sample means ( ҧ𝑥)
-variation of sample means (𝑉𝑥)
-standard deviation of sample means (𝑠 𝑥)
We can calculate it by definition. (I used R to calculate)
mean of sample means ( ҧ𝑥)
ҧ𝑥 =
1
𝑛
෍
𝑖=1
𝑛
ഥ𝑥𝑖 =
ഥ𝑥1 + ഥ𝑥2 + ⋯ + 𝑥199 + 𝑥200
200
= 27.541
variation of sample means
𝑉𝑥 =
1
𝑛 − 1
෍
𝑖=𝑖
𝑛
ഥ𝑥𝑖 − ҧ𝑥 2
=
𝑥1 − ҧ𝑥 2
+ 𝑥2 − ҧ𝑥 2
+ ⋯ 𝑥200 − ҧ𝑥 2
200 − 1
= 2.595608
standard deviation of sample means
𝑠 𝑥 = 𝑉𝑥 = 2.595608 = 1.611089
We can plot a Normal distribution using the result of the calculation on a
histogram we draw before.
↑
Mean ҧ𝑥 = 27.5
𝑁 ҧ𝑥, 𝑉𝑥 =
𝑁(27.5,2.6)
SD:
𝑠 𝑥 = 1.6
Let’s compare these distributions: population and sample means
↑
Mean ҧ𝑥 = 27.5
𝑁 ҧ𝑥, 𝑉𝑥 =
𝑁(27.5,2.6) 𝑆𝐷
𝑠 𝑥 = 1.6
↑
𝜎 = 5.3
Population mean 𝜇 = 27.6
𝑁 𝜇, 𝜎2 =
𝑁(27.6,28.3)
Remember,
First of all, we have a population distribution showing left.
We took randomly pick up samples 200 times and the number of items within
the each trial are n=10.
And we calculated each samples’ mean and the distribution of the 200 sample
means is showing right.
To compare, we can integrate these graphs.
What do you realize?
We know now…
The population mean is nearly samples’ mean.
The samples’ variation is smaller than population’s.
↑
𝜎
𝜇
Central Limit Theorem : CLT
From a distribution
that have 𝝁 𝒂𝒏𝒅 𝝈 𝟐
(it must NOT be
following normal)
We repeatedly try to
take a many samples and
the sample size is n.
The distribution of “means
of samples” are distributed
and it follows
𝑁 𝜇,
𝜎2
𝑛
↑
𝜇 = ҧ𝑥
𝑠 𝑥 =
𝜎
𝑛
Also, we call
𝜎
𝑛
as Standard Error of the mean ഥ𝑥𝑖 SE
Numerically examine it!
The goal is to show 𝜇 = 𝑥 and 𝑠 𝑥 =
𝜎
𝑛
↑
ҧ𝑥 = 27.5
𝑁 ҧ𝑥, 𝑉𝑥 =
𝑁(27.5,2.6) SE=𝑠 𝑥 = 1.61
↑
𝜎 = 5.3
𝜇 = 27.6
𝑁 𝜇, 𝜎2 =
𝑁(27.6,28.3)
𝜇 = 27.6 ≅ ҧ𝑥 = 27.5
𝜎
𝑛
= 𝑆𝐸 =
5.3
10
=
5.3
3.16277
= 1.68 ≅ 𝑠 𝑥(𝑆𝐸) = 1.61
Almost
Same!
True value we already
know
Theoretically calculate using true value Derived from R trial
n=2
n=5
n=10
𝑥1 34 31 25 28 26
𝑥2 20 NA 22 25 NA
⋮ 19 16 24 29 42
𝑥1000 24 35 24 25 28
𝑥1 34 31
𝑥2 20 NA
⋮ 27 26
𝑥1000 25 26
n is here
If we change sample size n (and fix the number of trial)

More Related Content

What's hot

Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceLong Beach City College
 
Berlin Epi Methods Colluquium may 2021
Berlin Epi Methods Colluquium may 2021Berlin Epi Methods Colluquium may 2021
Berlin Epi Methods Colluquium may 2021Ellie Murray
 
Lecture slides stats1.13.l09.air
Lecture slides stats1.13.l09.airLecture slides stats1.13.l09.air
Lecture slides stats1.13.l09.airatutor_te
 
Descriptive Statistics - Thiyagu K
Descriptive Statistics - Thiyagu KDescriptive Statistics - Thiyagu K
Descriptive Statistics - Thiyagu KThiyagu K
 
Stat 3203 -multphase sampling
Stat 3203 -multphase samplingStat 3203 -multphase sampling
Stat 3203 -multphase samplingKhulna University
 
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of DataPG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of DataAashish Patel
 
PG STAT 531 Lecture 2 Descriptive statistics
PG STAT 531 Lecture 2 Descriptive statisticsPG STAT 531 Lecture 2 Descriptive statistics
PG STAT 531 Lecture 2 Descriptive statisticsAashish Patel
 
Chap01 describing data; graphical
Chap01 describing data;  graphicalChap01 describing data;  graphical
Chap01 describing data; graphicalJudianto Nugroho
 

What's hot (20)

Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Berlin Epi Methods Colluquium may 2021
Berlin Epi Methods Colluquium may 2021Berlin Epi Methods Colluquium may 2021
Berlin Epi Methods Colluquium may 2021
 
Lecture slides stats1.13.l09.air
Lecture slides stats1.13.l09.airLecture slides stats1.13.l09.air
Lecture slides stats1.13.l09.air
 
Stat sample test ch 11
Stat sample test ch 11Stat sample test ch 11
Stat sample test ch 11
 
Central tendency
Central tendencyCentral tendency
Central tendency
 
Descriptive Statistics - Thiyagu K
Descriptive Statistics - Thiyagu KDescriptive Statistics - Thiyagu K
Descriptive Statistics - Thiyagu K
 
Stat 3203 -multphase sampling
Stat 3203 -multphase samplingStat 3203 -multphase sampling
Stat 3203 -multphase sampling
 
How to describe things
How to describe thingsHow to describe things
How to describe things
 
The Standard Normal Distribution
The Standard Normal DistributionThe Standard Normal Distribution
The Standard Normal Distribution
 
Chapter15
Chapter15Chapter15
Chapter15
 
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of DataPG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
PG STAT 531 Lecture 3 Graphical and Diagrammatic Representation of Data
 
Probability sampling
Probability samplingProbability sampling
Probability sampling
 
Kruskal-Wallis H test
Kruskal-Wallis H testKruskal-Wallis H test
Kruskal-Wallis H test
 
Chapter08
Chapter08Chapter08
Chapter08
 
Unit 5 Correlation
Unit 5 CorrelationUnit 5 Correlation
Unit 5 Correlation
 
PG STAT 531 Lecture 2 Descriptive statistics
PG STAT 531 Lecture 2 Descriptive statisticsPG STAT 531 Lecture 2 Descriptive statistics
PG STAT 531 Lecture 2 Descriptive statistics
 
Assessing Normality
Assessing NormalityAssessing Normality
Assessing Normality
 
Chap01 describing data; graphical
Chap01 describing data;  graphicalChap01 describing data;  graphical
Chap01 describing data; graphical
 
T test statistics
T test statisticsT test statistics
T test statistics
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 

Similar to MITx_14310_CLT

Identifying the sampling distribution module5
Identifying the sampling distribution module5Identifying the sampling distribution module5
Identifying the sampling distribution module5REYEMMANUELILUMBA
 
Basics of Stats (2).pptx
Basics of Stats (2).pptxBasics of Stats (2).pptx
Basics of Stats (2).pptxmadihamaqbool6
 
Applications to Central Limit Theorem and Law of Large Numbers
Applications to Central Limit Theorem and Law of Large NumbersApplications to Central Limit Theorem and Law of Large Numbers
Applications to Central Limit Theorem and Law of Large NumbersUniversity of Salerno
 
Asymptotic properties of bayes factor in one way repeated measurements model
Asymptotic properties of bayes factor in one  way repeated measurements modelAsymptotic properties of bayes factor in one  way repeated measurements model
Asymptotic properties of bayes factor in one way repeated measurements modelAlexander Decker
 
Asymptotic properties of bayes factor in one way repeated measurements model
Asymptotic properties of bayes factor in one  way repeated measurements modelAsymptotic properties of bayes factor in one  way repeated measurements model
Asymptotic properties of bayes factor in one way repeated measurements modelAlexander Decker
 
MEASURE OF CENTRAL TENDENCY
MEASURE OF CENTRAL TENDENCY  MEASURE OF CENTRAL TENDENCY
MEASURE OF CENTRAL TENDENCY AB Rajar
 
1 Lab 4 The Central Limit Theorem and A Monte Carlo Si.docx
1 Lab 4   The Central Limit Theorem and A Monte Carlo Si.docx1 Lab 4   The Central Limit Theorem and A Monte Carlo Si.docx
1 Lab 4 The Central Limit Theorem and A Monte Carlo Si.docxjeremylockett77
 
Statistics in research
Statistics in researchStatistics in research
Statistics in researchBalaji P
 
The Arithmetic Mean.pptx
The Arithmetic Mean.pptxThe Arithmetic Mean.pptx
The Arithmetic Mean.pptxSanaJaved950916
 
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxA study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxjibinjohn140
 
CrashCourse_0622
CrashCourse_0622CrashCourse_0622
CrashCourse_0622Dexen Xi
 

Similar to MITx_14310_CLT (20)

Identifying the sampling distribution module5
Identifying the sampling distribution module5Identifying the sampling distribution module5
Identifying the sampling distribution module5
 
Talk 2
Talk 2Talk 2
Talk 2
 
Basics of Stats (2).pptx
Basics of Stats (2).pptxBasics of Stats (2).pptx
Basics of Stats (2).pptx
 
Applications to Central Limit Theorem and Law of Large Numbers
Applications to Central Limit Theorem and Law of Large NumbersApplications to Central Limit Theorem and Law of Large Numbers
Applications to Central Limit Theorem and Law of Large Numbers
 
Makalah ukuran penyebaran
Makalah ukuran penyebaranMakalah ukuran penyebaran
Makalah ukuran penyebaran
 
Asymptotic properties of bayes factor in one way repeated measurements model
Asymptotic properties of bayes factor in one  way repeated measurements modelAsymptotic properties of bayes factor in one  way repeated measurements model
Asymptotic properties of bayes factor in one way repeated measurements model
 
Asymptotic properties of bayes factor in one way repeated measurements model
Asymptotic properties of bayes factor in one  way repeated measurements modelAsymptotic properties of bayes factor in one  way repeated measurements model
Asymptotic properties of bayes factor in one way repeated measurements model
 
MEASURE OF CENTRAL TENDENCY
MEASURE OF CENTRAL TENDENCY  MEASURE OF CENTRAL TENDENCY
MEASURE OF CENTRAL TENDENCY
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
1 Lab 4 The Central Limit Theorem and A Monte Carlo Si.docx
1 Lab 4   The Central Limit Theorem and A Monte Carlo Si.docx1 Lab 4   The Central Limit Theorem and A Monte Carlo Si.docx
1 Lab 4 The Central Limit Theorem and A Monte Carlo Si.docx
 
Factorial Experiments
Factorial ExperimentsFactorial Experiments
Factorial Experiments
 
Curve fitting
Curve fittingCurve fitting
Curve fitting
 
Curve fitting
Curve fittingCurve fitting
Curve fitting
 
Statistics in research
Statistics in researchStatistics in research
Statistics in research
 
The Arithmetic Mean.pptx
The Arithmetic Mean.pptxThe Arithmetic Mean.pptx
The Arithmetic Mean.pptx
 
Multiple linear regression
Multiple linear regressionMultiple linear regression
Multiple linear regression
 
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxA study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
 
Statistical parameters
Statistical parametersStatistical parameters
Statistical parameters
 
Basic statistics
Basic statistics Basic statistics
Basic statistics
 
CrashCourse_0622
CrashCourse_0622CrashCourse_0622
CrashCourse_0622
 

More from Ryosuke Ishii

Psychological Flexibility for Psychological Safe Organization
Psychological Flexibility for Psychological Safe OrganizationPsychological Flexibility for Psychological Safe Organization
Psychological Flexibility for Psychological Safe OrganizationRyosuke Ishii
 
180104 find your passion
180104 find your passion 180104 find your passion
180104 find your passion Ryosuke Ishii
 
171212_find_your_passion
171212_find_your_passion171212_find_your_passion
171212_find_your_passionRyosuke Ishii
 
171124 get adopted your proposals public version
171124 get adopted your proposals public version171124 get adopted your proposals public version
171124 get adopted your proposals public versionRyosuke Ishii
 
171111 entrepreneurial economics 3
171111 entrepreneurial economics 3171111 entrepreneurial economics 3
171111 entrepreneurial economics 3Ryosuke Ishii
 
170925 entrepreneurial economics2
170925 entrepreneurial economics2170925 entrepreneurial economics2
170925 entrepreneurial economics2Ryosuke Ishii
 
170902 entrepreneurial economics1
170902 entrepreneurial economics1170902 entrepreneurial economics1
170902 entrepreneurial economics1Ryosuke Ishii
 
mathphysicsmastersv3
mathphysicsmastersv3mathphysicsmastersv3
mathphysicsmastersv3Ryosuke Ishii
 
mathphysicsmastersv2
mathphysicsmastersv2mathphysicsmastersv2
mathphysicsmastersv2Ryosuke Ishii
 
MathPhysicsMasters-v1
MathPhysicsMasters-v1MathPhysicsMasters-v1
MathPhysicsMasters-v1Ryosuke Ishii
 
150109_Lecture_WasedaUniv_venture
150109_Lecture_WasedaUniv_venture150109_Lecture_WasedaUniv_venture
150109_Lecture_WasedaUniv_ventureRyosuke Ishii
 
141011_what is_money
141011_what is_money141011_what is_money
141011_what is_moneyRyosuke Ishii
 
【初心者向】ロジカルシンキングをゼロからはじめる
【初心者向】ロジカルシンキングをゼロからはじめる【初心者向】ロジカルシンキングをゼロからはじめる
【初心者向】ロジカルシンキングをゼロからはじめるRyosuke Ishii
 

More from Ryosuke Ishii (13)

Psychological Flexibility for Psychological Safe Organization
Psychological Flexibility for Psychological Safe OrganizationPsychological Flexibility for Psychological Safe Organization
Psychological Flexibility for Psychological Safe Organization
 
180104 find your passion
180104 find your passion 180104 find your passion
180104 find your passion
 
171212_find_your_passion
171212_find_your_passion171212_find_your_passion
171212_find_your_passion
 
171124 get adopted your proposals public version
171124 get adopted your proposals public version171124 get adopted your proposals public version
171124 get adopted your proposals public version
 
171111 entrepreneurial economics 3
171111 entrepreneurial economics 3171111 entrepreneurial economics 3
171111 entrepreneurial economics 3
 
170925 entrepreneurial economics2
170925 entrepreneurial economics2170925 entrepreneurial economics2
170925 entrepreneurial economics2
 
170902 entrepreneurial economics1
170902 entrepreneurial economics1170902 entrepreneurial economics1
170902 entrepreneurial economics1
 
mathphysicsmastersv3
mathphysicsmastersv3mathphysicsmastersv3
mathphysicsmastersv3
 
mathphysicsmastersv2
mathphysicsmastersv2mathphysicsmastersv2
mathphysicsmastersv2
 
MathPhysicsMasters-v1
MathPhysicsMasters-v1MathPhysicsMasters-v1
MathPhysicsMasters-v1
 
150109_Lecture_WasedaUniv_venture
150109_Lecture_WasedaUniv_venture150109_Lecture_WasedaUniv_venture
150109_Lecture_WasedaUniv_venture
 
141011_what is_money
141011_what is_money141011_what is_money
141011_what is_money
 
【初心者向】ロジカルシンキングをゼロからはじめる
【初心者向】ロジカルシンキングをゼロからはじめる【初心者向】ロジカルシンキングをゼロからはじめる
【初心者向】ロジカルシンキングをゼロからはじめる
 

Recently uploaded

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 

Recently uploaded (20)

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptx
 

MITx_14310_CLT

  • 1. Central Limit Theorem( ) for 14.310x students. Ryosuke ISHII (ryouen)
  • 2. About author • Ryosuke ISHII (call me ryo / ryouen) • From Tokyo, Japan • Graduated from The University of Tokyo. • Current: Researcher, Grad School of System Design and Management, Keio Univ. • Enjoining MITx 14.310x and learn from a lot. • Also MITx 14.100x (Microeconomics) and HarvardX PHP525.x (Statistics) on edX.
  • 3. According to CLT, When the population is following 𝜇(population mean) and 𝜎2(population variance), we took some sample and the sample size = 𝑛, This 𝑛 means how many items in the group. It is different from “the number of samples” If we took many samples repeatedly, we can calculate each of sample’s mean (this is sample mean ഥ𝑥𝑖) and the sample mean is also a random variable. And the sample mean follows: ҧ𝑥 ~ N(𝜇, 𝜎2 𝑛 ) ↑ 𝜎 𝜇 𝑠 𝑥 = 𝜎 𝑛
  • 4. Sample size is different from the number of samples. If we compare 10 males and 15 females The sample size of the male group is 10. The sample size of the female group is 15. The number of samples (or the number of groups) is 2. The number of samples and the sample size can potentially be confusing. Sample size is the number of items within a group. Number of samples is the number of groups.” *Metin Çakanyıldırım, Computing the Standard Deviation of Sample Means
  • 5. (if you wish, you can simulate with the R code below) x <- rnorm(3300, mean=27.6,sd=sqrt(28.3)) n=10 #sample size N=1000 #the number of trials set.seed(1) ys <- vector("numeric",N) ysmean <- vector("numeric",N) ysvar <- vector("numeric",N) yssd <- vector("numeric",N) yalldata <- vector("numeric",0) for(i in 1:N){ ys <- sample(x, n) ysmean[i] = mean(ys,na.rm = TRUE) ysvar[i]= var(ys,na.rm = TRUE) yssd[i] = sd(ys,na.rm = TRUE) yalldata = c(yalldata,ys) }
  • 6. In order to understand deeper, this time assume that we know the TRUE population parameter N(𝜇, 𝜎2 ). TRUE Parameter mean 𝜇 = 27.6 variation 𝜎2 = 28.3 SD 𝜎 = 5.31 (This number is only for example) ↑ 𝜎 𝜇 Set up
  • 7. From a population following N 𝜇, 𝜎2 𝑛 = 10 Let us try sampling the first time! And we set the sample size n=10 𝑥1 34 31 25 28 26 NA 25 20 27 25 ②
  • 8. 𝜇 𝜎 We repeat it 6 times. It means we have 6 groups of samples and the sample size of each group is 10
  • 9. These 6 samples are different because each of sampling is an random sampling. But the result is not perfectly random because it is taken from a population distribution. So, we can say ”data is a representation of random variable gain from sampling.”* 𝑥2 = 25.4𝑥1 = 26.8 𝑥4 = 27.6𝑥3 = 27.5 𝑥6 = 26.9𝑥5 = 27.6 And also, we can calculate each of samples’ mean. You can see the sample mean is also a random variable.
  • 10. How to calculate the sample mean? Yes, we must know. 𝑥1 34 31 25 28 26 NA 25 20 27 25 𝑥1 =26.8 𝑥2 20 NA 22 25 NA 24 21 29 39 23 𝑥2 =25.4 𝑥3 19 16 24 29 42 27 41 21 34 22 𝑥3 =27.5 𝑥4 24 35 24 25 28 20 26 38 28 28 𝑥4 =27.6 𝑥5 27 26 28 31 23 24 NA 34 30 26 𝑥5 =27.7 𝑥6 25 26 24 28 29 NA 28 26 21 35 𝑥6 =26.9 How do you think if we take more sample? For example, we take 200 samples, and calc sample mean.
  • 11. We can plot a histogram of𝑥1~𝑥200 There are 200 averages (of samples) and each of the average is random variable. Next, we would like to calculate the distribution’s (this histogram’s) -mean of sample means ( ҧ𝑥) -variation of sample means (𝑉𝑥) -standard deviation of sample means (𝑠 𝑥)
  • 12. We can calculate it by definition. (I used R to calculate) mean of sample means ( ҧ𝑥) ҧ𝑥 = 1 𝑛 ෍ 𝑖=1 𝑛 ഥ𝑥𝑖 = ഥ𝑥1 + ഥ𝑥2 + ⋯ + 𝑥199 + 𝑥200 200 = 27.541 variation of sample means 𝑉𝑥 = 1 𝑛 − 1 ෍ 𝑖=𝑖 𝑛 ഥ𝑥𝑖 − ҧ𝑥 2 = 𝑥1 − ҧ𝑥 2 + 𝑥2 − ҧ𝑥 2 + ⋯ 𝑥200 − ҧ𝑥 2 200 − 1 = 2.595608 standard deviation of sample means 𝑠 𝑥 = 𝑉𝑥 = 2.595608 = 1.611089
  • 13. We can plot a Normal distribution using the result of the calculation on a histogram we draw before. ↑ Mean ҧ𝑥 = 27.5 𝑁 ҧ𝑥, 𝑉𝑥 = 𝑁(27.5,2.6) SD: 𝑠 𝑥 = 1.6
  • 14. Let’s compare these distributions: population and sample means ↑ Mean ҧ𝑥 = 27.5 𝑁 ҧ𝑥, 𝑉𝑥 = 𝑁(27.5,2.6) 𝑆𝐷 𝑠 𝑥 = 1.6 ↑ 𝜎 = 5.3 Population mean 𝜇 = 27.6 𝑁 𝜇, 𝜎2 = 𝑁(27.6,28.3) Remember, First of all, we have a population distribution showing left. We took randomly pick up samples 200 times and the number of items within the each trial are n=10. And we calculated each samples’ mean and the distribution of the 200 sample means is showing right.
  • 15. To compare, we can integrate these graphs. What do you realize?
  • 16. We know now… The population mean is nearly samples’ mean. The samples’ variation is smaller than population’s.
  • 17. ↑ 𝜎 𝜇 Central Limit Theorem : CLT From a distribution that have 𝝁 𝒂𝒏𝒅 𝝈 𝟐 (it must NOT be following normal) We repeatedly try to take a many samples and the sample size is n. The distribution of “means of samples” are distributed and it follows 𝑁 𝜇, 𝜎2 𝑛 ↑ 𝜇 = ҧ𝑥 𝑠 𝑥 = 𝜎 𝑛 Also, we call 𝜎 𝑛 as Standard Error of the mean ഥ𝑥𝑖 SE
  • 18. Numerically examine it! The goal is to show 𝜇 = 𝑥 and 𝑠 𝑥 = 𝜎 𝑛 ↑ ҧ𝑥 = 27.5 𝑁 ҧ𝑥, 𝑉𝑥 = 𝑁(27.5,2.6) SE=𝑠 𝑥 = 1.61 ↑ 𝜎 = 5.3 𝜇 = 27.6 𝑁 𝜇, 𝜎2 = 𝑁(27.6,28.3) 𝜇 = 27.6 ≅ ҧ𝑥 = 27.5 𝜎 𝑛 = 𝑆𝐸 = 5.3 10 = 5.3 3.16277 = 1.68 ≅ 𝑠 𝑥(𝑆𝐸) = 1.61 Almost Same! True value we already know Theoretically calculate using true value Derived from R trial
  • 19. n=2 n=5 n=10 𝑥1 34 31 25 28 26 𝑥2 20 NA 22 25 NA ⋮ 19 16 24 29 42 𝑥1000 24 35 24 25 28 𝑥1 34 31 𝑥2 20 NA ⋮ 27 26 𝑥1000 25 26 n is here If we change sample size n (and fix the number of trial)