SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Introduction to Statistics for Built
Environment
Course Code: AED 1222
Compiled by
DEPARTMENT OF ARCHITECTURE AND ENVIRONMENTAL DESIGN (AED)
CENTRE FOR FOUNDATION STUDIES (CFS)
INTERNATIONAL ISLAMIC UNIVERSITY MALAYSIA
Lecture 8
Today’s Lecture:
 The range
 Quartiles & the Interquartile range.
 Percentiles
 Percentiles Rank
 The five number summary
Measures of variability/dispersion
Part I
What is/are Measures of Variation/Dispersion?
●If the dispersion are widely dispersed, the central
location is said to be less representative of the
data as a whole.
●If the dispersion are closely dispersed, the central
location is considered more reliable.
Measures of Variation/Dispersion
●Measures of Variation / Dispersion is an
information on the spread or variability of the data
values.
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
• Describes how the data is spreading or the
variability
Same center,
different variation / dispersion
Shape of a Distribution and Variability / Dispersion
Measures of variation/dispersion cont.
Close dispersion
Wide dispersion
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
Right-SkewedLeft-Skewed Symmetric
Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3
Shape of a Distribution and Variability / Dispersion
• Shapes either Symmetric or skewed
Measures of variation/dispersion cont.
The range
●The range is simply the difference between the
largest and the smallest observed values in a data set.
Thus, range, including any outliers, is the actual
spread of data.
●A great deal of information is ignored when
computing the range, since only the largest and
smallest data values are considered.
Range = difference between highest and lowest
observed values
What is/are Range?
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
• Simplest measure of variation
• Difference between the largest and the smallest
observations:
Range = xmaximum – xminimum
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Range = 14 - 1 = 13
Example:
The range cont.
Find The Range
●The range value of a data set is greatly influenced
by the presence of just one unusually large or small
value (outlier).
●The range can be expressed as an interval such as
4–10, where 4 is the lowest value and 10 is highest.
●Often, it is expressed as interval width. For
example, the range of 4–10 can also be expressed
as a range of 6.
The range cont.
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
• Ignores the way in which data are distributed
• Sensitive to outliers
7 8 9 10 11
12 Range = 12 - 7 = 5
7 8 9 10 11
12 Range = 12 - 7 = 5
Disadvantages of the Range
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 5 - 1 = 4
Range = 120 - 1 = 119
The range cont.
●Other disadvantages of using range is that it does not
measure the spread of the majority of values in a data set
—it only measures the spread between highest and
lowest values.
●As a result, other measures are required in order to give
a better picture of the data spread.
●The range is an informative tool used as a supplement
to other measures such as the standard deviation or
semi-interquartile range, but it should rarely be used as
the only measure of spread.
The range cont.
1, 2, 4, 6, 12, 15, 19, 26
Smallest Value Largest Value
Does not take into account how clumped together
the scores are
Range = 26 - 1 = 25
Question: Is range a good measure of spread/dispersion?
Question: Is value 25 a good representative value?
The range cont.
Example 1:
425425 430430 430430 435435 435435 435435 435435 435435 440440 440440
440440 440440 440440 445445 445445 445445 445445 445445 450450 450450
450450 450450 450450 450450 450450 460460 460460 460460 465465 465465
465465 470470 470470 472472 475475 475475 475475 480480 480480 480480
480480 485485 490490 490490 490490 500500 500500 500500 500500 510510
515515 525525 525525 525525 535535 549549 550550 570570 570570
575575 575575 580580 590590 600600 600600 600600 600600 615615 615615
510510
Smallest value
Largest value
does not take into
account how
clumped together
the scores are
Range = 615 – 425 = 190
Question: Is range a good measure of spread/dispersion?
The range cont.
Example 2:
The quartiles
●In descriptive statistics, a quartile is any of the three values
which divide the sorted (arrayed) data set into four equal
parts, so that each part represents one fourth of the sampled
population.
●The median divides the data into two equal sets.
●The lower quartile is the value of the middle of the first set,
where 25% of the values are smaller than Q1 and 75% are
larger. This first quartile takes the notation Q1.
●The upper quartile is the value of the middle of the second
set, where 75% of the values are smaller than Q3 and 25% are
larger. This third quartile takes the notation Q3.
What is/are Quartiles?
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
• Quartiles split the ranked data into 4 equal groups:
25% 25% 25% 25%
Q1 Q2 Q3
The quartiles cont.
> first quartile (designated Q1) = lower quartile = cuts off
lowest 25% of data = 25th percentile
> second quartile (designated Q2) = median = cuts off data
set in half (50% of data) = 50th percentile
> third quartile (designated Q3) = upper quartile = cuts off
highest 25% of data, or lowest 75% = 75th percentile
• Note that the second quartile Q2 (the 50th
percentile) is the median
The formula for locating the position of the
observation at a given percentile, y, with n data
points sorted in ascending order is:
i = Ly = (y/100)n
•Case 1: If L is a whole number, then the value will
be found halfway between positions L and L+1.
•Case 2: If L is a decimal, round up to the nearest
whole number. (for example, L = 1.2 becomes 2).
Locating the position of the quartiles
The quartiles cont.
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
Sample Data in Ordered Array:
11 12 13 16 16 17 18 21 22
Example:
 Find the first quartile (Q1)
n = 9
Q1 = 25th
percentile, so find i : i = (9) = 2.25
so round up and use the value in the 3rd
position: Q1 = 13
25
100
The quartiles cont.
The Interquartile Range
●The interquartile range is another range used as a
measure of the spread.
●The difference between upper and lower quartiles (Q3–
Q1), which is called the interquartile range, also indicates
the dispersion of a data set.
●The interquartile range spans 50% of a data set, and
eliminates the influence of outliers because, in effect, the
highest and lowest quarters are removed.
What is/are Interquartile Range?
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
• Can eliminate some outlier problems by using the
interquartile range
• Eliminate some high-and low-valued observations
and calculate the range from the remaining values.
• Interquartile range = 3rd
quartile – 1st
quartile
The interquartile range cont.
Interquartile range = difference between upper quartile (Q3)
and lower quartile (Q1)
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
Median
(Q2)
X maximumX minimum
Q1 Q3
Example:
25% 25% 25% 25%
12 30 45 57 70
Interquartile range
= 57 – 30 = 27
The interquartile range cont.
An exercise
A year ago, Ali began working at a computer store. His
supervisor asked him to keep a record of the number of
sales he made each month.
The following data set is a list of his sales for the last 12
months: 34, 47, 1, 15, 57, 24, 20, 11, 19, 50, 28, 37.
•Use Ali's sales records to find:
 the median
 the range
 the upper and lower quartiles
 the interquartile range
The values in an ascending array are:
1, 11, 15, 19, 20, 24, 28, 34, 37, 47, 50, 57.
Median = (12 + 1) á 2
= 6.5th value
= (6th
+ 7th
observations) á 2
= (24 + 28) á 2
= 26
Range = difference between the highest and lowest
values = 57 – 1
= 56
Exercise cont.
Lower quartile = value of middle of first half of data Q1 =
the median of 1, 11, 15, 19, 20, 24
= (3rd
+ 4th
observations) á 2
= (15 + 19) á 2
= 17
Upper quartile = value of middle of second half of data Q3
= the median of 28, 34, 37, 47, 50, 57
= (3rd
+ 4th
observations) á 2
= (37 + 47) á 2
= 42
Interquartile range = Q3–Q1 = 42 – 17 = 25
Exercise cont.
Exercise cont.
Percentiles
●The ath
percentile is a value so that roughly a%
of the data are smaller and (100-a)% of the
data are larger.
●There are three steps for computing a
percentile.
– Organize the data into an ascending array;
– Count the number of values (n);
– Select the a*(n+1) observation.
What is/are Percentile?
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
• The pth
percentile in an ordered array of n values is the value
in ith
position, where
 Example: Find the 60th
percentile in an ordered array of
19 values.
(n)
100
p
i =
11.4(19)
100
60
(n)
100
p
i ===
If i is not an integer, round
up to the next higher integer
value
So use value in the i =
12th
position
Percentiles cont.
You can't always be so lucky to have a*(n+1) be
a nice whole number. Here are some scenarios:
•If a*(n+1) is not a whole number, then go
halfway between the two adjacent numbers.
•If a*(n+1) < 1, select the smallest observation.
•If a*(n+1) > n, select the largest observation.
Percentiles cont.
• Arrayed data: 18, 33, 58, 67, 73, 93, 147
• There are 7 observations (n=7).
• Select 0.50*(7+1) = 4th observation.
• Therefore, the 50th percentile equals 67.
-Notice that there are three observations larger than
67 and three observations smaller than 67.
Percentiles cont.
Example:
 Compute the 50th percentile for the following data set
73, 58, 67, 93, 33, 18, 147
50th
percentile
 Suppose we want to compute the 20th percentile…
• Notice that p*(n+1) = 0.20*(7+1)=1.6. This is not a
whole number so we select halfway between 1st and
2nd observation or 25.5.
 Suppose we want to compute the 10th percentile…
• Since 0.10*(7+1)=0.8, we should select the smallest
observation which is 18.
Percentile cont.
18, 33, 58, 67, 73, 93, 14710th
percentile
20th
percentile25.5Example: (Cont.)
• Percentile rank of a score is the percentage of
scores in its frequency distribution which are
lower than it.
• Percentile ranks are commonly used to clarify
the interpretation of scores on standardized
tests.
• Given formula:
Pr = (number of value below the score+ 0.5) á ( total
number of values) (100%).
Percentiles Rank
What is/are Percentile Rank?
Business Statistics: A Decision-
Making Approach, 7e Š 2008
Prentice-Hall, Inc.
Sample Data in Ordered Array:
11 12 13 16 16 17 18 21 22
Example:
 Find the percentile rank for (17)
n = 9
Pr17 = [(no. of values below the score of 17+0.5)/no. of scores]*100
= [(5+0.5)/9]*100
= 61st
percentile.
This value indicated that the student, who obtained a score of 17 marks,
scored higher than 61% of the members of his/her class.
Percentile Rank cont.
61%
The five number summary
A five number summary uses percentiles to
describe a set of data. The five number summary
consists of
•MAX - the maximum value
•75% - the 75th
percentile
•50% - the 50th
percentile (or the median)
•25% - the 25th
percentile
•MIN - the minimum value
The five number summary splits the data into four
regions, each of which contains 25% of the data.
What is/are Five Number Summary?
• The minimum value is = 1
• The lower half is {1, 3, 4}, and the median of that half (the
25th
percentile) is = 3
• The median (the 50th
percentile) is = 5
• The upper half is {6, 7, 9}, and the median of that half (the
75th
percentile) is = 7
• The maximum value is = 9
• The minimum value is = 1
• The lower half is {1, 3, 4}, and the median of that half (the
25th
percentile) is = 3
• The median (the 50th
percentile) is = 5
• The upper half is {6, 7, 9}, and the median of that half (the
75th
percentile) is = 7
• The maximum value is = 9
The five number summary cont.
Example:
 Find the five number summary for the data set.
1, 3, 4, 5, 6, 7, 9
Next class…
The following topics will be discussed:
 Measures of variability / dispersion (Part II):
 The average absolute deviation
 The Variance
 The Standard deviation
 Coefficient of Variation (CV)

Weitere ähnliche Inhalte

Was ist angesagt?

Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersions
Capricorn
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
yogesh ingle
 
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Neeraj Bhandari
 
Mann Whitney U Test And Chi Squared
Mann Whitney U Test And Chi SquaredMann Whitney U Test And Chi Squared
Mann Whitney U Test And Chi Squared
guest2137aa
 
Chapter 5 sets & counting
Chapter 5   sets & countingChapter 5   sets & counting
Chapter 5 sets & counting
sarkissk
 
Box and whiskers power point
Box and whiskers power pointBox and whiskers power point
Box and whiskers power point
manswag123
 

Was ist angesagt? (20)

Descriptive Statistics, Numerical Description
Descriptive Statistics, Numerical DescriptionDescriptive Statistics, Numerical Description
Descriptive Statistics, Numerical Description
 
Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersions
 
frequency distribution
 frequency distribution frequency distribution
frequency distribution
 
Measures of Variation
Measures of Variation Measures of Variation
Measures of Variation
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Statistics and probability lesson6&7
Statistics and probability lesson6&7Statistics and probability lesson6&7
Statistics and probability lesson6&7
 
Measure of Dispersion
Measure of DispersionMeasure of Dispersion
Measure of Dispersion
 
Binomial Distribution Part 5
Binomial Distribution Part 5Binomial Distribution Part 5
Binomial Distribution Part 5
 
Mean, Median, Mode, And Range
Mean, Median, Mode, And RangeMean, Median, Mode, And Range
Mean, Median, Mode, And Range
 
Probability Distributions for Continuous Variables
Probability Distributions for Continuous VariablesProbability Distributions for Continuous Variables
Probability Distributions for Continuous Variables
 
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
 
Quartile in Statistics
Quartile in StatisticsQuartile in Statistics
Quartile in Statistics
 
MEAN MODE MEDIAN AND RANGE
MEAN MODE MEDIAN AND RANGEMEAN MODE MEDIAN AND RANGE
MEAN MODE MEDIAN AND RANGE
 
Measures of variability grouped data
Measures of variability grouped dataMeasures of variability grouped data
Measures of variability grouped data
 
Mann Whitney U Test And Chi Squared
Mann Whitney U Test And Chi SquaredMann Whitney U Test And Chi Squared
Mann Whitney U Test And Chi Squared
 
Discrete Random Variables And Probability Distributions
Discrete Random Variables And Probability DistributionsDiscrete Random Variables And Probability Distributions
Discrete Random Variables And Probability Distributions
 
Combined mean and Weighted Arithmetic Mean
Combined mean and  Weighted Arithmetic MeanCombined mean and  Weighted Arithmetic Mean
Combined mean and Weighted Arithmetic Mean
 
Chapter 5 sets & counting
Chapter 5   sets & countingChapter 5   sets & counting
Chapter 5 sets & counting
 
Box and whiskers power point
Box and whiskers power pointBox and whiskers power point
Box and whiskers power point
 
Statistics and probability
Statistics and probability   Statistics and probability
Statistics and probability
 

Andere mochten auch

Lesson 7 measures of dispersion part 2
Lesson 7 measures of dispersion part 2Lesson 7 measures of dispersion part 2
Lesson 7 measures of dispersion part 2
nurun2010
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
Sachin Shekde
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersion
Hasnain Baber
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervals
Tanay Tandon
 
Ibe quantity surveyor 2013
Ibe quantity surveyor 2013Ibe quantity surveyor 2013
Ibe quantity surveyor 2013
nurun2010
 
Confidence Intervals
Confidence IntervalsConfidence Intervals
Confidence Intervals
mandalina landy
 
Chapter 3 Confidence Interval
Chapter 3 Confidence IntervalChapter 3 Confidence Interval
Chapter 3 Confidence Interval
ghalan
 

Andere mochten auch (20)

Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
 
Lesson 7 measures of dispersion part 2
Lesson 7 measures of dispersion part 2Lesson 7 measures of dispersion part 2
Lesson 7 measures of dispersion part 2
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersion
 
Measures of Dispersion
Measures of DispersionMeasures of Dispersion
Measures of Dispersion
 
Modeling and Simulation in Business Process Management
Modeling and Simulation in Business Process ManagementModeling and Simulation in Business Process Management
Modeling and Simulation in Business Process Management
 
Measures of dispersion or variation
Measures of dispersion or variationMeasures of dispersion or variation
Measures of dispersion or variation
 
Confidence interval
Confidence intervalConfidence interval
Confidence interval
 
Confidence intervals
Confidence intervalsConfidence intervals
Confidence intervals
 
Chapter 11 ,Measures of Dispersion(statistics)
Chapter  11 ,Measures of Dispersion(statistics)Chapter  11 ,Measures of Dispersion(statistics)
Chapter 11 ,Measures of Dispersion(statistics)
 
Measures of Dispersion
Measures of DispersionMeasures of Dispersion
Measures of Dispersion
 
Ibe quantity surveyor 2013
Ibe quantity surveyor 2013Ibe quantity surveyor 2013
Ibe quantity surveyor 2013
 
Confidence Intervals
Confidence IntervalsConfidence Intervals
Confidence Intervals
 
Chapter 3 Confidence Interval
Chapter 3 Confidence IntervalChapter 3 Confidence Interval
Chapter 3 Confidence Interval
 
Measure of dispersion part I (Range, Quartile Deviation, Interquartile devi...
Measure of dispersion part   I (Range, Quartile Deviation, Interquartile devi...Measure of dispersion part   I (Range, Quartile Deviation, Interquartile devi...
Measure of dispersion part I (Range, Quartile Deviation, Interquartile devi...
 
Applications of simulation in Business with Example
Applications of simulation in Business with ExampleApplications of simulation in Business with Example
Applications of simulation in Business with Example
 
Accounting in insurance companies basic concepts
Accounting in insurance companies   basic conceptsAccounting in insurance companies   basic concepts
Accounting in insurance companies basic concepts
 
Introduction to t-tests (statistics)
Introduction to t-tests (statistics)Introduction to t-tests (statistics)
Introduction to t-tests (statistics)
 
MBA case study presentation template
MBA case study presentation templateMBA case study presentation template
MBA case study presentation template
 

Ähnlich wie Lesson 7 measures of dispersion part 1

Lesson 6 measures of central tendency
Lesson 6 measures of central tendencyLesson 6 measures of central tendency
Lesson 6 measures of central tendency
nurun2010
 
Statistics and probability lec006 part 1
Statistics and probability lec006 part 1Statistics and probability lec006 part 1
Statistics and probability lec006 part 1
TieeTiee
 
Answer the questions in one paragraph 4-5 sentences. ¡ Why did t.docx
Answer the questions in one paragraph 4-5 sentences. ¡ Why did t.docxAnswer the questions in one paragraph 4-5 sentences. ¡ Why did t.docx
Answer the questions in one paragraph 4-5 sentences. ¡ Why did t.docx
boyfieldhouse
 
Measure of Variability Report.pptx
Measure of Variability Report.pptxMeasure of Variability Report.pptx
Measure of Variability Report.pptx
CalvinAdorDionisio
 
1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf
thaersyam
 

Ähnlich wie Lesson 7 measures of dispersion part 1 (20)

What is Descriptive Statistics and How Do You Choose the Right One for Enterp...
What is Descriptive Statistics and How Do You Choose the Right One for Enterp...What is Descriptive Statistics and How Do You Choose the Right One for Enterp...
What is Descriptive Statistics and How Do You Choose the Right One for Enterp...
 
quartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.pptquartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.ppt
 
Mat 255 chapter 3 notes
Mat 255 chapter 3 notesMat 255 chapter 3 notes
Mat 255 chapter 3 notes
 
Statistics.pdf
Statistics.pdfStatistics.pdf
Statistics.pdf
 
Lesson 6 measures of central tendency
Lesson 6 measures of central tendencyLesson 6 measures of central tendency
Lesson 6 measures of central tendency
 
Statistics and probability lec006 part 1
Statistics and probability lec006 part 1Statistics and probability lec006 part 1
Statistics and probability lec006 part 1
 
Answer the questions in one paragraph 4-5 sentences. ¡ Why did t.docx
Answer the questions in one paragraph 4-5 sentences. ¡ Why did t.docxAnswer the questions in one paragraph 4-5 sentences. ¡ Why did t.docx
Answer the questions in one paragraph 4-5 sentences. ¡ Why did t.docx
 
Intro to data science
Intro to data scienceIntro to data science
Intro to data science
 
Introduction To Data Science Using R
Introduction To Data Science Using RIntroduction To Data Science Using R
Introduction To Data Science Using R
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
 
3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplots3.3 Measures of relative standing and boxplots
3.3 Measures of relative standing and boxplots
 
STATISTICS.pptx for the scholars and students
STATISTICS.pptx for the scholars and studentsSTATISTICS.pptx for the scholars and students
STATISTICS.pptx for the scholars and students
 
Measure of Variability Report.pptx
Measure of Variability Report.pptxMeasure of Variability Report.pptx
Measure of Variability Report.pptx
 
Measures-of-variation-report- advance stat
Measures-of-variation-report- advance statMeasures-of-variation-report- advance stat
Measures-of-variation-report- advance stat
 
TOPIC 9 VARIABILITY TESTS.pdf
TOPIC 9 VARIABILITY TESTS.pdfTOPIC 9 VARIABILITY TESTS.pdf
TOPIC 9 VARIABILITY TESTS.pdf
 
Descriptive Measures-Measures of Positionality
Descriptive Measures-Measures of PositionalityDescriptive Measures-Measures of Positionality
Descriptive Measures-Measures of Positionality
 
Numerical Descriptive Measures
Numerical Descriptive MeasuresNumerical Descriptive Measures
Numerical Descriptive Measures
 
1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf1.0 Descriptive statistics.pdf
1.0 Descriptive statistics.pdf
 
Lecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptxLecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptx
 
Topic 4 Measures of Dispersion & Numericals.pptx
Topic 4  Measures of Dispersion & Numericals.pptxTopic 4  Measures of Dispersion & Numericals.pptx
Topic 4 Measures of Dispersion & Numericals.pptx
 

Mehr von nurun2010

Conservation updated (1)
Conservation updated (1)Conservation updated (1)
Conservation updated (1)
nurun2010
 
Week 8 landscape architecture
Week 8 landscape architectureWeek 8 landscape architecture
Week 8 landscape architecture
nurun2010
 
Lesson 8 zscore
Lesson 8 zscoreLesson 8 zscore
Lesson 8 zscore
nurun2010
 
Ibe interiordesign
Ibe interiordesignIbe interiordesign
Ibe interiordesign
nurun2010
 
Week architecture updated
Week  architecture updatedWeek  architecture updated
Week architecture updated
nurun2010
 
Aed1222 lesson 6 2nd part
Aed1222 lesson 6 2nd partAed1222 lesson 6 2nd part
Aed1222 lesson 6 2nd part
nurun2010
 
Aed1222 lesson 6
Aed1222 lesson 6Aed1222 lesson 6
Aed1222 lesson 6
nurun2010
 
Aed1313 ibe week 5 urp
Aed1313 ibe week 5 urpAed1313 ibe week 5 urp
Aed1313 ibe week 5 urp
nurun2010
 
Env. management and issues
Env. management and issuesEnv. management and issues
Env. management and issues
nurun2010
 
Aed1222 lesson 5
Aed1222 lesson 5Aed1222 lesson 5
Aed1222 lesson 5
nurun2010
 
Aed 1313 ibe wk 2
Aed 1313 ibe wk 2Aed 1313 ibe wk 2
Aed 1313 ibe wk 2
nurun2010
 
Aed 1313 ibe-wk 1
Aed 1313 ibe-wk 1Aed 1313 ibe-wk 1
Aed 1313 ibe-wk 1
nurun2010
 
Aed1222 lesson 4
Aed1222 lesson 4Aed1222 lesson 4
Aed1222 lesson 4
nurun2010
 
Aed1222 lesson 3 addition
Aed1222 lesson 3 additionAed1222 lesson 3 addition
Aed1222 lesson 3 addition
nurun2010
 
Aed1222 lesson 2
Aed1222 lesson 2Aed1222 lesson 2
Aed1222 lesson 2
nurun2010
 
Aed1222 lesson 1 and 3
Aed1222 lesson 1 and 3Aed1222 lesson 1 and 3
Aed1222 lesson 1 and 3
nurun2010
 

Mehr von nurun2010 (16)

Conservation updated (1)
Conservation updated (1)Conservation updated (1)
Conservation updated (1)
 
Week 8 landscape architecture
Week 8 landscape architectureWeek 8 landscape architecture
Week 8 landscape architecture
 
Lesson 8 zscore
Lesson 8 zscoreLesson 8 zscore
Lesson 8 zscore
 
Ibe interiordesign
Ibe interiordesignIbe interiordesign
Ibe interiordesign
 
Week architecture updated
Week  architecture updatedWeek  architecture updated
Week architecture updated
 
Aed1222 lesson 6 2nd part
Aed1222 lesson 6 2nd partAed1222 lesson 6 2nd part
Aed1222 lesson 6 2nd part
 
Aed1222 lesson 6
Aed1222 lesson 6Aed1222 lesson 6
Aed1222 lesson 6
 
Aed1313 ibe week 5 urp
Aed1313 ibe week 5 urpAed1313 ibe week 5 urp
Aed1313 ibe week 5 urp
 
Env. management and issues
Env. management and issuesEnv. management and issues
Env. management and issues
 
Aed1222 lesson 5
Aed1222 lesson 5Aed1222 lesson 5
Aed1222 lesson 5
 
Aed 1313 ibe wk 2
Aed 1313 ibe wk 2Aed 1313 ibe wk 2
Aed 1313 ibe wk 2
 
Aed 1313 ibe-wk 1
Aed 1313 ibe-wk 1Aed 1313 ibe-wk 1
Aed 1313 ibe-wk 1
 
Aed1222 lesson 4
Aed1222 lesson 4Aed1222 lesson 4
Aed1222 lesson 4
 
Aed1222 lesson 3 addition
Aed1222 lesson 3 additionAed1222 lesson 3 addition
Aed1222 lesson 3 addition
 
Aed1222 lesson 2
Aed1222 lesson 2Aed1222 lesson 2
Aed1222 lesson 2
 
Aed1222 lesson 1 and 3
Aed1222 lesson 1 and 3Aed1222 lesson 1 and 3
Aed1222 lesson 1 and 3
 

KĂźrzlich hochgeladen

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

KĂźrzlich hochgeladen (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 

Lesson 7 measures of dispersion part 1

  • 1. Introduction to Statistics for Built Environment Course Code: AED 1222 Compiled by DEPARTMENT OF ARCHITECTURE AND ENVIRONMENTAL DESIGN (AED) CENTRE FOR FOUNDATION STUDIES (CFS) INTERNATIONAL ISLAMIC UNIVERSITY MALAYSIA
  • 2. Lecture 8 Today’s Lecture:  The range  Quartiles & the Interquartile range.  Percentiles  Percentiles Rank  The five number summary Measures of variability/dispersion Part I
  • 3. What is/are Measures of Variation/Dispersion? ●If the dispersion are widely dispersed, the central location is said to be less representative of the data as a whole. ●If the dispersion are closely dispersed, the central location is considered more reliable. Measures of Variation/Dispersion ●Measures of Variation / Dispersion is an information on the spread or variability of the data values.
  • 4. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. • Describes how the data is spreading or the variability Same center, different variation / dispersion Shape of a Distribution and Variability / Dispersion Measures of variation/dispersion cont. Close dispersion Wide dispersion
  • 5. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. Right-SkewedLeft-Skewed Symmetric Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3 Shape of a Distribution and Variability / Dispersion • Shapes either Symmetric or skewed Measures of variation/dispersion cont.
  • 6. The range ●The range is simply the difference between the largest and the smallest observed values in a data set. Thus, range, including any outliers, is the actual spread of data. ●A great deal of information is ignored when computing the range, since only the largest and smallest data values are considered. Range = difference between highest and lowest observed values What is/are Range?
  • 7. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. • Simplest measure of variation • Difference between the largest and the smallest observations: Range = xmaximum – xminimum 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Range = 14 - 1 = 13 Example: The range cont. Find The Range
  • 8. ●The range value of a data set is greatly influenced by the presence of just one unusually large or small value (outlier). ●The range can be expressed as an interval such as 4–10, where 4 is the lowest value and 10 is highest. ●Often, it is expressed as interval width. For example, the range of 4–10 can also be expressed as a range of 6. The range cont.
  • 9. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. • Ignores the way in which data are distributed • Sensitive to outliers 7 8 9 10 11 12 Range = 12 - 7 = 5 7 8 9 10 11 12 Range = 12 - 7 = 5 Disadvantages of the Range 1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5 1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120 Range = 5 - 1 = 4 Range = 120 - 1 = 119 The range cont.
  • 10. ●Other disadvantages of using range is that it does not measure the spread of the majority of values in a data set —it only measures the spread between highest and lowest values. ●As a result, other measures are required in order to give a better picture of the data spread. ●The range is an informative tool used as a supplement to other measures such as the standard deviation or semi-interquartile range, but it should rarely be used as the only measure of spread. The range cont.
  • 11. 1, 2, 4, 6, 12, 15, 19, 26 Smallest Value Largest Value Does not take into account how clumped together the scores are Range = 26 - 1 = 25 Question: Is range a good measure of spread/dispersion? Question: Is value 25 a good representative value? The range cont. Example 1:
  • 12. 425425 430430 430430 435435 435435 435435 435435 435435 440440 440440 440440 440440 440440 445445 445445 445445 445445 445445 450450 450450 450450 450450 450450 450450 450450 460460 460460 460460 465465 465465 465465 470470 470470 472472 475475 475475 475475 480480 480480 480480 480480 485485 490490 490490 490490 500500 500500 500500 500500 510510 515515 525525 525525 525525 535535 549549 550550 570570 570570 575575 575575 580580 590590 600600 600600 600600 600600 615615 615615 510510 Smallest value Largest value does not take into account how clumped together the scores are Range = 615 – 425 = 190 Question: Is range a good measure of spread/dispersion? The range cont. Example 2:
  • 13. The quartiles ●In descriptive statistics, a quartile is any of the three values which divide the sorted (arrayed) data set into four equal parts, so that each part represents one fourth of the sampled population. ●The median divides the data into two equal sets. ●The lower quartile is the value of the middle of the first set, where 25% of the values are smaller than Q1 and 75% are larger. This first quartile takes the notation Q1. ●The upper quartile is the value of the middle of the second set, where 75% of the values are smaller than Q3 and 25% are larger. This third quartile takes the notation Q3. What is/are Quartiles?
  • 14. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. • Quartiles split the ranked data into 4 equal groups: 25% 25% 25% 25% Q1 Q2 Q3 The quartiles cont. > first quartile (designated Q1) = lower quartile = cuts off lowest 25% of data = 25th percentile > second quartile (designated Q2) = median = cuts off data set in half (50% of data) = 50th percentile > third quartile (designated Q3) = upper quartile = cuts off highest 25% of data, or lowest 75% = 75th percentile • Note that the second quartile Q2 (the 50th percentile) is the median
  • 15. The formula for locating the position of the observation at a given percentile, y, with n data points sorted in ascending order is: i = Ly = (y/100)n •Case 1: If L is a whole number, then the value will be found halfway between positions L and L+1. •Case 2: If L is a decimal, round up to the nearest whole number. (for example, L = 1.2 becomes 2). Locating the position of the quartiles The quartiles cont.
  • 16. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. Sample Data in Ordered Array: 11 12 13 16 16 17 18 21 22 Example:  Find the first quartile (Q1) n = 9 Q1 = 25th percentile, so find i : i = (9) = 2.25 so round up and use the value in the 3rd position: Q1 = 13 25 100 The quartiles cont.
  • 17. The Interquartile Range ●The interquartile range is another range used as a measure of the spread. ●The difference between upper and lower quartiles (Q3– Q1), which is called the interquartile range, also indicates the dispersion of a data set. ●The interquartile range spans 50% of a data set, and eliminates the influence of outliers because, in effect, the highest and lowest quarters are removed. What is/are Interquartile Range?
  • 18. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. • Can eliminate some outlier problems by using the interquartile range • Eliminate some high-and low-valued observations and calculate the range from the remaining values. • Interquartile range = 3rd quartile – 1st quartile The interquartile range cont. Interquartile range = difference between upper quartile (Q3) and lower quartile (Q1)
  • 19. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. Median (Q2) X maximumX minimum Q1 Q3 Example: 25% 25% 25% 25% 12 30 45 57 70 Interquartile range = 57 – 30 = 27 The interquartile range cont.
  • 20. An exercise A year ago, Ali began working at a computer store. His supervisor asked him to keep a record of the number of sales he made each month. The following data set is a list of his sales for the last 12 months: 34, 47, 1, 15, 57, 24, 20, 11, 19, 50, 28, 37. •Use Ali's sales records to find:  the median  the range  the upper and lower quartiles  the interquartile range
  • 21. The values in an ascending array are: 1, 11, 15, 19, 20, 24, 28, 34, 37, 47, 50, 57. Median = (12 + 1) á 2 = 6.5th value = (6th + 7th observations) á 2 = (24 + 28) á 2 = 26 Range = difference between the highest and lowest values = 57 – 1 = 56 Exercise cont.
  • 22. Lower quartile = value of middle of first half of data Q1 = the median of 1, 11, 15, 19, 20, 24 = (3rd + 4th observations) á 2 = (15 + 19) á 2 = 17 Upper quartile = value of middle of second half of data Q3 = the median of 28, 34, 37, 47, 50, 57 = (3rd + 4th observations) á 2 = (37 + 47) á 2 = 42 Interquartile range = Q3–Q1 = 42 – 17 = 25 Exercise cont.
  • 24. Percentiles ●The ath percentile is a value so that roughly a% of the data are smaller and (100-a)% of the data are larger. ●There are three steps for computing a percentile. – Organize the data into an ascending array; – Count the number of values (n); – Select the a*(n+1) observation. What is/are Percentile?
  • 25. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. • The pth percentile in an ordered array of n values is the value in ith position, where  Example: Find the 60th percentile in an ordered array of 19 values. (n) 100 p i = 11.4(19) 100 60 (n) 100 p i === If i is not an integer, round up to the next higher integer value So use value in the i = 12th position Percentiles cont.
  • 26. You can't always be so lucky to have a*(n+1) be a nice whole number. Here are some scenarios: •If a*(n+1) is not a whole number, then go halfway between the two adjacent numbers. •If a*(n+1) < 1, select the smallest observation. •If a*(n+1) > n, select the largest observation. Percentiles cont.
  • 27. • Arrayed data: 18, 33, 58, 67, 73, 93, 147 • There are 7 observations (n=7). • Select 0.50*(7+1) = 4th observation. • Therefore, the 50th percentile equals 67. -Notice that there are three observations larger than 67 and three observations smaller than 67. Percentiles cont. Example:  Compute the 50th percentile for the following data set 73, 58, 67, 93, 33, 18, 147 50th percentile
  • 28.  Suppose we want to compute the 20th percentile… • Notice that p*(n+1) = 0.20*(7+1)=1.6. This is not a whole number so we select halfway between 1st and 2nd observation or 25.5.  Suppose we want to compute the 10th percentile… • Since 0.10*(7+1)=0.8, we should select the smallest observation which is 18. Percentile cont. 18, 33, 58, 67, 73, 93, 14710th percentile 20th percentile25.5Example: (Cont.)
  • 29. • Percentile rank of a score is the percentage of scores in its frequency distribution which are lower than it. • Percentile ranks are commonly used to clarify the interpretation of scores on standardized tests. • Given formula: Pr = (number of value below the score+ 0.5) á ( total number of values) (100%). Percentiles Rank What is/are Percentile Rank?
  • 30. Business Statistics: A Decision- Making Approach, 7e Š 2008 Prentice-Hall, Inc. Sample Data in Ordered Array: 11 12 13 16 16 17 18 21 22 Example:  Find the percentile rank for (17) n = 9 Pr17 = [(no. of values below the score of 17+0.5)/no. of scores]*100 = [(5+0.5)/9]*100 = 61st percentile. This value indicated that the student, who obtained a score of 17 marks, scored higher than 61% of the members of his/her class. Percentile Rank cont. 61%
  • 31. The five number summary A five number summary uses percentiles to describe a set of data. The five number summary consists of •MAX - the maximum value •75% - the 75th percentile •50% - the 50th percentile (or the median) •25% - the 25th percentile •MIN - the minimum value The five number summary splits the data into four regions, each of which contains 25% of the data. What is/are Five Number Summary?
  • 32. • The minimum value is = 1 • The lower half is {1, 3, 4}, and the median of that half (the 25th percentile) is = 3 • The median (the 50th percentile) is = 5 • The upper half is {6, 7, 9}, and the median of that half (the 75th percentile) is = 7 • The maximum value is = 9 • The minimum value is = 1 • The lower half is {1, 3, 4}, and the median of that half (the 25th percentile) is = 3 • The median (the 50th percentile) is = 5 • The upper half is {6, 7, 9}, and the median of that half (the 75th percentile) is = 7 • The maximum value is = 9 The five number summary cont. Example:  Find the five number summary for the data set. 1, 3, 4, 5, 6, 7, 9
  • 33. Next class… The following topics will be discussed:  Measures of variability / dispersion (Part II):  The average absolute deviation  The Variance  The Standard deviation  Coefficient of Variation (CV)

Hinweis der Redaktion

  1. Updated Version 12/12/2012