SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Statistics “Four”
Mohamed Ahmed Hefny, MD.
Describing data with numeric
summary values
Learning objectives
1. Explain what prevalence and incidence are.
2. Explain what a summary measure of location is, and
show that you understand the meaning of, and the
difference between, the mode, the median and the
mean.
3. Be able to calculate the mode, median and mean for a
set of values.
4. Explain what a percentile is, and calculate any given
percentile value.
5. Explain what a summary measure of spread is, and show
that you understand the difference between, and can
calculate, the range, the interquartile range and the
standard deviation.
Numbers, percentages and proportions
• When you present the results of an investigation, you
will almost certainly need to give the numbers of the
subjects involved; and perhaps also provide values for
percentages.
• It is usually categorical data that are summarized with
a value for percentage or proportion.
Prevalence and the incidence rate
When suitable we can also summarize data by providing a
value for the prevalence or the incidence rate of some
condition.
• Prevalence of a disease is the number of existing cases
in some population at a given time. In practice, the period
prevalence is more often used.
• i.e. the prevalence of Breast Cancer in women in a place
in 2010 was 3.1%. The prevalence figure will include
existing cases, i.e. those who contracted the disease
before 2010, and still had it, as well as those first
getting the disease in 2010.
Incidence or inception rate of a disease is the number
of new cases occurring per 1000, or per 10 000, of the
population , during a given period, usually 12 months.
Summary measures of location
A summary measure of location is a value around which
most of the data values tend to congregate or center.
There are three measures of location
• Mode
• Median
• Mean
Mode
• The mode is that category or value in the data that has
the highest frequency (i.e. occurs the most often). In this
sense, the mode is a measure of common-ness or
typical-ness.
• The mode is not particularly useful with metric
continuous data where no two values may be the same.
The other deficiency of this measure is that there may be
more than one mode in a set of data.
Patients Number of inhaler use in last 24 hours
A 5
B 12
C 10
Median
• If we arrange the data in ascending order of size, the
median is the middlemost number in the set. Thus, half
of the values will be equal to or less than the
median value, and half equal to or above it. The
median is thus a measure of central-ness.
• i.e. Age (in ascending order of years), for 5 individuals:
30 31 32 33 35. The middle value is 32, so the median
age for these 5 people is 32 years.
• Another way of determining the value of the median, If you
have “n” values arranged in ascending order, then: the
median = 1 / 2(n + 1)th value.
• i.e., if the ages of six people are: 30 31 32 33 35 36, then n
= 6, therefore:
• 1 / 2(n + 1) = 1 / 2 × (6 + 1) = 1 / 2 × 7 = 3.5
• Then, median is the 3.5th value. That is, it is the value half
way between the 3rd value of 32, and the 4th value of 33,
or 32.5 years, which is the same result as before.
• An advantage of the median is that it is not much affected
by skewness in the distribution, or by the presence of
outliers. However, it discards a lot of information, because it
ignores most of the values, apart from those in the center
of the distribution.
Mean
• The mean, or the arithmetic mean to give it its full name,
is more commonly known as the average.
• One advantage of the mean over the median is that it
uses all of the information in the data set.
• However, it is affected by skewness in the distribution,
and by the presence of outliers in the data.
• This may, on occasion, produce a mean that is not very
representative of the general mass of the data.
• Moreover, it cannot be used with ordinal data.
Percentiles
• A percentile (or a centile) is a measure used in statistics
indicating the value below which a given percentage of
observations in a group of observations fall. For example,
the 20th percentile is the value (or score) below which 20
percent of the observations may be found.
• Percentiles are the values which divide an ordered set of
data into 100 equal-sized groups.
• Notice that this makes the median the 50th percentile,
since it divides the data values into two equal halves, 50
per cent above the median and 50 per cent below.
Choosing the most appropriate measure
• How do you choose the most appropriate measure of
location for some given set of data?
• The main thing to remember is that the mean cannot be
used with ordinal data (because they are not real
numbers), and that the median can be used for both
ordinal and metric data (particularly when the latter is
skewed).
Type of variable Summary measure of location
Mode Median Mean
Nominal Yes Yes No
Ordinal Yes No No
Metric Discrete Yes Yes, if distribution Yes
Metric Continuous No Is markedly skewed Yes
Choosing an appropriate measure of location
Summary measures of spread
As well as a summary measure of location, a summary
measure of spread or dispersion can also be very useful.
There are three main measures in common use
• Range
• Interquartile range
• Standard Deviation
Range
• The range is the distance from the smallest value to the
largest. The range is not affected by skewness, but is
sensitive to the addition or removal of an outlier value. i.e,
the range of the 30 birth weights is (2.86 – 4.49 kg).
• The range is best written like this, rather than as the
single-valued difference, i.e. as 1.6 kg, in this example,
which is much less informative.
• The range can sometimes be misleading when there are
extremely high or low values.
The interquartile range (iqr)
• One solution to the problem of the sensitivity of the range to
extreme value (outliers) is to remove a quarter (25 %) of the
values off both ends of the distribution (which removes any
troublesome outliers), and then measure the range of the
remaining values. This distance is called the interquartile
range, or iqr.
• The interquartile range is not affected either by outliers or
skewness, but it does not use all of the information in the
data set since it ignores the bottom and top quarter of
values.
Standard Deviation
The Standard Deviation is a measure of how spread out
numbers are.
Its symbol is σ (the Greek letter sigma)
The formula is easy: it is the square root of the Variance.
So now you ask, "What is the Variance?“
Variance
The Variance is defined as:
The average of the squared differences from the Mean.
You and your friends have just measured the heights of
your dogs (in millimeters):
The heights (at the shoulders) are: 600mm, 470mm,
170mm, 430mm and 300mm.
Find out the Mean, the Variance, and the Standard
Deviation.
Your first step is to find the Mean
Mean =
600 + 470 +
170 + 430 +
300 =
1970
= 394
5 5
So the mean (average) height is 394 mm. Let's plot
this on the chart:
To calculate the Variance, take each difference, square it,
and then average the result:
Now we calculate each dog's difference from the Mean:
So, the Variance is 21,704.
And the Standard Deviation is just the square root of
Variance, so:
Standard Deviation: σ = √21,704 = 147.32... = 147
(to the nearest mm)
And the good thing about the Standard Deviation is that
it is useful. Now we can show which heights are within
one Standard Deviation (147mm) of the Mean
So, using the Standard Deviation we have a "standard"
way of knowing what is normal, and what is extra large
or extra small.
• The smaller this mean distance is, the narrower the
spread of values must be, and vice versa.
• This idea is the basis for what is known as the standard
deviation, or SD
Type of variable Summary measure of location
Range Interquartile range Standard deviation
Nominal No No No
Ordinal Yes Yes No
Metric Yes Yes, if skewed Yes
Choosing an appropriate measure of spread
Thank You

Weitere ähnliche Inhalte

Was ist angesagt?

3.1 measures of central tendency
3.1 measures of central tendency3.1 measures of central tendency
3.1 measures of central tendency
leblance
 
MEASURES OF CENTRAL TENDENCY AND MEASURES OF DISPERSION
MEASURES OF CENTRAL TENDENCY AND  MEASURES OF DISPERSION MEASURES OF CENTRAL TENDENCY AND  MEASURES OF DISPERSION
MEASURES OF CENTRAL TENDENCY AND MEASURES OF DISPERSION
Tanya Singla
 
Choosing the best measure of central tendency
Choosing the best measure of central tendencyChoosing the best measure of central tendency
Choosing the best measure of central tendency
bujols
 
Applications of mean ,mode & median
Applications of mean ,mode & medianApplications of mean ,mode & median
Applications of mean ,mode & median
Anagha Deshpande
 

Was ist angesagt? (20)

Basic Descriptive Statistics
Basic Descriptive StatisticsBasic Descriptive Statistics
Basic Descriptive Statistics
 
3.1 measures of central tendency
3.1 measures of central tendency3.1 measures of central tendency
3.1 measures of central tendency
 
Descriptive statistics ii
Descriptive statistics iiDescriptive statistics ii
Descriptive statistics ii
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Slideshare notes about measures of central tendancy(mean,median and mode)
Slideshare notes about measures of central tendancy(mean,median and mode)Slideshare notes about measures of central tendancy(mean,median and mode)
Slideshare notes about measures of central tendancy(mean,median and mode)
 
3 measures of central dendency
3  measures of central dendency3  measures of central dendency
3 measures of central dendency
 
2. chapter ii(analyz)
2. chapter ii(analyz)2. chapter ii(analyz)
2. chapter ii(analyz)
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Introduction to Descriptive Statistics
Introduction to Descriptive StatisticsIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics
 
MEASURES OF CENTRAL TENDENCY AND MEASURES OF DISPERSION
MEASURES OF CENTRAL TENDENCY AND  MEASURES OF DISPERSION MEASURES OF CENTRAL TENDENCY AND  MEASURES OF DISPERSION
MEASURES OF CENTRAL TENDENCY AND MEASURES OF DISPERSION
 
Measure of central tendency(0039)
Measure of central tendency(0039)Measure of central tendency(0039)
Measure of central tendency(0039)
 
Choosing the best measure of central tendency
Choosing the best measure of central tendencyChoosing the best measure of central tendency
Choosing the best measure of central tendency
 
Measure of Central Tendency
Measure of Central TendencyMeasure of Central Tendency
Measure of Central Tendency
 
Applications of mean ,mode & median
Applications of mean ,mode & medianApplications of mean ,mode & median
Applications of mean ,mode & median
 
Measures of Central tendency
Measures of Central tendencyMeasures of Central tendency
Measures of Central tendency
 
Presentation on "Measure of central tendency"
Presentation on "Measure of central tendency"Presentation on "Measure of central tendency"
Presentation on "Measure of central tendency"
 
Normal distribtion curve
Normal distribtion curveNormal distribtion curve
Normal distribtion curve
 
Statistics in research by dr. sudhir sahu
Statistics in research by dr. sudhir sahuStatistics in research by dr. sudhir sahu
Statistics in research by dr. sudhir sahu
 
Frequency distribution, central tendency, measures of dispersion
Frequency distribution, central tendency, measures of dispersionFrequency distribution, central tendency, measures of dispersion
Frequency distribution, central tendency, measures of dispersion
 

Andere mochten auch

Common Errors in Statistical Thinking
Common Errors in Statistical ThinkingCommon Errors in Statistical Thinking
Common Errors in Statistical Thinking
aprofitt
 
Standard Deviation
Standard DeviationStandard Deviation
Standard Deviation
pwheeles
 
Standard deviation (3)
Standard deviation (3)Standard deviation (3)
Standard deviation (3)
Sonali Prasad
 
Type i and type ii errors
Type i and type ii errorsType i and type ii errors
Type i and type ii errors
p24ssp
 
An introduction to qualitative research
An introduction to qualitative researchAn introduction to qualitative research
An introduction to qualitative research
Najibullah Safi
 
Relative and Atribute Risk
Relative and Atribute RiskRelative and Atribute Risk
Relative and Atribute Risk
Tauseef Jawaid
 
Quantitative And Qualitative Research
Quantitative And Qualitative ResearchQuantitative And Qualitative Research
Quantitative And Qualitative Research
doha07
 
Standard Deviation and Variance
Standard Deviation and VarianceStandard Deviation and Variance
Standard Deviation and Variance
Jufil Hombria
 

Andere mochten auch (20)

Common Errors in Statistical Thinking
Common Errors in Statistical ThinkingCommon Errors in Statistical Thinking
Common Errors in Statistical Thinking
 
Errors in Statistical Survey
Errors in Statistical SurveyErrors in Statistical Survey
Errors in Statistical Survey
 
Statistics three
Statistics threeStatistics three
Statistics three
 
Statistics
StatisticsStatistics
Statistics
 
Propteties of Standard Deviation
Propteties of Standard DeviationPropteties of Standard Deviation
Propteties of Standard Deviation
 
What does an odds ratio or relative risk mean?
What does an odds ratio or relative risk mean? What does an odds ratio or relative risk mean?
What does an odds ratio or relative risk mean?
 
Epidemiology lecture3 incidence
Epidemiology lecture3 incidenceEpidemiology lecture3 incidence
Epidemiology lecture3 incidence
 
Epidemiology lecture 2 measuring disease frequency
Epidemiology lecture 2 measuring disease frequencyEpidemiology lecture 2 measuring disease frequency
Epidemiology lecture 2 measuring disease frequency
 
Odds ratio
Odds ratioOdds ratio
Odds ratio
 
Incidence And Prevalence
Incidence And PrevalenceIncidence And Prevalence
Incidence And Prevalence
 
Standard Deviation
Standard DeviationStandard Deviation
Standard Deviation
 
Errors in research
Errors in researchErrors in research
Errors in research
 
Standard deviation (3)
Standard deviation (3)Standard deviation (3)
Standard deviation (3)
 
Type i and type ii errors
Type i and type ii errorsType i and type ii errors
Type i and type ii errors
 
An introduction to qualitative research
An introduction to qualitative researchAn introduction to qualitative research
An introduction to qualitative research
 
Relative and Atribute Risk
Relative and Atribute RiskRelative and Atribute Risk
Relative and Atribute Risk
 
Qualitative Research
Qualitative ResearchQualitative Research
Qualitative Research
 
Quantitative And Qualitative Research
Quantitative And Qualitative ResearchQuantitative And Qualitative Research
Quantitative And Qualitative Research
 
Measures Of Association
Measures Of AssociationMeasures Of Association
Measures Of Association
 
Standard Deviation and Variance
Standard Deviation and VarianceStandard Deviation and Variance
Standard Deviation and Variance
 

Ähnlich wie Statistics four

PARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptxPARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptx
DrLasya
 
statisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.pptstatisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.ppt
voore ajay
 
Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
zerihunnana
 
Statistics and permeability engineering reports
Statistics and permeability engineering reportsStatistics and permeability engineering reports
Statistics and permeability engineering reports
wwwmostafalaith99
 
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docxANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
cullenrjzsme
 

Ähnlich wie Statistics four (20)

PARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptxPARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptx
 
Introduction to Statistics2312.ppt
Introduction to Statistics2312.pptIntroduction to Statistics2312.ppt
Introduction to Statistics2312.ppt
 
Introduction to Statistics23122223.ppt
Introduction to Statistics23122223.pptIntroduction to Statistics23122223.ppt
Introduction to Statistics23122223.ppt
 
Data Display and Summary
Data Display and SummaryData Display and Summary
Data Display and Summary
 
ststs nw.pptx
ststs nw.pptxststs nw.pptx
ststs nw.pptx
 
Ch2 Data Description
Ch2 Data DescriptionCh2 Data Description
Ch2 Data Description
 
Basic statisctis -Anandh Shankar
Basic statisctis -Anandh ShankarBasic statisctis -Anandh Shankar
Basic statisctis -Anandh Shankar
 
Presentation1
Presentation1Presentation1
Presentation1
 
statisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.pptstatisticsintroductionofbusinessstats.ppt
statisticsintroductionofbusinessstats.ppt
 
Ch5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptxCh5-quantitative-data analysis.pptx
Ch5-quantitative-data analysis.pptx
 
MEASURE-OF-VARIABILITY- for students. Ppt
MEASURE-OF-VARIABILITY- for students. PptMEASURE-OF-VARIABILITY- for students. Ppt
MEASURE-OF-VARIABILITY- for students. Ppt
 
Statr sessions 4 to 6
Statr sessions 4 to 6Statr sessions 4 to 6
Statr sessions 4 to 6
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Statistics and permeability engineering reports
Statistics and permeability engineering reportsStatistics and permeability engineering reports
Statistics and permeability engineering reports
 
Medical Statistics.ppt
Medical Statistics.pptMedical Statistics.ppt
Medical Statistics.ppt
 
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docxANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
 
Statistical Methods in Research
Statistical Methods in ResearchStatistical Methods in Research
Statistical Methods in Research
 
Descriptive Analysis.pptx
Descriptive Analysis.pptxDescriptive Analysis.pptx
Descriptive Analysis.pptx
 
Basic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptxBasic Statistical Concepts in Machine Learning.pptx
Basic Statistical Concepts in Machine Learning.pptx
 
template.pptx
template.pptxtemplate.pptx
template.pptx
 

Mehr von Mohamed Hefny (9)

Sacroiliitis
SacroiliitisSacroiliitis
Sacroiliitis
 
Fibromyalgia,misconceptions
Fibromyalgia,misconceptionsFibromyalgia,misconceptions
Fibromyalgia,misconceptions
 
Statistics five
Statistics fiveStatistics five
Statistics five
 
ESWT in musculoskeletal disorders
ESWT in musculoskeletal disordersESWT in musculoskeletal disorders
ESWT in musculoskeletal disorders
 
Statistics two
Statistics twoStatistics two
Statistics two
 
Statistics 1
Statistics 1Statistics 1
Statistics 1
 
Adhesive capsulitis
Adhesive capsulitisAdhesive capsulitis
Adhesive capsulitis
 
Gout management evolution
Gout management evolutionGout management evolution
Gout management evolution
 
Hypermobility syndromes
Hypermobility syndromesHypermobility syndromes
Hypermobility syndromes
 

Kürzlich hochgeladen

Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
chetankumar9855
 

Kürzlich hochgeladen (20)

Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
Models Call Girls In Hyderabad 9630942363 Hyderabad Call Girl & Hyderabad Esc...
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
 
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 8250077686 Top Class Call Girl Service Available
 
Kollam call girls Mallu aunty service 7877702510
Kollam call girls Mallu aunty service 7877702510Kollam call girls Mallu aunty service 7877702510
Kollam call girls Mallu aunty service 7877702510
 
Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service Available
Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service AvailableCall Girls Jaipur Just Call 9521753030 Top Class Call Girl Service Available
Call Girls Jaipur Just Call 9521753030 Top Class Call Girl Service Available
 
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
 
Andheri East ^ (Genuine) Escort Service Mumbai ₹7.5k Pick Up & Drop With Cash...
Andheri East ^ (Genuine) Escort Service Mumbai ₹7.5k Pick Up & Drop With Cash...Andheri East ^ (Genuine) Escort Service Mumbai ₹7.5k Pick Up & Drop With Cash...
Andheri East ^ (Genuine) Escort Service Mumbai ₹7.5k Pick Up & Drop With Cash...
 
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Ahmedabad Just Call 9630942363 Top Class Call Girl Service Available
 
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
8980367676 Call Girls In Ahmedabad Escort Service Available 24×7 In Ahmedabad
 
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
Call Girls in Delhi Triveni Complex Escort Service(🔝))/WhatsApp 97111⇛47426
 
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
Call Girl In Pune 👉 Just CALL ME: 9352988975 💋 Call Out Call Both With High p...
 
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
VIP Hyderabad Call Girls Bahadurpally 7877925207 ₹5000 To 25K With AC Room 💚😋
 
Call Girls Amritsar Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Amritsar Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Amritsar Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Amritsar Just Call 8250077686 Top Class Call Girl Service Available
 
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service AvailableCall Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
 
Low Rate Call Girls Bangalore {7304373326} ❤️VVIP NISHA Call Girls in Bangalo...
Low Rate Call Girls Bangalore {7304373326} ❤️VVIP NISHA Call Girls in Bangalo...Low Rate Call Girls Bangalore {7304373326} ❤️VVIP NISHA Call Girls in Bangalo...
Low Rate Call Girls Bangalore {7304373326} ❤️VVIP NISHA Call Girls in Bangalo...
 
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
Andheri East ) Call Girls in Mumbai Phone No 9004268417 Elite Escort Service ...
 
Call Girls Mysore Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Mysore Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Mysore Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Mysore Just Call 8250077686 Top Class Call Girl Service Available
 
Top Rated Call Girls Kerala ☎ 8250092165👄 Delivery in 20 Mins Near Me
Top Rated Call Girls Kerala ☎ 8250092165👄 Delivery in 20 Mins Near MeTop Rated Call Girls Kerala ☎ 8250092165👄 Delivery in 20 Mins Near Me
Top Rated Call Girls Kerala ☎ 8250092165👄 Delivery in 20 Mins Near Me
 
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
Call Girls Service Jaipur {8445551418} ❤️VVIP BHAWNA Call Girl in Jaipur Raja...
 
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
 

Statistics four

  • 2.
  • 3. Describing data with numeric summary values
  • 4. Learning objectives 1. Explain what prevalence and incidence are. 2. Explain what a summary measure of location is, and show that you understand the meaning of, and the difference between, the mode, the median and the mean. 3. Be able to calculate the mode, median and mean for a set of values. 4. Explain what a percentile is, and calculate any given percentile value. 5. Explain what a summary measure of spread is, and show that you understand the difference between, and can calculate, the range, the interquartile range and the standard deviation.
  • 5.
  • 6. Numbers, percentages and proportions • When you present the results of an investigation, you will almost certainly need to give the numbers of the subjects involved; and perhaps also provide values for percentages. • It is usually categorical data that are summarized with a value for percentage or proportion.
  • 7. Prevalence and the incidence rate When suitable we can also summarize data by providing a value for the prevalence or the incidence rate of some condition. • Prevalence of a disease is the number of existing cases in some population at a given time. In practice, the period prevalence is more often used. • i.e. the prevalence of Breast Cancer in women in a place in 2010 was 3.1%. The prevalence figure will include existing cases, i.e. those who contracted the disease before 2010, and still had it, as well as those first getting the disease in 2010.
  • 8.
  • 9. Incidence or inception rate of a disease is the number of new cases occurring per 1000, or per 10 000, of the population , during a given period, usually 12 months.
  • 10. Summary measures of location A summary measure of location is a value around which most of the data values tend to congregate or center. There are three measures of location • Mode • Median • Mean
  • 11. Mode • The mode is that category or value in the data that has the highest frequency (i.e. occurs the most often). In this sense, the mode is a measure of common-ness or typical-ness. • The mode is not particularly useful with metric continuous data where no two values may be the same. The other deficiency of this measure is that there may be more than one mode in a set of data. Patients Number of inhaler use in last 24 hours A 5 B 12 C 10
  • 12. Median • If we arrange the data in ascending order of size, the median is the middlemost number in the set. Thus, half of the values will be equal to or less than the median value, and half equal to or above it. The median is thus a measure of central-ness. • i.e. Age (in ascending order of years), for 5 individuals: 30 31 32 33 35. The middle value is 32, so the median age for these 5 people is 32 years.
  • 13. • Another way of determining the value of the median, If you have “n” values arranged in ascending order, then: the median = 1 / 2(n + 1)th value. • i.e., if the ages of six people are: 30 31 32 33 35 36, then n = 6, therefore: • 1 / 2(n + 1) = 1 / 2 × (6 + 1) = 1 / 2 × 7 = 3.5 • Then, median is the 3.5th value. That is, it is the value half way between the 3rd value of 32, and the 4th value of 33, or 32.5 years, which is the same result as before. • An advantage of the median is that it is not much affected by skewness in the distribution, or by the presence of outliers. However, it discards a lot of information, because it ignores most of the values, apart from those in the center of the distribution.
  • 14. Mean • The mean, or the arithmetic mean to give it its full name, is more commonly known as the average. • One advantage of the mean over the median is that it uses all of the information in the data set. • However, it is affected by skewness in the distribution, and by the presence of outliers in the data. • This may, on occasion, produce a mean that is not very representative of the general mass of the data. • Moreover, it cannot be used with ordinal data.
  • 15. Percentiles • A percentile (or a centile) is a measure used in statistics indicating the value below which a given percentage of observations in a group of observations fall. For example, the 20th percentile is the value (or score) below which 20 percent of the observations may be found. • Percentiles are the values which divide an ordered set of data into 100 equal-sized groups. • Notice that this makes the median the 50th percentile, since it divides the data values into two equal halves, 50 per cent above the median and 50 per cent below.
  • 16. Choosing the most appropriate measure • How do you choose the most appropriate measure of location for some given set of data? • The main thing to remember is that the mean cannot be used with ordinal data (because they are not real numbers), and that the median can be used for both ordinal and metric data (particularly when the latter is skewed). Type of variable Summary measure of location Mode Median Mean Nominal Yes Yes No Ordinal Yes No No Metric Discrete Yes Yes, if distribution Yes Metric Continuous No Is markedly skewed Yes Choosing an appropriate measure of location
  • 17. Summary measures of spread As well as a summary measure of location, a summary measure of spread or dispersion can also be very useful. There are three main measures in common use • Range • Interquartile range • Standard Deviation
  • 18. Range • The range is the distance from the smallest value to the largest. The range is not affected by skewness, but is sensitive to the addition or removal of an outlier value. i.e, the range of the 30 birth weights is (2.86 – 4.49 kg). • The range is best written like this, rather than as the single-valued difference, i.e. as 1.6 kg, in this example, which is much less informative. • The range can sometimes be misleading when there are extremely high or low values.
  • 19. The interquartile range (iqr) • One solution to the problem of the sensitivity of the range to extreme value (outliers) is to remove a quarter (25 %) of the values off both ends of the distribution (which removes any troublesome outliers), and then measure the range of the remaining values. This distance is called the interquartile range, or iqr. • The interquartile range is not affected either by outliers or skewness, but it does not use all of the information in the data set since it ignores the bottom and top quarter of values.
  • 20.
  • 21. Standard Deviation The Standard Deviation is a measure of how spread out numbers are. Its symbol is σ (the Greek letter sigma) The formula is easy: it is the square root of the Variance. So now you ask, "What is the Variance?“ Variance The Variance is defined as: The average of the squared differences from the Mean.
  • 22. You and your friends have just measured the heights of your dogs (in millimeters): The heights (at the shoulders) are: 600mm, 470mm, 170mm, 430mm and 300mm. Find out the Mean, the Variance, and the Standard Deviation. Your first step is to find the Mean Mean = 600 + 470 + 170 + 430 + 300 = 1970 = 394 5 5
  • 23. So the mean (average) height is 394 mm. Let's plot this on the chart:
  • 24. To calculate the Variance, take each difference, square it, and then average the result: Now we calculate each dog's difference from the Mean: So, the Variance is 21,704.
  • 25. And the Standard Deviation is just the square root of Variance, so: Standard Deviation: σ = √21,704 = 147.32... = 147 (to the nearest mm) And the good thing about the Standard Deviation is that it is useful. Now we can show which heights are within one Standard Deviation (147mm) of the Mean So, using the Standard Deviation we have a "standard" way of knowing what is normal, and what is extra large or extra small.
  • 26. • The smaller this mean distance is, the narrower the spread of values must be, and vice versa. • This idea is the basis for what is known as the standard deviation, or SD
  • 27.
  • 28. Type of variable Summary measure of location Range Interquartile range Standard deviation Nominal No No No Ordinal Yes Yes No Metric Yes Yes, if skewed Yes Choosing an appropriate measure of spread
  • 29.