SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Downloaden Sie, um offline zu lesen
Introduction
Summary Statistics
Definition
Summary statistics are used to summarize a
set of observations in order to
communicate as much as information
about the data as possible. It is part of
descriptive statistics and are used to
basically summarize or describe a set of
observations.
Rupak Roy
Example
The weight of the population are
45 kg
57kg
72 kg
52 kg
Now what we want here is the summary of
weight of the population , we can say it is the
average weight of the population is 56.5 kg and
now we can describe the population in the
simplest way as possible.
Rupak Roy
Types
Summary statistics
Measures of Central
Tendency
1 . Mean
2 . Median
3 . Mode
5 . Geometric Mean
Measures of
Dispersion
1. Standard
Deviation
2. Variance
3. Interquartile
Range
Others
1. Co efficient
2. Skewness
3. Kurtosis
4. Probability
Distributions.
5. Distribution plot
Rupak Roy
Definition
 Measures of central tendency : is the value that describes
which group of data clusters around a central value. In
simple words , it is a way to describe the center of a data
set. Again what is center of data ? A single number that
summarizes the entire dataset using techniques such as
mean/average or median of the dataset.
 Measures of Dispersion: “dispersion (also
called variability, scatter, or spread) is the extent to which
a distribution of data is stretched or squeezed.”
Here in the graph we can see the
distribution of data (assume population)
is more stretched at the right side
ranging from 50 to 80
Measures of Central Tendency
1. Mean : is the average of observations. Most effective
when data is not heavily skewed.
2. Median: represents the middle value of the dataset.
Useful for skewed data.
We will talk about skewed data in the upcoming
slides.
3. Mode: means max no of times the data has occurred.
4. Geometric mean: nth root of a product of n numbers.
It is used when we want to get the average rate of the
event and the event rate is determined by multiplication.
For example growth of a bank account per year in a
ABC bank is calculated by geometric mean since the
growth event rate is determined by multiplying the
amount of a bank account by the percentage of growth.
then we use geometric mean.
Rupak Roy
 Formula for calculating Geometric Mean
GM =
example: Geometric Mean of 23,56,66 ?
3 23 * 56*66
3 85008 = 43.9696761which means 3times of 43.9696761
is 85008
Note:
if one of the observation in the event is zero , Geometric
Mean becomes Zero and also it doesn’t works with
negative numbers like -1 , -4 , -5 and so on.
Rupak Roy
Calculation of Mode ; <- Delta
For ungrouped data = Max no of items
Example : 23,45,76,33,54,33,76,33 Therefore Mode = 33
For grouped data = = {(L + Delta 1) / Delta 1+Detal2 } * i
Where Delta 1 = f1 +f0
and Delta 2 = f1- f2
Nowadays, we don’t have to worry about the calculation, as in
any statistical software's like R, excel it will automatically calculate
the intense calculation for large amount of data but
for more in-depth information you can visit this website.
https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
Measures of dispersion
Standard Deviation is basically a measure of how near or far the
observations are from the mean.
Variance: the fact or quality of being different , divergent or
inconsistent. A value of zero means that there is no variability , all the
values in the data set are the same.
Interquartile Range: is a measure of variability ,
by dividing a data set into parts that is quartiles .
Say
Q1 is the middle value in the first half of the data set.
Q2 is the median value .
Q3 is the middle value in the second half of the
rank-ordered data set.
There interquartile range = Q3 – Q1
Skewness – refers to the lack of symmetry or imbalance in data
distribution.
In a symmetric distribution the data is
normally distributed where mean,
median, mode is at the same point.
However in real life data is never perfectly
distributed, hence we call it skewed data.
If the Left side has longer tail then the mass
distribution of data is concentrated on the right
side which is known as negatively skewed.
If the Right side has longer tail then the
mass distribution of data is
concentrated on the left side is
known as positive skewed.
Here is the summary of all the skewness as shown in the figure below.
Example (skewed data)
Temp(*c)
10
40
35
33
35
Mean = 153/5 = 30.6, if we apply mean is 30.6
which is incorrect since we can see maximum
number of values are above 35.
So we have to use median For Ungrouped
data ((n+1)/2)th
That will be ((5+1)/2)th = 6/2 = 3
i.e. 3th term ie 35.
For grouped data:
where L, lower class boundary of the group containing the group.
B, Cumulative frequency of the groups
G , Frequency of the median group
W , width/Range of the group
Again, we don’t have to worry about the calculation, as in any statistical software's like R
, excel it will automatically calculate the intense calculation for large amount of data
but for more in-depth information you can visit this website.
https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
Kurtosis : is a measure of whether data are peaked or flat relative to
normal distribution
(+) Leptokurtic
(-) PlatyKurtic
(0) Meskurtic
(+) Leptokurtic
This means the distribution is more clustered near the mean and has a
relativity less standard deviation
(-) PlatyKurtic
Where the distribution is less clustered around the mean and a standard
deviation more then Leptokurtic
(0) Meskurtic is typically measured with respect to the normal
distribution. Meskurtic has tails similar to normal distribution i.e neither
high nor low, rather it is consider to be a baseline for the other two’s.
 Now how to check the data is skewed or not
in Excel:
=skew(select the range of values/numbers)
=skew(10.24,9.48……….-0.42,-0.95)
= - 0.27 means Negatively skewed.
And to check the Kurtosis in Excel
=kurt(select the values/numbers)
=kurt(10.24,9.48……….-0.42,-0.95)
= -1.6 means it is PlatyKurtic
Recap
What we have learned ?
Measures of central tendency,
Measures of dispersion,
Measure of risk,
Next we will see how to compute this theory in
practical and analyze any data using our
everyday simple tools like Excel.
Rupak Roy
To be continued ………

Weitere ähnliche Inhalte

Was ist angesagt?

data analysis techniques and statistical softwares
data analysis techniques and statistical softwaresdata analysis techniques and statistical softwares
data analysis techniques and statistical softwaresDr.ammara khakwani
 
Introduction To Survival Analysis
Introduction To Survival AnalysisIntroduction To Survival Analysis
Introduction To Survival Analysisfedericorotolo
 
Basics of Systematic Review and Meta-analysis: Part 2
Basics of Systematic Review and Meta-analysis: Part 2Basics of Systematic Review and Meta-analysis: Part 2
Basics of Systematic Review and Meta-analysis: Part 2Rizwan S A
 
Statistical analysis using spss
Statistical analysis using spssStatistical analysis using spss
Statistical analysis using spssjpcagphil
 
Systematic ranom sampling for slide share
Systematic ranom sampling for slide shareSystematic ranom sampling for slide share
Systematic ranom sampling for slide shareIVenkatReddyGaaru
 
Dr digs central tendency
Dr digs central tendencyDr digs central tendency
Dr digs central tendencydrdig
 
INFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONINFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONJohn Labrador
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpointkellula
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsAileen Balbido
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statisticsKapil Dev Ghante
 
Measure of central tendency
Measure of central tendencyMeasure of central tendency
Measure of central tendencymauitaylor007
 
Basic Descriptive statistics
Basic Descriptive statisticsBasic Descriptive statistics
Basic Descriptive statisticsAjendra Sharma
 

Was ist angesagt? (20)

data analysis techniques and statistical softwares
data analysis techniques and statistical softwaresdata analysis techniques and statistical softwares
data analysis techniques and statistical softwares
 
Descriptive statistics ii
Descriptive statistics iiDescriptive statistics ii
Descriptive statistics ii
 
Introduction To Survival Analysis
Introduction To Survival AnalysisIntroduction To Survival Analysis
Introduction To Survival Analysis
 
Basics of Systematic Review and Meta-analysis: Part 2
Basics of Systematic Review and Meta-analysis: Part 2Basics of Systematic Review and Meta-analysis: Part 2
Basics of Systematic Review and Meta-analysis: Part 2
 
Normality
NormalityNormality
Normality
 
Statistical analysis using spss
Statistical analysis using spssStatistical analysis using spss
Statistical analysis using spss
 
Systematic ranom sampling for slide share
Systematic ranom sampling for slide shareSystematic ranom sampling for slide share
Systematic ranom sampling for slide share
 
Dr digs central tendency
Dr digs central tendencyDr digs central tendency
Dr digs central tendency
 
(Manual spss)
(Manual spss)(Manual spss)
(Manual spss)
 
Inferential statistics
Inferential statisticsInferential statistics
Inferential statistics
 
INFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTIONINFERENTIAL STATISTICS: AN INTRODUCTION
INFERENTIAL STATISTICS: AN INTRODUCTION
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Inferential statistics powerpoint
Inferential statistics powerpointInferential statistics powerpoint
Inferential statistics powerpoint
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Central tendency
Central tendencyCentral tendency
Central tendency
 
Measure of central tendency
Measure of central tendencyMeasure of central tendency
Measure of central tendency
 
Basic Descriptive statistics
Basic Descriptive statisticsBasic Descriptive statistics
Basic Descriptive statistics
 
biostatistics basic
biostatistics basic biostatistics basic
biostatistics basic
 
Descriptive statistics i
Descriptive statistics iDescriptive statistics i
Descriptive statistics i
 

Ähnlich wie Summary statistics

Types of Statistics
Types of Statistics Types of Statistics
Types of Statistics Rupak Roy
 
1 descriptive statistics
1 descriptive statistics1 descriptive statistics
1 descriptive statisticsSanu Kumar
 
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptxShashank Mishra
 
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...EqraBaig
 
Statistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxStatistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxYollyCalamba
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendencyMmedsc Hahm
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of datadrasifk
 
Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematicshktripathy
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyPrithwis Mukerjee
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyPrithwis Mukerjee
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptxVanmala Buchke
 
3. measures of central tendency
3. measures of central tendency3. measures of central tendency
3. measures of central tendencyrenz50
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mininghktripathy
 
Central tendancy 4
Central tendancy 4Central tendancy 4
Central tendancy 4Sundar B N
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxAnusuya123
 
Chapter 4 MMW.pdf
Chapter 4 MMW.pdfChapter 4 MMW.pdf
Chapter 4 MMW.pdfRaRaRamirez
 
Statistics digital text book
Statistics digital text bookStatistics digital text book
Statistics digital text bookdeepuplr
 

Ähnlich wie Summary statistics (20)

Types of Statistics
Types of Statistics Types of Statistics
Types of Statistics
 
1 descriptive statistics
1 descriptive statistics1 descriptive statistics
1 descriptive statistics
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
 
Descriptive Statistics.pptx
Descriptive Statistics.pptxDescriptive Statistics.pptx
Descriptive Statistics.pptx
 
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
 
Statistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptxStatistics (GE 4 CLASS).pptx
Statistics (GE 4 CLASS).pptx
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of data
 
Lect 3 background mathematics
Lect 3 background mathematicsLect 3 background mathematics
Lect 3 background mathematics
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
 
3. measures of central tendency
3. measures of central tendency3. measures of central tendency
3. measures of central tendency
 
Lect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data MiningLect 3 background mathematics for Data Mining
Lect 3 background mathematics for Data Mining
 
Central tendancy 4
Central tendancy 4Central tendancy 4
Central tendancy 4
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
 
SUMMARY MEASURES.pdf
SUMMARY MEASURES.pdfSUMMARY MEASURES.pdf
SUMMARY MEASURES.pdf
 
Basic statisctis -Anandh Shankar
Basic statisctis -Anandh ShankarBasic statisctis -Anandh Shankar
Basic statisctis -Anandh Shankar
 
Chapter 4 MMW.pdf
Chapter 4 MMW.pdfChapter 4 MMW.pdf
Chapter 4 MMW.pdf
 
Statistics digital text book
Statistics digital text bookStatistics digital text book
Statistics digital text book
 

Mehr von Rupak Roy

Hierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLPHierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLPRupak Roy
 
Clustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLPClustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLPRupak Roy
 
Network Analysis - NLP
Network Analysis  - NLPNetwork Analysis  - NLP
Network Analysis - NLPRupak Roy
 
Topic Modeling - NLP
Topic Modeling - NLPTopic Modeling - NLP
Topic Modeling - NLPRupak Roy
 
Sentiment Analysis Practical Steps
Sentiment Analysis Practical StepsSentiment Analysis Practical Steps
Sentiment Analysis Practical StepsRupak Roy
 
NLP - Sentiment Analysis
NLP - Sentiment AnalysisNLP - Sentiment Analysis
NLP - Sentiment AnalysisRupak Roy
 
Text Mining using Regular Expressions
Text Mining using Regular ExpressionsText Mining using Regular Expressions
Text Mining using Regular ExpressionsRupak Roy
 
Introduction to Text Mining
Introduction to Text Mining Introduction to Text Mining
Introduction to Text Mining Rupak Roy
 
Apache Hbase Architecture
Apache Hbase ArchitectureApache Hbase Architecture
Apache Hbase ArchitectureRupak Roy
 
Introduction to Hbase
Introduction to Hbase Introduction to Hbase
Introduction to Hbase Rupak Roy
 
Apache Hive Table Partition and HQL
Apache Hive Table Partition and HQLApache Hive Table Partition and HQL
Apache Hive Table Partition and HQLRupak Roy
 
Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export Rupak Roy
 
Introductive to Hive
Introductive to Hive Introductive to Hive
Introductive to Hive Rupak Roy
 
Scoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMSScoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMSRupak Roy
 
Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode Rupak Roy
 
Introduction to scoop and its functions
Introduction to scoop and its functionsIntroduction to scoop and its functions
Introduction to scoop and its functionsRupak Roy
 
Introduction to Flume
Introduction to FlumeIntroduction to Flume
Introduction to FlumeRupak Roy
 
Apache Pig Relational Operators - II
Apache Pig Relational Operators - II Apache Pig Relational Operators - II
Apache Pig Relational Operators - II Rupak Roy
 
Passing Parameters using File and Command Line
Passing Parameters using File and Command LinePassing Parameters using File and Command Line
Passing Parameters using File and Command LineRupak Roy
 
Apache PIG Relational Operations
Apache PIG Relational Operations Apache PIG Relational Operations
Apache PIG Relational Operations Rupak Roy
 

Mehr von Rupak Roy (20)

Hierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLPHierarchical Clustering - Text Mining/NLP
Hierarchical Clustering - Text Mining/NLP
 
Clustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLPClustering K means and Hierarchical - NLP
Clustering K means and Hierarchical - NLP
 
Network Analysis - NLP
Network Analysis  - NLPNetwork Analysis  - NLP
Network Analysis - NLP
 
Topic Modeling - NLP
Topic Modeling - NLPTopic Modeling - NLP
Topic Modeling - NLP
 
Sentiment Analysis Practical Steps
Sentiment Analysis Practical StepsSentiment Analysis Practical Steps
Sentiment Analysis Practical Steps
 
NLP - Sentiment Analysis
NLP - Sentiment AnalysisNLP - Sentiment Analysis
NLP - Sentiment Analysis
 
Text Mining using Regular Expressions
Text Mining using Regular ExpressionsText Mining using Regular Expressions
Text Mining using Regular Expressions
 
Introduction to Text Mining
Introduction to Text Mining Introduction to Text Mining
Introduction to Text Mining
 
Apache Hbase Architecture
Apache Hbase ArchitectureApache Hbase Architecture
Apache Hbase Architecture
 
Introduction to Hbase
Introduction to Hbase Introduction to Hbase
Introduction to Hbase
 
Apache Hive Table Partition and HQL
Apache Hive Table Partition and HQLApache Hive Table Partition and HQL
Apache Hive Table Partition and HQL
 
Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export Installing Apache Hive, internal and external table, import-export
Installing Apache Hive, internal and external table, import-export
 
Introductive to Hive
Introductive to Hive Introductive to Hive
Introductive to Hive
 
Scoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMSScoop Job, import and export to RDBMS
Scoop Job, import and export to RDBMS
 
Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode Apache Scoop - Import with Append mode and Last Modified mode
Apache Scoop - Import with Append mode and Last Modified mode
 
Introduction to scoop and its functions
Introduction to scoop and its functionsIntroduction to scoop and its functions
Introduction to scoop and its functions
 
Introduction to Flume
Introduction to FlumeIntroduction to Flume
Introduction to Flume
 
Apache Pig Relational Operators - II
Apache Pig Relational Operators - II Apache Pig Relational Operators - II
Apache Pig Relational Operators - II
 
Passing Parameters using File and Command Line
Passing Parameters using File and Command LinePassing Parameters using File and Command Line
Passing Parameters using File and Command Line
 
Apache PIG Relational Operations
Apache PIG Relational Operations Apache PIG Relational Operations
Apache PIG Relational Operations
 

Kürzlich hochgeladen

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfErwinPantujan2
 

Kürzlich hochgeladen (20)

ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdfVirtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
Virtual-Orientation-on-the-Administration-of-NATG12-NATG6-and-ELLNA.pdf
 

Summary statistics

  • 2. Definition Summary statistics are used to summarize a set of observations in order to communicate as much as information about the data as possible. It is part of descriptive statistics and are used to basically summarize or describe a set of observations. Rupak Roy
  • 3. Example The weight of the population are 45 kg 57kg 72 kg 52 kg Now what we want here is the summary of weight of the population , we can say it is the average weight of the population is 56.5 kg and now we can describe the population in the simplest way as possible. Rupak Roy
  • 4. Types Summary statistics Measures of Central Tendency 1 . Mean 2 . Median 3 . Mode 5 . Geometric Mean Measures of Dispersion 1. Standard Deviation 2. Variance 3. Interquartile Range Others 1. Co efficient 2. Skewness 3. Kurtosis 4. Probability Distributions. 5. Distribution plot Rupak Roy
  • 5. Definition  Measures of central tendency : is the value that describes which group of data clusters around a central value. In simple words , it is a way to describe the center of a data set. Again what is center of data ? A single number that summarizes the entire dataset using techniques such as mean/average or median of the dataset.  Measures of Dispersion: “dispersion (also called variability, scatter, or spread) is the extent to which a distribution of data is stretched or squeezed.” Here in the graph we can see the distribution of data (assume population) is more stretched at the right side ranging from 50 to 80
  • 6. Measures of Central Tendency 1. Mean : is the average of observations. Most effective when data is not heavily skewed. 2. Median: represents the middle value of the dataset. Useful for skewed data. We will talk about skewed data in the upcoming slides. 3. Mode: means max no of times the data has occurred. 4. Geometric mean: nth root of a product of n numbers. It is used when we want to get the average rate of the event and the event rate is determined by multiplication. For example growth of a bank account per year in a ABC bank is calculated by geometric mean since the growth event rate is determined by multiplying the amount of a bank account by the percentage of growth. then we use geometric mean. Rupak Roy
  • 7.  Formula for calculating Geometric Mean GM = example: Geometric Mean of 23,56,66 ? 3 23 * 56*66 3 85008 = 43.9696761which means 3times of 43.9696761 is 85008 Note: if one of the observation in the event is zero , Geometric Mean becomes Zero and also it doesn’t works with negative numbers like -1 , -4 , -5 and so on. Rupak Roy
  • 8. Calculation of Mode ; <- Delta For ungrouped data = Max no of items Example : 23,45,76,33,54,33,76,33 Therefore Mode = 33 For grouped data = = {(L + Delta 1) / Delta 1+Detal2 } * i Where Delta 1 = f1 +f0 and Delta 2 = f1- f2 Nowadays, we don’t have to worry about the calculation, as in any statistical software's like R, excel it will automatically calculate the intense calculation for large amount of data but for more in-depth information you can visit this website. https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
  • 9. Measures of dispersion Standard Deviation is basically a measure of how near or far the observations are from the mean. Variance: the fact or quality of being different , divergent or inconsistent. A value of zero means that there is no variability , all the values in the data set are the same. Interquartile Range: is a measure of variability , by dividing a data set into parts that is quartiles . Say Q1 is the middle value in the first half of the data set. Q2 is the median value . Q3 is the middle value in the second half of the rank-ordered data set. There interquartile range = Q3 – Q1
  • 10. Skewness – refers to the lack of symmetry or imbalance in data distribution. In a symmetric distribution the data is normally distributed where mean, median, mode is at the same point. However in real life data is never perfectly distributed, hence we call it skewed data. If the Left side has longer tail then the mass distribution of data is concentrated on the right side which is known as negatively skewed.
  • 11. If the Right side has longer tail then the mass distribution of data is concentrated on the left side is known as positive skewed. Here is the summary of all the skewness as shown in the figure below.
  • 12. Example (skewed data) Temp(*c) 10 40 35 33 35 Mean = 153/5 = 30.6, if we apply mean is 30.6 which is incorrect since we can see maximum number of values are above 35. So we have to use median For Ungrouped data ((n+1)/2)th That will be ((5+1)/2)th = 6/2 = 3 i.e. 3th term ie 35. For grouped data: where L, lower class boundary of the group containing the group. B, Cumulative frequency of the groups G , Frequency of the median group W , width/Range of the group Again, we don’t have to worry about the calculation, as in any statistical software's like R , excel it will automatically calculate the intense calculation for large amount of data but for more in-depth information you can visit this website. https://www.mathsisfun.com/data/frequency-grouped-mean-median-mode.html
  • 13. Kurtosis : is a measure of whether data are peaked or flat relative to normal distribution (+) Leptokurtic (-) PlatyKurtic (0) Meskurtic (+) Leptokurtic This means the distribution is more clustered near the mean and has a relativity less standard deviation (-) PlatyKurtic Where the distribution is less clustered around the mean and a standard deviation more then Leptokurtic (0) Meskurtic is typically measured with respect to the normal distribution. Meskurtic has tails similar to normal distribution i.e neither high nor low, rather it is consider to be a baseline for the other two’s.
  • 14.  Now how to check the data is skewed or not in Excel: =skew(select the range of values/numbers) =skew(10.24,9.48……….-0.42,-0.95) = - 0.27 means Negatively skewed. And to check the Kurtosis in Excel =kurt(select the values/numbers) =kurt(10.24,9.48……….-0.42,-0.95) = -1.6 means it is PlatyKurtic
  • 15. Recap What we have learned ? Measures of central tendency, Measures of dispersion, Measure of risk, Next we will see how to compute this theory in practical and analyze any data using our everyday simple tools like Excel. Rupak Roy
  • 16. To be continued ………