BASIC STATISTICS
BASICS OF STATISTICS
Definition: The science of collecting, presenting, analyzing, and reasonably
interpreting data.
Statistics offers a rigorous scientific method for gaining insight into data. For
example, suppose we measure the weight of 100 patients in a study. With so
many measurements, simply looking at the raw data does not give an informative
account. Statistics, however, can give an immediate overall picture of the data
through graphical presentation or numerical summarization, regardless of the
number of data points. Besides summarizing data, another important task of
statistics is to draw inferences and predict relationships between variables.
TYPES OF DATA
• Descriptive Statistics
  • Frequencies
  • Basic measurements
• Inferential Statistics
  • Hypothesis Testing
  • Correlation
  • Confidence Intervals
  • Significance Testing
  • Prediction
A TAXONOMY OF STATISTICS
STATISTICAL DESCRIPTION OF DATA
• Statistics describes a numeric set of data by its
  • Center
  • Variability
  • Shape
• Statistics describes a categorical set of data by
  • Frequency, percentage or proportion of each category
SOME DEFINITIONS
Variable - any characteristic of an individual or entity. A variable
can take different values for different individuals. Variables can be
categorical or quantitative. Per S. S. Stevens…
• Nominal - Categorical variables with no inherent order or ranking sequence,
such as names or classes (e.g., gender). Values may be written as numerals but
carry no numerical meaning (e.g., I, II, III). The only operation that can be
applied to nominal variables is enumeration (counting the members of each category).
• Ordinal - Variables with an inherent rank or order, e.g. mild, moderate, severe.
Values can be compared for equality, or for greater or less, but not for how much
greater or less.
• Interval - Values of the variable are ordered as in Ordinal and, additionally,
differences between values are meaningful; however, the scale has no absolute
zero point. Calendar dates and temperatures on the Fahrenheit scale are examples.
Addition and subtraction are meaningful operations, but multiplication and
division are not.
• Ratio - Variables with all properties of Interval plus an absolute, non-arbitrary
zero point, e.g. age, weight, temperature (Kelvin). Addition, subtraction,
multiplication, and division are all meaningful operations.
SOME DEFINITIONS
Distribution - (of a variable) tells us what values the variable takes
and how often it takes these values.
• Unimodal - having a single peak
• Bimodal - having two distinct peaks
• Symmetric - left and right half are mirror images.
FREQUENCY DISTRIBUTION
Consider a data set of 26 children of ages 1-6 years. Then the
frequency distribution of variable ‘age’ can be tabulated as
follows:
Frequency Distribution of Age
Age        1  2  3  4  5  6
Frequency  5  3  7  5  4  2
Grouped Frequency Distribution of Age:
Age Group  1-2  3-4  5-6
Frequency    8   12    6
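The two frequency tables for this slide can be reproduced with a short Python sketch. The raw list `ages` is an assumption: it is rebuilt from the tabulated counts, since the slide does not give the individual observations, and the helper `age_group` is a hypothetical name used only for illustration.

```python
from collections import Counter

# Assumed raw data: 26 ages rebuilt from the frequency table on the slide.
ages = [1] * 5 + [2] * 3 + [3] * 7 + [4] * 5 + [5] * 4 + [6] * 2

# Ungrouped frequency distribution: how often each age occurs.
frequency = dict(sorted(Counter(ages).items()))

def age_group(age):
    """Map an age to its class of width 2: 1-2, 3-4 or 5-6."""
    lower = age if age % 2 == 1 else age - 1
    return f"{lower}-{lower + 1}"

# Grouped frequency distribution: counts per age group.
grouped = dict(Counter(age_group(a) for a in ages))

print(frequency)
print(grouped)
```

Grouping loses detail (the count for age 3 alone is no longer visible) but gives a more compact table.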
CUMULATIVE FREQUENCY
Cumulative frequency of the data on the previous slide
Age                   1  2   3   4   5   6
Frequency             5  3   7   5   4   2
Cumulative Frequency  5  8  15  20  24  26

Age Group             1-2  3-4  5-6
Frequency               8   12    6
Cumulative Frequency    8   20   26
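A cumulative frequency is a running total of the frequencies. As a minimal sketch, the cumulative rows of both tables can be obtained with `itertools.accumulate` from the Python standard library; the variable names are illustrative only.

```python
from itertools import accumulate

# Frequencies for ages 1..6 and for the age groups 1-2, 3-4, 5-6.
frequencies = [5, 3, 7, 5, 4, 2]
grouped_frequencies = [8, 12, 6]

# Each entry is the sum of all frequencies up to and including that class.
cumulative = list(accumulate(frequencies))
grouped_cumulative = list(accumulate(grouped_frequencies))

print(cumulative)
print(grouped_cumulative)
```

The last cumulative value always equals the total number of observations, here 26.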
DATA PRESENTATION
There are two types of statistical presentation of data: graphical and numerical.
Graphical Presentation: We look for the overall pattern and for striking
deviations from that pattern. The overall pattern is usually described by the
shape, center, and spread of the data. An individual value that falls
outside the overall pattern is called an outlier.
Bar diagrams and pie charts are used for categorical variables.
Histograms, stem-and-leaf plots and box plots are used for numerical variables.
Data Presentation – Categorical Variable
Bar Diagram: Lists the categories and presents the percent or count of
individuals who fall in each category.
Treatment Group  Frequency  Proportion      Percent (%)
1                15         (15/60)=0.250   25.0
2                25         (25/60)=0.417   41.7
3                20         (20/60)=0.333   33.3
Total            60         1.00            100
[Figure 1: Bar Chart of Subjects in Treatment Groups. Horizontal axis: Treatment Group (1, 2, 3); vertical axis: Number of Subjects (0 to 30).]
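The proportion and percent columns of the treatment-group table follow directly from the counts. The sketch below recomputes them in Python; the dictionary `counts` simply restates the frequencies 15, 25 and 20 from the slide, and the other names are illustrative.

```python
# Number of subjects in each treatment group (from the slide).
counts = {1: 15, 2: 25, 3: 20}
total = sum(counts.values())

# Proportion = count / total; percent = 100 * proportion, rounded to 1 decimal.
proportions = {group: count / total for group, count in counts.items()}
percents = {group: round(100 * count / total, 1) for group, count in counts.items()}

for group in counts:
    print(group, counts[group], round(proportions[group], 3), percents[group])
```

Group 2, with 25 of 60 subjects, has the largest share (about 41.7%), and the proportions sum to 1.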
Data Presentation – Categorical Variable
Pie Chart: Lists the categories and presents the percent or count of
individuals who fall in each category.
[Figure 2: Pie Chart of Subjects in Treatment Groups. Slices: group 1, 25%; group 2, 42%; group 3, 33%.]
Treatment Group  Frequency  Proportion      Percent (%)
1                15         (15/60)=0.250   25.0
2                25         (25/60)=0.417   41.7
3                20         (20/60)=0.333   33.3
Total            60         1.00            100
GRAPHICAL PRESENTATION – NUMERICAL VARIABLE
Histogram: The overall pattern can be described by its shape, center,
and spread. The age distribution in Figure 3 is right skewed. The
center lies between 80 and 100. There are no outliers.
[Figure 3: Age Distribution (histogram). Horizontal axis: Age in Month (bins 40, 60, 80, 100, 120, 140, More); vertical axis: Number of Subjects (0 to 16).]
Summary statistics:
Mean                90.41666667
Standard Error      3.902649518
Median              84
Mode                84
Standard Deviation  30.22979318
Sample Variance     913.8403955
Kurtosis            -1.183899591
Skewness            0.389872725
Range               95
Minimum             48
Maximum             143
Sum                 5425
Count               60
GRAPHICAL PRESENTATION – NUMERICAL VARIABLE
Box-Plot: Describes the five-number summary (min, q1, median, q3, max).
[Figure 3: Distribution of Age (Box Plot). Vertical axis: 0 to 160; labeled points: min, q1, median, q3, max.]
NUMERICAL PRESENTATION
To understand how well a central value characterizes a set of observations, let
us consider the following two sets of data:
A: 30, 50, 70
B: 40, 50, 60
Both data sets have a mean of 50, but the observations in data set A lie farther
from the mean than those in data set B. Thus, the mean of data set B represents
its data better than the mean of data set A represents its data.
A fundamental concept in summary statistics is that of a central value for a set
of observations and the extent to which the central value characterizes the
whole set of data. Measures of central value such as the mean or median must
be coupled with measures of data dispersion (e.g., average distance from the
mean) to indicate how well the central value characterizes the data as a whole.
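The comparison of data sets A and B can be made concrete with the "average distance from the mean" mentioned above. The following is a minimal Python sketch; the function name `mean_abs_distance` is a hypothetical helper chosen for illustration, not a term from the slides.

```python
from statistics import mean

def mean_abs_distance(values):
    """Average absolute distance of the observations from their mean."""
    center = mean(values)
    return mean([abs(v - center) for v in values])

A = [30, 50, 70]
B = [40, 50, 60]

# Both data sets share the same central value ...
print(mean(A), mean(B))
# ... but A is more spread out around it than B.
print(mean_abs_distance(A), mean_abs_distance(B))
```

The smaller average distance for B is what makes 50 a better summary of B than of A.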
METHODS OF CENTER MEASUREMENT
Center measurement is a summary measure of the overall level of
a dataset.
Commonly used methods are mean, median, mode, geometric
mean etc.
Mean: Summing up all the observations and dividing by the number of
observations. The mean of 20, 30, 40 is (20+30+40)/3 = 30.
Notation: Let $x_1, x_2, \ldots, x_n$ be $n$ observations of a variable $x$. Then the mean of this variable is
$$\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n} = \frac{1}{n}\sum_{i=1}^{n} x_i$$
METHODS OF CENTER MEASUREMENT
Median: The middle value in an ordered sequence of observations.
That is, to find the median we order the data set and then
take the middle value. For example, to find the median of
{9, 3, 6, 7, 5}, we first sort the data, giving {3, 5, 6, 7, 9},
then choose the middle value, 6. If the number of observations
is even, e.g., {9, 3, 6, 7, 5, 2}, the median is the average of
the two middle values of the sorted sequence {2, 3, 5, 6, 7, 9},
in this case (5 + 6) / 2 = 5.5.
Mode: The value that is observed most frequently. The mode is
undefined for sequences in which no observation is repeated.
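The median and mode rules above can be checked with a small Python sketch. `statistics.median` is part of the standard library; `mode_or_none` is a hypothetical helper written here to mirror the slide's convention that the mode is undefined when no observation repeats. As a simplification, it returns only one value when several values tie for most frequent, and it assumes a non-empty list.

```python
from collections import Counter
from statistics import median

def mode_or_none(values):
    """Most frequent value, or None when no observation is repeated."""
    value, count = Counter(values).most_common(1)[0]
    return value if count > 1 else None

odd_sample = [9, 3, 6, 7, 5]
even_sample = [9, 3, 6, 7, 5, 2]

print(median(odd_sample))          # middle value of the sorted data
print(median(even_sample))         # average of the two middle values
print(mode_or_none([2, 3, 3, 5]))  # 3 occurs twice
print(mode_or_none(odd_sample))    # no repeats, so no mode
```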
MEAN OR MEDIAN
The median is less sensitive to outliers (extreme scores) than the
mean and is thus a better measure than the mean for highly skewed
distributions, e.g. family income. For example, the mean of 20, 30, 40,
and 990 is (20+30+40+990)/4 = 270. The median of these four
observations is (30+40)/2 = 35. Here 3 observations out of 4 lie
between 20 and 40, so the mean of 270 fails to give a realistic
picture of the major part of the data. It is dominated by the extreme
value 990.
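The effect of the extreme value 990 can be seen by computing both measures with and without it. This is a short sketch using the standard `statistics` module; the variable names are illustrative.

```python
from statistics import mean, median

values = [20, 30, 40, 990]
without_outlier = [20, 30, 40]

# The mean jumps from 30 to 270 when 990 is included ...
print(mean(without_outlier), mean(values))
# ... while the median only moves from 30 to 35.
print(median(without_outlier), median(values))
```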
METHODS OF VARIABILITY MEASUREMENT
Variability (or dispersion) measures the amount of scatter in a
dataset.
Commonly used methods: range, variance, standard deviation,
interquartile range, coefficient of variation etc.
Range: The difference between the largest and the smallest
observations. The range of 10, 5, 2, 100 is (100-2)=98. It is a crude
measure of variability.
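The range example on this slide, and the reason the range is called a crude measure, can be illustrated in a few lines of Python. `value_range` is a hypothetical helper name.

```python
def value_range(values):
    """Difference between the largest and the smallest observation."""
    return max(values) - min(values)

data = [10, 5, 2, 100]

print(value_range(data))           # 100 - 2
# Dropping the single extreme value 100 changes the range drastically,
# because the range depends on only two observations.
print(value_range([10, 5, 2]))     # 10 - 2
```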
METHODS OF VARIABILITY MEASUREMENT
Variance: The variance of a set of observations is, roughly, the average of the
squares of the deviations of the observations from their mean; for a sample, the
sum of squared deviations is divided by n − 1 rather than n. In
symbols, the variance of the n observations x1, x2, …, xn is
$$S^2 = \frac{(x_1 - \bar{x})^2 + \cdots + (x_n - \bar{x})^2}{n - 1}$$
Variance of 5, 7, 3? The mean is (5+7+3)/3 = 5 and the variance is
$$\frac{(5-5)^2 + (3-5)^2 + (7-5)^2}{3 - 1} = 4$$
Standard Deviation: Square root of the variance. The standard
deviation of the above example is 2.
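The variance formula with the n − 1 divisor can be written out directly. The sketch below is one straightforward Python rendering of it; the function names are illustrative, and the standard library's `statistics.variance` and `statistics.stdev` use the same n − 1 convention.

```python
from math import sqrt

def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    center = sum(values) / n
    return sum((v - center) ** 2 for v in values) / (n - 1)

def sample_std(values):
    """Standard deviation: square root of the sample variance."""
    return sqrt(sample_variance(values))

data = [5, 7, 3]
print(sample_variance(data))  # (0 + 4 + 4) / 2
print(sample_std(data))
```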
METHODS OF VARIABILITY MEASUREMENT
Quartiles: Data can be divided into four regions that cover the total
range of observed values. Cut points for these regions are known as
quartiles.
The first quartile (Q1) is the cut point below which the first 25% of the
data lie. The second quartile (Q2) is the cut point that closes the region
between the 25th and 50th percentage points in the data; Q2 is the median.
The third quartile (Q3) is the 75% cut point, closing the 25% of the data
that lie between the median and the 75% point.
Q1 is the median of the first half of the ordered observations and Q3 is
the median of the second half of the ordered observations.
In notation, the q-th quartile of a data set is the (q(n+1)/4)-th observation
of the ordered data, where q is the desired quartile and n is the number of
observations.
METHODS OF VARIABILITY MEASUREMENT
An example with 15 numbers
3 6 7 11 13 22 30 40 44 50 52 61 68 80 94
(Q1 is the 4th value, 11; Q2 is the 8th value, 40; Q3 is the 12th value, 61.)
In this example, Q1 is the ((15+1)/4)×1 = 4th observation of the data.
The 4th observation is 11, so Q1 of this data is 11.
The first quartile is Q1=11. The second quartile is Q2=40 (this is
also the median). The third quartile is Q3=61.
Inter-quartile Range: Difference between Q3 and Q1. The inter-quartile
range of this example is 61 − 11 = 50. The middle half of the
ordered data lie between 11 and 61.
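The position rule q(n+1)/4 can be sketched in Python for the 15-number example, together with the inter-quartile range Q3 − Q1. The helper `cut_point` is a hypothetical name. The slides do not say what to do when the position is not a whole number, so the linear interpolation between neighbouring observations used here is an assumption (it matches the `exclusive` method of `statistics.quantiles`).

```python
from math import floor

def cut_point(values, k, parts=4):
    """k-th cut point when the ordered data are split into `parts` parts,
    located at the 1-based position k * (n + 1) / parts."""
    ordered = sorted(values)
    n = len(ordered)
    position = k * (n + 1) / parts
    lower = floor(position)
    fraction = position - lower
    if lower < 1 or lower > n or (lower == n and fraction > 0):
        raise ValueError("position falls outside the data")
    if fraction == 0:
        return ordered[lower - 1]
    # Assumption: interpolate linearly between the two neighbouring values.
    return ordered[lower - 1] + fraction * (ordered[lower] - ordered[lower - 1])

data = [3, 6, 7, 11, 13, 22, 30, 40, 44, 50, 52, 61, 68, 80, 94]
q1, q2, q3 = (cut_point(data, k) for k in (1, 2, 3))
iqr = q3 - q1

print(q1, q2, q3)  # 4th, 8th and 12th observations
print(iqr)         # Q3 - Q1
```

The same helper covers other cut points by changing `parts`, for example `cut_point(data, 50, parts=100)` for the 50th percentile.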
DECILES AND PERCENTILES
Percentiles: If data is ordered and divided into 100 parts, then the cut
points are called percentiles. The 25th percentile is Q1, the 50th percentile
is the median (Q2), and the 75th percentile of the data is Q3.
In notation, the p-th percentile of a data set is the (p(n+1)/100)-th observation
of the ordered data, where p is the desired percentile and n is the number of
observations.
Deciles: If data is ordered and divided into 10 parts, then the cut points
are called deciles.
Coefficient of Variation: The standard deviation of the data divided by its
mean. It is usually expressed in percent.
$$\text{Coefficient of Variation} = \frac{\sigma}{\bar{x}} \times 100$$
where $\sigma$ is the standard deviation and $\bar{x}$ is the mean.
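As a minimal sketch, the coefficient of variation can be computed in Python as the standard deviation divided by the mean, times 100. The data 5, 7, 3 are reused from the variance example; using the sample standard deviation (`statistics.stdev`, with the n − 1 divisor) is an assumption consistent with the variance formula on the earlier slide.

```python
from statistics import mean, stdev

def coefficient_of_variation(values):
    """Standard deviation as a percentage of the mean."""
    return 100 * stdev(values) / mean(values)

data = [5, 7, 3]
# Standard deviation 2 and mean 5 give a coefficient of variation of 40%.
print(coefficient_of_variation(data))
```

Because it is a ratio, the coefficient of variation is unit-free, which makes it useful for comparing the spread of variables measured on different scales; it is only meaningful when the mean is not zero.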
THANK YOU

Student’s presentation

  • 1.
  • 3. BASICS OF STATISTICS Definition: Science of collection, presentation, analysis, and reasonable interpretation of data. Statistics presents a rigorous scientific method for gaining insight into data. For example, suppose we measure the weight of 100 patients in a study. With so many measurements, simply looking at the data fails to provide an informative account. However statistics can give an instant overall picture of data based on graphical presentation or numerical summarization irrespective to the number of data points. Besides data summarization, another important task of statistics is to make inference and predict relations of variables.
  • 4. TYPES OF DATA Descriptive Statistics Frequencies Basic measurements Inferential Statistics Hypothesis Testing Correlation Confidence Intervals Significance Testing Prediction 4
  • 5. A TAXONOMY OF STATISTICS
  • 6. STATISTICAL DESCRIPTION OF DATA Statistics describes a numeric set of data by its center, variability, and shape. Statistics describes a categorical set of data by the frequency, percentage, or proportion of each category.
  • 7. SOME DEFINITIONS Variable - any characteristic of an individual or entity. A variable can take different values for different individuals. Variables can be categorical or quantitative. Per S. S. Stevens… • Nominal - Categorical variables with no inherent order or ranking sequence, such as names or classes (e.g., gender). Values may be written as numerals but carry no numerical meaning (e.g., I, II, III). The only operation that can be applied to Nominal variables is enumeration. • Ordinal - Variables with an inherent rank or order, e.g. mild, moderate, severe. Values can be compared for equality, or for greater or less, but not for how much greater or less. • Interval - Values of the variable are ordered as in Ordinal, and additionally, differences between values are meaningful; however, the scale is not absolutely anchored. Calendar dates and temperatures on the Fahrenheit scale are examples. Addition and subtraction, but not multiplication and division, are meaningful operations. • Ratio - Variables with all properties of Interval plus an absolute, non-arbitrary zero point, e.g. age, weight, temperature (Kelvin). Addition, subtraction, multiplication, and division are all meaningful operations.
  • 8. SOME DEFINITIONS Distribution - (of a variable) tells us what values the variable takes and how often it takes these values. • Unimodal - having a single peak • Bimodal - having two distinct peaks • Symmetric - left and right half are mirror images.
  • 9. FREQUENCY DISTRIBUTION Consider a data set of 26 children of ages 1-6 years. Then the frequency distribution of the variable ‘age’ can be tabulated as follows:
      Frequency Distribution of Age
      Age        1   2   3   4   5   6
      Frequency  5   3   7   5   4   2
      Grouped Frequency Distribution of Age
      Age Group  1-2  3-4  5-6
      Frequency  8    12   6
  • 10. CUMULATIVE FREQUENCY Cumulative frequency of the data on the previous page:
      Age                   1   2   3   4   5   6
      Frequency             5   3   7   5   4   2
      Cumulative Frequency  5   8   15  20  24  26
      Age Group             1-2  3-4  5-6
      Frequency             8    12   6
      Cumulative Frequency  8    20   26
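The cumulative frequencies are running totals of the frequency row. A minimal Python sketch of that calculation, using the ages and frequencies from the slide (the variable names are illustrative, not part of the presentation):

```python
from itertools import accumulate

ages = [1, 2, 3, 4, 5, 6]
frequencies = [5, 3, 7, 5, 4, 2]

# Running total of the frequencies, one entry per age.
cumulative = list(accumulate(frequencies))

# Merge adjacent ages into the groups 1-2, 3-4, 5-6 and repeat the running total.
grouped = [frequencies[i] + frequencies[i + 1] for i in range(0, len(frequencies), 2)]
grouped_cumulative = list(accumulate(grouped))

print(cumulative)
print(grouped, grouped_cumulative)
```

The last cumulative value always equals the total number of observations, here 26.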
  • 11. DATA PRESENTATION Two types of statistical presentation of data - graphical and numerical. Graphical Presentation: We look for the overall pattern and for striking deviations from that pattern. The overall pattern is usually described by the shape, center, and spread of the data. An individual value that falls outside the overall pattern is called an outlier. Bar diagrams and pie charts are used for categorical variables. Histograms, stem-and-leaf plots, and box plots are used for numerical variables.
  • 12. Data Presentation – Categorical Variable Bar Diagram: Lists the categories and presents the percent or count of individuals who fall in each category.
      Treatment Group  Frequency  Proportion       Percent (%)
      1                15         (15/60) = 0.250  25.0
      2                25         (25/60) = 0.417  41.7
      3                20         (20/60) = 0.333  33.3
      Total            60         1.00             100
      Figure 1: Bar Chart of Subjects in Treatment Groups (x-axis: Treatment Group; y-axis: Number of Subjects)
  • 13. Data Presentation – Categorical Variable Pie Chart: Lists the categories and presents the percent or count of individuals who fall in each category.
      Treatment Group  Frequency  Proportion       Percent (%)
      1                15         (15/60) = 0.250  25.0
      2                25         (25/60) = 0.417  41.7
      3                20         (20/60) = 0.333  33.3
      Total            60         1.00             100
      Figure 2: Pie Chart of Subjects in Treatment Groups (group 1: 25%, group 2: 42%, group 3: 33%)
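The proportion and percent columns follow directly from the frequencies: each frequency is divided by the total of 60. A small Python sketch, assuming the three treatment-group counts from the slide (the dictionary layout is an illustrative choice):

```python
# Frequency of subjects per treatment group, as given on the slide.
frequencies = {1: 15, 2: 25, 3: 20}

total = sum(frequencies.values())

# Proportion = frequency / total; percent = 100 * proportion, rounded to one decimal.
proportions = {group: count / total for group, count in frequencies.items()}
percents = {group: round(100 * p, 1) for group, p in proportions.items()}

for group in frequencies:
    print(group, frequencies[group], round(proportions[group], 3), percents[group])
```

The proportions sum to 1, which is a quick check that no category was missed.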
  • 14. GRAPHICAL PRESENTATION – NUMERICAL VARIABLE Histogram: The overall pattern can be described by its shape, center, and spread. The following age distribution is right skewed. The center lies between 80 and 100. No outliers.
      Figure 3: Age Distribution (x-axis: Age in Months; y-axis: Number of Subjects)
      Summary statistics: Mean 90.41666667; Standard Error 3.902649518; Median 84; Mode 84; Standard Deviation 30.22979318; Sample Variance 913.8403955; Kurtosis -1.183899591; Skewness 0.389872725; Range 95; Minimum 48; Maximum 143; Sum 5425; Count 60
  • 15. GRAPHICAL PRESENTATION – NUMERICAL VARIABLE Box-Plot: Describes the five-number summary (min, q1, median, q3, max). Figure 3: Distribution of Age Box Plot
  • 16. NUMERICAL PRESENTATION A fundamental concept in summary statistics is that of a central value for a set of observations and the extent to which the central value characterizes the whole set of data. Measures of central value such as the mean or median must be coupled with measures of data dispersion (e.g., average distance from the mean) to indicate how well the central value characterizes the data as a whole. To understand how well a central value characterizes a set of observations, let us consider the following two sets of data: A: 30, 50, 70 B: 40, 50, 60 The mean of both data sets is 50. But the distance of the observations from the mean is larger in data set A than in data set B. Thus, the mean of data set B represents its data better than the mean of data set A does.
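To make the comparison of sets A and B concrete, the sketch below computes the "average distance from the mean" mentioned on the slide. The helper name mean_absolute_deviation is an illustrative choice, not a term used in the presentation:

```python
def mean_absolute_deviation(values):
    """Average distance of the observations from their mean."""
    center = sum(values) / len(values)
    return sum(abs(x - center) for x in values) / len(values)

set_a = [30, 50, 70]
set_b = [40, 50, 60]

# Both sets share the mean 50, but A is spread more widely around it.
mad_a = mean_absolute_deviation(set_a)  # (20 + 0 + 20) / 3
mad_b = mean_absolute_deviation(set_b)  # (10 + 0 + 10) / 3

print(mad_a, mad_b)
```

The smaller value for B is what the slide means when it says the mean characterizes B better than A.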
  • 17. METHODS OF CENTER MEASUREMENT Center measurement is a summary measure of the overall level of a dataset. Commonly used methods are mean, median, mode, geometric mean etc. Mean: Summing up all the observations and dividing by the number of observations. Mean of 20, 30, 40 is (20+30+40)/3 = 30. Notation: Let $x_1, x_2, \ldots, x_n$ be $n$ observations of a variable $x$. Then the mean of this variable is $\bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n} = \frac{\sum_{i=1}^{n} x_i}{n}$.
  • 18. METHODS OF CENTER MEASUREMENT Median: The middle value in an ordered sequence of observations. That is, to find the median we need to order the data set and then find the middle value. In case of an even number of observations the average of the two middle most values is the median. For example, to find the median of {9, 3, 6, 7, 5}, we first sort the data giving {3, 5, 6, 7, 9}, then choose the middle value 6. If the number of observations is even, e.g., {9, 3, 6, 7, 5, 2}, then the median is the average of the two middle values from the sorted sequence, in this case, (5 + 6) / 2 = 5.5. Mode: The value that is observed most frequently. The mode is undefined for sequences in which no observation is repeated.
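The three center measures can be checked with Python's standard statistics module. This is only a sketch: the helper mode_or_none is hypothetical and written to follow the slide's convention that the mode is undefined when no observation repeats. It assumes a non-empty list and, if several values tie, returns the one seen first.

```python
import statistics
from collections import Counter

def mode_or_none(values):
    """Most frequent value, or None when no observation is repeated."""
    value, count = Counter(values).most_common(1)[0]
    return value if count > 1 else None

mean_value = statistics.mean([20, 30, 40])          # (20 + 30 + 40) / 3
median_odd = statistics.median([9, 3, 6, 7, 5])     # middle of 3, 5, 6, 7, 9
median_even = statistics.median([9, 3, 6, 7, 5, 2]) # average of 5 and 6

mode_repeated = mode_or_none([84, 60, 84, 90])
mode_missing = mode_or_none([9, 3, 6, 7, 5])

print(mean_value, median_odd, median_even, mode_repeated, mode_missing)
```

statistics.median sorts the data internally, so the input does not need to be ordered first.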
  • 19. MEAN OR MEDIAN The median is less sensitive to outliers (extreme scores) than the mean and is thus a better measure than the mean for highly skewed distributions, e.g. family income. For example, the mean of 20, 30, 40, and 990 is (20+30+40+990)/4 = 270. The median of these four observations is (30+40)/2 = 35. Here 3 observations out of 4 lie between 20 and 40. So, the mean of 270 fails to give a realistic picture of the major part of the data. It is influenced by the extreme value 990.
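The effect of the extreme value can be seen by computing both measures with and without it. A short sketch using the slide's four observations (variable names are illustrative):

```python
import statistics

observations = [20, 30, 40, 990]
without_outlier = observations[:-1]  # drop the extreme value 990

mean_with = statistics.mean(observations)         # 270
median_with = statistics.median(observations)     # 35.0
mean_without = statistics.mean(without_outlier)   # 30
median_without = statistics.median(without_outlier)  # 30

# The mean shifts by 240 when 990 is included; the median shifts by only 5.
print(mean_with - mean_without, median_with - median_without)
```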
  • 20. METHODS OF VARIABILITY MEASUREMENT Commonly used methods: range, variance, standard deviation, interquartile range, coefficient of variation etc. Range: The difference between the largest and the smallest observations. The range of 10, 5, 2, 100 is (100-2)=98. It’s a crude measure of variability. Variability (or dispersion) measures the amount of scatter in a dataset.
  • 21. METHODS OF VARIABILITY MEASUREMENT Variance: The variance of a set of observations is the average of the squares of the deviations of the observations from their mean, with $n - 1$ rather than $n$ as the divisor. In symbols, the variance of the $n$ observations $x_1, x_2, \ldots, x_n$ is $S^2 = \frac{(x_1 - \bar{x})^2 + \cdots + (x_n - \bar{x})^2}{n - 1}$, where $\bar{x}$ is their mean. Variance of 5, 7, 3? The mean is (5+7+3)/3 = 5 and the variance is $\frac{(5-5)^2 + (3-5)^2 + (7-5)^2}{3 - 1} = 4$. Standard Deviation: Square root of the variance. The standard deviation of the above example is 2.
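The worked example can be reproduced in a few lines of Python. The function sample_variance below is an illustrative helper that divides by n - 1, as in the slide's formula. The standard library's statistics.variance uses the same divisor and is shown for comparison.

```python
import math
import statistics

def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / (n - 1)

data = [5, 7, 3]
variance = sample_variance(data)          # (0 + 4 + 4) / 2
standard_deviation = math.sqrt(variance)

print(variance, standard_deviation)
print(statistics.variance(data))          # same n - 1 divisor
```

Dividing by n instead (statistics.pvariance) would give 8/3 for the same data, so it matters which convention a table or tool reports.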
  • 22. METHODS OF VARIABILITY MEASUREMENT Quartiles: Data can be divided into four regions that cover the total range of observed values. Cut points for these regions are known as quartiles. The first quartile (Q1) is the cut point below which the first 25% of the data lie. The second quartile (Q2) is the cut point at 50% of the data, which is the median. The third quartile (Q3) is the cut point at 75% of the data. Q1 is the median of the first half of the ordered observations and Q3 is the median of the second half of the ordered observations. In notation, the q-th quartile of a data set is the ((n+1)/4)q-th observation of the ordered data, where q is the desired quartile and n is the number of observations.
  • 23. METHODS OF VARIABILITY MEASUREMENT An example with 15 numbers: 3 6 7 11 13 22 30 40 44 50 52 61 68 80 94. In this example Q1 is the ((15+1)/4)×1 = 4th observation of the data. The 4th observation is 11, so Q1 of this data is 11. The second quartile is Q2 = 40 (the 8th observation; this is also the median). The third quartile is Q3 = 61 (the 12th observation). Inter-quartile Range: Difference between Q3 and Q1. The inter-quartile range of this example is 61 - 11 = 50. The middle half of the ordered data lie between 11 and 61.
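The position rule ((n+1)/4)q can be sketched in Python as follows. The slides do not say what to do when the position is not a whole number, so the linear interpolation between neighbouring observations used here is an assumption, and the function name quartile is illustrative. For the 15-number example every position is a whole number, so the assumption does not affect the result.

```python
import statistics

def quartile(sorted_values, q):
    """q-th quartile at 1-based position (n + 1) * q / 4 of the ordered data."""
    n = len(sorted_values)
    position = (n + 1) * q / 4
    lower = int(position)          # whole part of the position
    fraction = position - lower    # fractional part, used for interpolation
    if lower < 1:
        return sorted_values[0]
    if lower >= n:
        return sorted_values[-1]
    below, above = sorted_values[lower - 1], sorted_values[lower]
    return below + fraction * (above - below)

data = sorted([3, 6, 7, 11, 13, 22, 30, 40, 44, 50, 52, 61, 68, 80, 94])
q1, q2, q3 = (quartile(data, q) for q in (1, 2, 3))
iqr = q3 - q1

print(q1, q2, q3, iqr)
# The standard library's "exclusive" method uses the same (n + 1) positions.
print(statistics.quantiles(data, n=4, method="exclusive"))
```

Other tools may use different position rules, so quartiles reported by spreadsheets or statistics packages can differ slightly on small data sets.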
  • 24. DECILES AND PERCENTILES Percentiles: If data is ordered and divided into 100 parts, then the cut points are called Percentiles. The 25th percentile is Q1, the 50th percentile is the Median (Q2), and the 75th percentile of the data is Q3. In notation, the p-th percentile of a data set is the ((n+1)/100)p-th observation of the ordered data, where p is the desired percentile and n is the number of observations. Deciles: If data is ordered and divided into 10 parts, then the cut points are called Deciles. Coefficient of Variation: The standard deviation of the data divided by its mean. It is usually expressed in percent. Coefficient of Variation = $\frac{\sigma}{\bar{x}} \times 100$, where $\sigma$ is the standard deviation and $\bar{x}$ is the mean.
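Both definitions on this slide can be sketched in Python. The percentile function applies the ((n+1)/100)p position rule. Interpolating between neighbours for fractional positions is an assumption the slides do not spell out. The coefficient of variation is computed with the sample standard deviation (n - 1 divisor) for the observations 5, 7, 3. All function and variable names are illustrative.

```python
import statistics

def percentile(sorted_values, p):
    """p-th percentile at 1-based position (n + 1) * p / 100 of the ordered data."""
    n = len(sorted_values)
    position = (n + 1) * p / 100
    lower = int(position)
    fraction = position - lower
    if lower < 1:
        return sorted_values[0]
    if lower >= n:
        return sorted_values[-1]
    below, above = sorted_values[lower - 1], sorted_values[lower]
    return below + fraction * (above - below)

def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean, in percent."""
    return statistics.stdev(values) / statistics.mean(values) * 100

ordered = sorted([3, 6, 7, 11, 13, 22, 30, 40, 44, 50, 52, 61, 68, 80, 94])
p25, p50, p75 = (percentile(ordered, p) for p in (25, 50, 75))
cv = coefficient_of_variation([5, 7, 3])  # standard deviation 2, mean 5

print(p25, p50, p75)
print(cv)
```

Because the coefficient of variation is a ratio, it has no units, which makes it useful for comparing the spread of variables measured on different scales.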