SlideShare a Scribd company logo
1 of 26
Data Analysis Lab - 1
Introduction
By
Dr. Abhishek Kumar Singh
Student Introduction
• Name
• City and State
• Education detail (graduation, XII and X)
• PhD (IIT BHU Varanasi)
• M Tech (IIT BHU Varanasi)
• B Tech (GBTU)
• 3 Research Paper in SCOPUS/ABDC Indexed
journals
• 8 papers reviewed as a reviewer
• Six sigma green belt
Content
• Syllabus
• Data Analysis
• Variables
• Univariate
• Bivariate
Univariate Descriptive Analysis
• Measures of Central Tendency- Mean, Median,
Mode
• Measures of Variability- Range, Variance, Standard
Deviation, Co-efficient of Deviation
• Measures of Shape- Skewness and Kurtosis
• Measures of Stability- Standard Error
Bivariate Descriptive Analysis
• Covariance
• Correlation
Data Analysis
• The Process of cleaning, transforming,
interpreting, analyzing and visualizing the data
to extract useful information and gain valuable
insights to make more effective business
decisions is called data analysis.
Variables
• Variables: Any character, characteristics or
quality that varies is termed a variable.
• E.g.: To collect the basic clinical and
demographic information on patients with
particular illness. Variables of interest may
include Gender (M/F), age and height of the
patients.
Variable
Categorical Numerical
Nominal Ordinal Discrete Continuous
Categories are
mutually
exclusive and
unordered.
Eg. Gender (M/F)
Blood Group
(A/B/AB/O)
Categories are
mutually exclusive
and ordered.
Eg. Disease
severity (Mild,
Moderate and
Severe)
Integer values,
typically counts no
notion of
magnitude. Eg. No.
of children
vaccinated, days
sick per year
Takes any value in
a range of values
have a magnitude.
E.g. weight in kg
and Height in cm
Statistics
Descriptive Inferential
• Collecting
• Organizing
• Summarizing
• Presenting Data
• Making inference
• Hypothesis testing
• Determining relationship
• Making Prediction
Three types of analysis
• Univariate analysis: the examination of cases on only
one variable at a time (e.g., weight of college
students).
• Bivariate analysis: the examination of two variables
simultaneously (e.g., the relation between gender
and weight of college students).
• Multivariate analysis: examination of two variables
simultaneously (e.g., the relationship between
gender, race, and weight of college students).
Purpose of different type of analysis
• Univariate analysis: mainly description
• Bivariate analysis: Determining the empirical
relationship between two variables.
• Multivariate analysis: Determining the empirical
relationship among multiple variables.
Univariate
• The objective of univariate analysis is to derive the
data, define and summarize it and analyze the
pattern present in it.
• Univariate techniques are appropriate when there is
a single measurement of each element in the sample
or when there are several measurements of each
element but each variable is analyzed in isolation.
Univariate
Descriptive Inferential
• Measures of Central Tendency- Mean,
Median, Mode
• Measures of Variability- Range,
Variance, Standard Deviation, Co-efficient
of Deviation
• Measures of Shape- Skewness and
Kurtosis
• Measures of Stability- Standard Error
• z test
• t test
• Chi square test
Numerical Methods
• Mean
– Let X1, X2, X3,….Xn be the n data points, then mean
of data is defined as
– Mean provide the central value about which the
data is spread out.
Numerical Methods
• Median
– Median is the value which divide the data in two
halves
– Let X1, X2, X3,….Xn be the n data points
– Order the n data values
– If the number of data points is odd then sample
median is the value in position of (n+1)/2
– If the number of data points is even then sample
median is the average of value in position of n/2
and (n/2+1)
Mean or Median?
• Both the measures provide the “middle” value
of data, so how do they compare?
– Median is robust again extreme values in the data
– While mean is affected by the extreme values
• Example: 8, 9, 10, 11, 12 be the five data
points
– Mean = 10 and Median = 10
– Replace 12 by 18
• Mean = 11.2 but Median =10
Numerical Methods
• Mode
– Mode is the a value in data that occurs with
highest frequency
– It’s the most probable value of the data
– It is possible to have data that has more than one
Mode value. Such data is called multimodal.
Measures of Variability
• Percentile
– Order the data in ascending order
• Then, p1 in called the first percentile if 1% of points lie
below this value
• Similarly pk is called the k% of data points lie below this
value, where 0≤k≤100
• Quartile
– P25 is called the 1st quartile Q1
– P75 is called the 3rd quartile Q3
– P50 is Median
Measure of Dispersion
• Measures the spread of data
– Range
– Variation or standard deviation
• Measures the spread about mean/average value of
data
– Interquartile range
• Measures the spread about median value of the data
Measure of Dispersion
• Range = M-m, where,
– M = Max (x1, x2, ….xn)
– m = Min (x1, x2, ….xn)
• Variance
– S2 =
– Standard deviation = S
• Interquartile range: Q3 - Q1
Standard Deviation
• Standard Deviation is most commonly used
measure of dispersion.
– Under the assumption of normality the range of
Covers 67% of the data.
• Hence, this is commonly used to show possible error in
the observed value of data
Graphical Method
• Histogram or Bar chart
– Frequency Plot
• Pie Chart
• Cumulative frequency plot
• Box and Whisker plot
Bivariate
• Bi means two and variate means variable, so here
there are two variables. The analysisis related to
cause and the relationship between the two
variables.
• Correlation
• Covariance

More Related Content

Similar to Data Analysis Introduction.pptx

Sampling and Data_Update.ppt
Sampling and Data_Update.pptSampling and Data_Update.ppt
Sampling and Data_Update.pptMdShohelRana69
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptxjeyanthisivakumar
 
Introduction to statistics.pptx
Introduction to statistics.pptxIntroduction to statistics.pptx
Introduction to statistics.pptxMuddaAbdo1
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxSailajaReddyGunnam
 
Introduction to Biostatistics_20_4_17.ppt
Introduction to Biostatistics_20_4_17.pptIntroduction to Biostatistics_20_4_17.ppt
Introduction to Biostatistics_20_4_17.pptnyakundi340
 
Chapter 11 quantitative data
Chapter 11 quantitative dataChapter 11 quantitative data
Chapter 11 quantitative datau59
 
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2  - NORM, CORRELATION AND REGRESSION.pptCHAPTER 2  - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.pptkriti137049
 
Descriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptxDescriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptxSachinKumar524686
 
Statistical analysis
Statistical analysisStatistical analysis
Statistical analysisXiuxia Du
 
Chapter 6.pptx Data Analysis and processing
Chapter 6.pptx Data Analysis and processingChapter 6.pptx Data Analysis and processing
Chapter 6.pptx Data Analysis and processingetebarkhmichale
 
Introduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse ResearchersIntroduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse ResearchersRupa Verma
 
PARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptxPARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptxDrLasya
 
Multivariate Analysis Techniques
Multivariate Analysis TechniquesMultivariate Analysis Techniques
Multivariate Analysis TechniquesMehul Gondaliya
 
Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8ParulSharma130721
 

Similar to Data Analysis Introduction.pptx (20)

Sampling and Data_Update.ppt
Sampling and Data_Update.pptSampling and Data_Update.ppt
Sampling and Data_Update.ppt
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Introduction to statistics.pptx
Introduction to statistics.pptxIntroduction to statistics.pptx
Introduction to statistics.pptx
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptx
 
Introduction to Biostatistics_20_4_17.ppt
Introduction to Biostatistics_20_4_17.pptIntroduction to Biostatistics_20_4_17.ppt
Introduction to Biostatistics_20_4_17.ppt
 
Chapter 11 quantitative data
Chapter 11 quantitative dataChapter 11 quantitative data
Chapter 11 quantitative data
 
BMS.ppt
BMS.pptBMS.ppt
BMS.ppt
 
Analysis
AnalysisAnalysis
Analysis
 
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2  - NORM, CORRELATION AND REGRESSION.pptCHAPTER 2  - NORM, CORRELATION AND REGRESSION.ppt
CHAPTER 2 - NORM, CORRELATION AND REGRESSION.ppt
 
Descriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptxDescriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptx
 
PRESENTATION.pptx
PRESENTATION.pptxPRESENTATION.pptx
PRESENTATION.pptx
 
Statistical analysis
Statistical analysisStatistical analysis
Statistical analysis
 
Chapter 6.pptx Data Analysis and processing
Chapter 6.pptx Data Analysis and processingChapter 6.pptx Data Analysis and processing
Chapter 6.pptx Data Analysis and processing
 
determinatiion of
determinatiion of determinatiion of
determinatiion of
 
Introduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse ResearchersIntroduction to Data Analysis for Nurse Researchers
Introduction to Data Analysis for Nurse Researchers
 
PARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptxPARAMETRIC TESTS.pptx
PARAMETRIC TESTS.pptx
 
Multivariate Analysis Techniques
Multivariate Analysis TechniquesMultivariate Analysis Techniques
Multivariate Analysis Techniques
 
Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8
 
ANALYSIS OF DATA.pptx
ANALYSIS OF DATA.pptxANALYSIS OF DATA.pptx
ANALYSIS OF DATA.pptx
 
Biostatistics ppt
Biostatistics  pptBiostatistics  ppt
Biostatistics ppt
 

More from DrAbhishekKumarSingh3

More from DrAbhishekKumarSingh3 (6)

Microsoft word.pptx
Microsoft word.pptxMicrosoft word.pptx
Microsoft word.pptx
 
Data Preparation.pptx
Data Preparation.pptxData Preparation.pptx
Data Preparation.pptx
 
Sorting and Filtering.pptx
Sorting and Filtering.pptxSorting and Filtering.pptx
Sorting and Filtering.pptx
 
BASIC STRUCTURE OF COMPUTERS.pptx
BASIC STRUCTURE OF COMPUTERS.pptxBASIC STRUCTURE OF COMPUTERS.pptx
BASIC STRUCTURE OF COMPUTERS.pptx
 
How to start writing a paper.pptx
How to start writing a paper.pptxHow to start writing a paper.pptx
How to start writing a paper.pptx
 
Optimization using lp.pptx
Optimization using lp.pptxOptimization using lp.pptx
Optimization using lp.pptx
 

Recently uploaded

Micro-Choices, Max Impact Personalizing Your Journey, One Moment at a Time.pdf
Micro-Choices, Max Impact Personalizing Your Journey, One Moment at a Time.pdfMicro-Choices, Max Impact Personalizing Your Journey, One Moment at a Time.pdf
Micro-Choices, Max Impact Personalizing Your Journey, One Moment at a Time.pdfPiyush Kumar
 
Martal Group - B2B Lead Gen Agency - Onboarding Overview
Martal Group - B2B Lead Gen Agency - Onboarding OverviewMartal Group - B2B Lead Gen Agency - Onboarding Overview
Martal Group - B2B Lead Gen Agency - Onboarding OverviewMartal Group
 
BDSM⚡Call Girls in Sector 19 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 19 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 19 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 19 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
W.H.Bender Quote 61 -Influential restaurant and food service industry network...
W.H.Bender Quote 61 -Influential restaurant and food service industry network...W.H.Bender Quote 61 -Influential restaurant and food service industry network...
W.H.Bender Quote 61 -Influential restaurant and food service industry network...William (Bill) H. Bender, FCSI
 
BDSM⚡Call Girls in Sector 44 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 44 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 44 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 44 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Instant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best PracticesInstant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best PracticesMedia Logic
 
Labour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptxLabour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptxelizabethella096
 
Press Release Distribution Evolving with Digital Trends.pdf
Press Release Distribution Evolving with Digital Trends.pdfPress Release Distribution Evolving with Digital Trends.pdf
Press Release Distribution Evolving with Digital Trends.pdfPR Wires
 
Brand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdfBrand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdftbatkhuu1
 
Best 5 Graphics Designing Course In Chandigarh
Best 5 Graphics Designing Course In ChandigarhBest 5 Graphics Designing Course In Chandigarh
Best 5 Graphics Designing Course In Chandigarhhamitthakurdma01
 
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptxUnveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptxelizabethella096
 
2024 Social Trends Report V4 from Later.com
2024 Social Trends Report V4 from Later.com2024 Social Trends Report V4 from Later.com
2024 Social Trends Report V4 from Later.comnmislamchannal
 
What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?riteshhsociall
 
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
How consumers use technology and the impacts on their lives
How consumers use technology and the impacts on their livesHow consumers use technology and the impacts on their lives
How consumers use technology and the impacts on their livesMathuraa
 
Choosing the Right White Label SEO Services to Boost Your Agency's Growth.pdf
Choosing the Right White Label SEO Services to Boost Your Agency's Growth.pdfChoosing the Right White Label SEO Services to Boost Your Agency's Growth.pdf
Choosing the Right White Label SEO Services to Boost Your Agency's Growth.pdfAutus Digital
 
Unlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich ManuscriptUnlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich Manuscriptelizabethella096
 
Elevate Your Advertising Game: Introducing Billion Broadcaster Lift Advertising
Elevate Your Advertising Game: Introducing Billion Broadcaster Lift AdvertisingElevate Your Advertising Game: Introducing Billion Broadcaster Lift Advertising
Elevate Your Advertising Game: Introducing Billion Broadcaster Lift AdvertisingVikasYadav194549
 
personal branding kit for music business
personal branding kit for music businesspersonal branding kit for music business
personal branding kit for music businessbrjohnson6
 
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 

Recently uploaded (20)

Micro-Choices, Max Impact Personalizing Your Journey, One Moment at a Time.pdf
Micro-Choices, Max Impact Personalizing Your Journey, One Moment at a Time.pdfMicro-Choices, Max Impact Personalizing Your Journey, One Moment at a Time.pdf
Micro-Choices, Max Impact Personalizing Your Journey, One Moment at a Time.pdf
 
Martal Group - B2B Lead Gen Agency - Onboarding Overview
Martal Group - B2B Lead Gen Agency - Onboarding OverviewMartal Group - B2B Lead Gen Agency - Onboarding Overview
Martal Group - B2B Lead Gen Agency - Onboarding Overview
 
BDSM⚡Call Girls in Sector 19 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 19 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 19 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 19 Noida Escorts >༒8448380779 Escort Service
 
W.H.Bender Quote 61 -Influential restaurant and food service industry network...
W.H.Bender Quote 61 -Influential restaurant and food service industry network...W.H.Bender Quote 61 -Influential restaurant and food service industry network...
W.H.Bender Quote 61 -Influential restaurant and food service industry network...
 
BDSM⚡Call Girls in Sector 44 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 44 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 44 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 44 Noida Escorts >༒8448380779 Escort Service
 
Instant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best PracticesInstant Digital Issuance: An Overview With Critical First Touch Best Practices
Instant Digital Issuance: An Overview With Critical First Touch Best Practices
 
Labour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptxLabour Day Celebrating Workers and Their Contributions.pptx
Labour Day Celebrating Workers and Their Contributions.pptx
 
Press Release Distribution Evolving with Digital Trends.pdf
Press Release Distribution Evolving with Digital Trends.pdfPress Release Distribution Evolving with Digital Trends.pdf
Press Release Distribution Evolving with Digital Trends.pdf
 
Brand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdfBrand experience Peoria City Soccer Presentation.pdf
Brand experience Peoria City Soccer Presentation.pdf
 
Best 5 Graphics Designing Course In Chandigarh
Best 5 Graphics Designing Course In ChandigarhBest 5 Graphics Designing Course In Chandigarh
Best 5 Graphics Designing Course In Chandigarh
 
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptxUnveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
Unveiling the Legacy of the Rosetta stone A Key to Ancient Knowledge.pptx
 
2024 Social Trends Report V4 from Later.com
2024 Social Trends Report V4 from Later.com2024 Social Trends Report V4 from Later.com
2024 Social Trends Report V4 from Later.com
 
What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?What is Google Search Console and What is it provide?
What is Google Search Console and What is it provide?
 
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 39 Noida Escorts Escorts >༒8448380779 Escort Service
 
How consumers use technology and the impacts on their lives
How consumers use technology and the impacts on their livesHow consumers use technology and the impacts on their lives
How consumers use technology and the impacts on their lives
 
Choosing the Right White Label SEO Services to Boost Your Agency's Growth.pdf
Choosing the Right White Label SEO Services to Boost Your Agency's Growth.pdfChoosing the Right White Label SEO Services to Boost Your Agency's Growth.pdf
Choosing the Right White Label SEO Services to Boost Your Agency's Growth.pdf
 
Unlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich ManuscriptUnlocking the Mystery of the Voynich Manuscript
Unlocking the Mystery of the Voynich Manuscript
 
Elevate Your Advertising Game: Introducing Billion Broadcaster Lift Advertising
Elevate Your Advertising Game: Introducing Billion Broadcaster Lift AdvertisingElevate Your Advertising Game: Introducing Billion Broadcaster Lift Advertising
Elevate Your Advertising Game: Introducing Billion Broadcaster Lift Advertising
 
personal branding kit for music business
personal branding kit for music businesspersonal branding kit for music business
personal branding kit for music business
 
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 128 Noida Escorts >༒8448380779 Escort Service
 

Data Analysis Introduction.pptx

  • 1. Data Analysis Lab - 1 Introduction By Dr. Abhishek Kumar Singh
  • 2. Student Introduction • Name • City and State • Education detail (graduation, XII and X)
  • 3. • PhD (IIT BHU Varanasi) • M Tech (IIT BHU Varanasi) • B Tech (GBTU) • 3 Research Paper in SCOPUS/ABDC Indexed journals • 8 papers reviewed as a reviewer • Six sigma green belt
  • 4. Content • Syllabus • Data Analysis • Variables • Univariate • Bivariate
  • 5. Univariate Descriptive Analysis • Measures of Central Tendency- Mean, Median, Mode • Measures of Variability- Range, Variance, Standard Deviation, Co-efficient of Deviation • Measures of Shape- Skewness and Kurtosis • Measures of Stability- Standard Error
  • 6. Bivariate Descriptive Analysis • Covariance • Correlation
  • 7. Data Analysis • The Process of cleaning, transforming, interpreting, analyzing and visualizing the data to extract useful information and gain valuable insights to make more effective business decisions is called data analysis.
  • 8. Variables • Variables: Any character, characteristics or quality that varies is termed a variable. • E.g.: To collect the basic clinical and demographic information on patients with particular illness. Variables of interest may include Gender (M/F), age and height of the patients.
  • 9. Variable Categorical Numerical Nominal Ordinal Discrete Continuous Categories are mutually exclusive and unordered. Eg. Gender (M/F) Blood Group (A/B/AB/O) Categories are mutually exclusive and ordered. Eg. Disease severity (Mild, Moderate and Severe) Integer values, typically counts no notion of magnitude. Eg. No. of children vaccinated, days sick per year Takes any value in a range of values have a magnitude. E.g. weight in kg and Height in cm
  • 10. Statistics Descriptive Inferential • Collecting • Organizing • Summarizing • Presenting Data • Making inference • Hypothesis testing • Determining relationship • Making Prediction
  • 11. Three types of analysis • Univariate analysis: the examination of cases on only one variable at a time (e.g., weight of college students). • Bivariate analysis: the examination of two variables simultaneously (e.g., the relation between gender and weight of college students). • Multivariate analysis: examination of two variables simultaneously (e.g., the relationship between gender, race, and weight of college students).
  • 12. Purpose of different type of analysis • Univariate analysis: mainly description • Bivariate analysis: Determining the empirical relationship between two variables. • Multivariate analysis: Determining the empirical relationship among multiple variables.
  • 13. Univariate • The objective of univariate analysis is to derive the data, define and summarize it and analyze the pattern present in it. • Univariate techniques are appropriate when there is a single measurement of each element in the sample or when there are several measurements of each element but each variable is analyzed in isolation.
  • 14. Univariate Descriptive Inferential • Measures of Central Tendency- Mean, Median, Mode • Measures of Variability- Range, Variance, Standard Deviation, Co-efficient of Deviation • Measures of Shape- Skewness and Kurtosis • Measures of Stability- Standard Error • z test • t test • Chi square test
  • 15. Numerical Methods • Mean – Let X1, X2, X3,….Xn be the n data points, then mean of data is defined as – Mean provide the central value about which the data is spread out.
  • 16. Numerical Methods • Median – Median is the value which divide the data in two halves – Let X1, X2, X3,….Xn be the n data points – Order the n data values – If the number of data points is odd then sample median is the value in position of (n+1)/2 – If the number of data points is even then sample median is the average of value in position of n/2 and (n/2+1)
  • 17. Mean or Median? • Both the measures provide the “middle” value of data, so how do they compare? – Median is robust again extreme values in the data – While mean is affected by the extreme values • Example: 8, 9, 10, 11, 12 be the five data points – Mean = 10 and Median = 10 – Replace 12 by 18 • Mean = 11.2 but Median =10
  • 18. Numerical Methods • Mode – Mode is the a value in data that occurs with highest frequency – It’s the most probable value of the data – It is possible to have data that has more than one Mode value. Such data is called multimodal.
  • 19. Measures of Variability • Percentile – Order the data in ascending order • Then, p1 in called the first percentile if 1% of points lie below this value • Similarly pk is called the k% of data points lie below this value, where 0≤k≤100 • Quartile – P25 is called the 1st quartile Q1 – P75 is called the 3rd quartile Q3 – P50 is Median
  • 20. Measure of Dispersion • Measures the spread of data – Range – Variation or standard deviation • Measures the spread about mean/average value of data – Interquartile range • Measures the spread about median value of the data
  • 21. Measure of Dispersion • Range = M-m, where, – M = Max (x1, x2, ….xn) – m = Min (x1, x2, ….xn) • Variance – S2 = – Standard deviation = S • Interquartile range: Q3 - Q1
  • 22. Standard Deviation • Standard Deviation is most commonly used measure of dispersion. – Under the assumption of normality the range of Covers 67% of the data. • Hence, this is commonly used to show possible error in the observed value of data
  • 23. Graphical Method • Histogram or Bar chart – Frequency Plot • Pie Chart • Cumulative frequency plot • Box and Whisker plot
  • 24.
  • 25.
  • 26. Bivariate • Bi means two and variate means variable, so here there are two variables. The analysisis related to cause and the relationship between the two variables. • Correlation • Covariance