SlideShare ist ein Scribd-Unternehmen logo
1 von 13
Downloaden Sie, um offline zu lesen
Quantified Self
      Exploiting Your Data


           22 March 2012
            Akram Najjar




 This talk is en “eye opener”
     We will Not discuss
    Techniques or “How”
      Data is Analyzed!

We will Only talk about “What”
  such methods can give us
What Methods can you Apply to Your Data?


A. The Bell Shaped Curve (Normal Distribution)
B. Correlation of two variables
C. Forecasting using Simple Linear Regression
   (Best Line of Fit)
D. Statistical Process Control




                                                 3 / 25




Other Tools that work directly on Data . . . .

   Goodness of Fit testing
   Independence Testing
   Moving Averages and Exponential Smoothing
   Non-Linear Regression
    (polynomial, exponential, logarithmic)
   Weighted Index Scoring
   Excel: The Pivot Table
   Excel: Conditional Formatting


                                                 4 / 25
A. The Bell Shaped Curve
                  (The Gaussian or Normal Distribution)
                     Useful when you have a lot of data
                     Prepare a Bar Chart or a Frequency Table
                     Most likely, they will plot as a Bell Shaped Curve
                      (Normal/Gauss Curve)
                     Example: Measurements of most natural variables
                     Example: Measurements of most manufactured items
                     Prepare a frequency table of your data
                     How many times did you get a specific value?
                     Out of 200 measurements, how many times was your Systolic
                      Blood Pressure = 110,115, 120, 125, 130, 135, 140 . .


                                                                                    5 / 25




                      Here are 24 Systolic Blood Pressure
                      Measurements – They Look like a Bell Curve
                                                                   Probability of
                                                                  Pressure > 125
                                                                   = (4 + 2) / 24
                                                                   = 1/4 = 25%
How many times?




                        Probability of Pressure > 125 = (4 + 2) / 24 = 1/4 = 25%
If we had 201 measurements . . . .

                                 Total Count in Bars
                                 = Area of Bars
                                 = Probability > 122
                                 = 15.83%




            The Bell Shaped Curve
          is completely defined by:

a)   Average (115) of the data

b)   Standard deviation (7) of the data. It
     indicates how spread is our data from
     the average.

     (Approx 70% of observations are
     between 115-7 and 115+7)
What do we get if we use the Bell Shaped
Curve (Normal Distribution)?

    Benefit 1: measuring the spread of our data
    Benefit 2: we can now compare specific
     scores in two different population (next slide)
    Benefit 3: if we know the measure, we can
     compute the probability of it happening
    Benefit 4: if we know the probability, we can
     work out the cut off measure that will give it


                                                       9 / 25




If I have the same score 78 in Courses A and B,
     can I say I am doing the same in both?

                          78
                     72


                           88
Benefits 3 and 4

   Given a specific measurement or range, what is
    the probability of their occurrence?
        Probability I will get a fever of more than 38 degrees?
        Probability flights will be more than 30 minutes late?
        Probability my systolic is > 122
   Given the probability, what is the cutoff
    measurement?
        I want to remain at a sugar level representing the top 15%
         allowed, what is the level related to that?
        If Human Resources want the top 15% results, what is the
         passing grade?
                                                                      11 / 25




B. Correlation
   If we have two sets of data, how are they related?
   Example: Blood Pressure vs Intake of Salt
   Example: Advertising Expenditure vs Sales Revenue
   Example: Hours walked per day vs Weight in Kilograms
   What is the direction of the relationship?
        Direct or inverse?
   What is the strength of the relationship?
        Correlation
   We use the Correlation Function (Demonstrate in Excel)

                                                                      12 / 25
C. Forecasting using Simple Linear
  Regression (Best Line of Fit)
   If we have an independent variable (X): Sugar Intake
   And a dependent variable (Y): Weight
   What is the relationship that allows us to forecast
    Weight for different Sugar Intakes?

   We need two columns: X and Y
   Simple Linear Regression allows us to find the Best
    Line to fit our data


                                                      13 / 25




           Regression finds the Best Line
             that Fits our Observations


      5
Y
      4

      3

      2

      1

     0,0     1    2    3    4    5   6    7    8
Which Straight Line Best Fits our Observations?


      5
 Y
      4

      3

      2

      1

     0,0    1    2   3   4   5    6   7   8




     Multiple Regression: allows us to find the
         Equation Y = aX1 + bX2 + cX3 + d




                X2           X3


      X1                                      Y
                                                  16 / 25
D. Statistical Process Control (SPC)

              The Purpose of SPC is to Monitor a Process
              SPC allows us to Check if a variable is behaving properly
                   Over time
                   Over different locations/departments
                   Over different events
                   Over different samples
              Control Charts were first used in Bell Labs (1924)
              Although mostly used in industry SPC can be used in any sector




                                                                                          17 / 25




               The General Form of a Control Chart: 4 Components
                            4) Process Data
                                                           1) UCL : Upper Control Limit
Our Variable




                                                                       2) AL : Average Line




                                                            3) LCL : Lower Lower Limit




                     The IDs of the Samples - - - - - OR The Time Series
This Process is “In control”

50
45
40
35
                                               Upper Limit
30
25
20
15
10
                                               Lower Limit
5
0




      This Process is Regularly “Out of Control”




            Look for an explanation INSIDE the system
This Process is Irregularly “Out of Control”




     Look for an explanation OUTSIDE the system




This Process is Irregularly “Out of Control”.
                                    Trends in either
                                    direction of 5 or
                                       more points




     Look for an explanation OUTSIDE the system
The 7 Point Rule: there is a problem if 7 points in a
row (Or more) are above the average or below it




      Look for an explanation OUTSIDE the system




            Types of Control Charts
Thank you
for your kind
    attention

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (19)

Measures of Dispersion
Measures of DispersionMeasures of Dispersion
Measures of Dispersion
 
Regression
RegressionRegression
Regression
 
Biostatistics i
Biostatistics iBiostatistics i
Biostatistics i
 
13 ch ken black solution
13 ch ken black solution13 ch ken black solution
13 ch ken black solution
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
 
15 ch ken black solution
15 ch ken black solution15 ch ken black solution
15 ch ken black solution
 
Statistical analysis in analytical chemistry
Statistical analysis in analytical chemistryStatistical analysis in analytical chemistry
Statistical analysis in analytical chemistry
 
statistics
statisticsstatistics
statistics
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersion
 
Variability
VariabilityVariability
Variability
 
Properties of Standard Deviation
Properties of Standard DeviationProperties of Standard Deviation
Properties of Standard Deviation
 
Measure of Dispersion
Measure of DispersionMeasure of Dispersion
Measure of Dispersion
 
Assessing Normality
Assessing NormalityAssessing Normality
Assessing Normality
 
11 ch ken black solution
11 ch ken black solution11 ch ken black solution
11 ch ken black solution
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Statistics-Measures of dispersions
Statistics-Measures of dispersionsStatistics-Measures of dispersions
Statistics-Measures of dispersions
 
16 ch ken black solution
16 ch ken black solution16 ch ken black solution
16 ch ken black solution
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
 
The Standard Normal Distribution
The Standard Normal DistributionThe Standard Normal Distribution
The Standard Normal Distribution
 

Andere mochten auch

A couple cool Chinabuye's products
A couple cool Chinabuye's productsA couple cool Chinabuye's products
A couple cool Chinabuye's productsAnelia Lawrance
 
Measurement Blood Coagulation Rate
Measurement Blood Coagulation RateMeasurement Blood Coagulation Rate
Measurement Blood Coagulation RateBeirutQuantifiedSelf
 
사회적시간은행 이그나이트광주 20120317
사회적시간은행 이그나이트광주 20120317사회적시간은행 이그나이트광주 20120317
사회적시간은행 이그나이트광주 20120317aleccho
 
웹20년과정치 세종대웹20주년 20141017
웹20년과정치 세종대웹20주년 20141017웹20년과정치 세종대웹20주년 20141017
웹20년과정치 세종대웹20주년 20141017aleccho
 
A couple cool Chinabuye's products
A couple cool Chinabuye's productsA couple cool Chinabuye's products
A couple cool Chinabuye's productsAnelia Lawrance
 
кодове за Gta san andreas
кодове за Gta san andreasкодове за Gta san andreas
кодове за Gta san andreasminio10
 
High expectations @ SAMS
High expectations @ SAMSHigh expectations @ SAMS
High expectations @ SAMSSAMS2012
 
Teaching as Inquiry
Teaching as InquiryTeaching as Inquiry
Teaching as InquirySAMS2012
 
HVTT14 Investigating Heavy Vehicle Rollover Crashes and the Influence of Road...
HVTT14 Investigating Heavy Vehicle Rollover Crashes and the Influence of Road...HVTT14 Investigating Heavy Vehicle Rollover Crashes and the Influence of Road...
HVTT14 Investigating Heavy Vehicle Rollover Crashes and the Influence of Road...Johan Granlund
 
Jesica mafe
Jesica mafeJesica mafe
Jesica mafejesica16
 
HVTT14 Traffic Safety Risks with EU Tractor-Semitrailer Rigs on Slippery Roads
HVTT14 Traffic Safety Risks with EU Tractor-Semitrailer Rigs on Slippery RoadsHVTT14 Traffic Safety Risks with EU Tractor-Semitrailer Rigs on Slippery Roads
HVTT14 Traffic Safety Risks with EU Tractor-Semitrailer Rigs on Slippery RoadsJohan Granlund
 

Andere mochten auch (18)

A couple cool Chinabuye's products
A couple cool Chinabuye's productsA couple cool Chinabuye's products
A couple cool Chinabuye's products
 
Path to market analysis
Path to market analysisPath to market analysis
Path to market analysis
 
Measurement Blood Coagulation Rate
Measurement Blood Coagulation RateMeasurement Blood Coagulation Rate
Measurement Blood Coagulation Rate
 
Hiphop php
Hiphop phpHiphop php
Hiphop php
 
Exploiting your Data
Exploiting your DataExploiting your Data
Exploiting your Data
 
사회적시간은행 이그나이트광주 20120317
사회적시간은행 이그나이트광주 20120317사회적시간은행 이그나이트광주 20120317
사회적시간은행 이그나이트광주 20120317
 
웹20년과정치 세종대웹20주년 20141017
웹20년과정치 세종대웹20주년 20141017웹20년과정치 세종대웹20주년 20141017
웹20년과정치 세종대웹20주년 20141017
 
Formulir oprec saung sastra
Formulir oprec saung sastraFormulir oprec saung sastra
Formulir oprec saung sastra
 
A couple cool Chinabuye's products
A couple cool Chinabuye's productsA couple cool Chinabuye's products
A couple cool Chinabuye's products
 
кодове за Gta san andreas
кодове за Gta san andreasкодове за Gta san andreas
кодове за Gta san andreas
 
Samsung galaxy s 2
Samsung galaxy  s 2Samsung galaxy  s 2
Samsung galaxy s 2
 
High expectations @ SAMS
High expectations @ SAMSHigh expectations @ SAMS
High expectations @ SAMS
 
Teaching as Inquiry
Teaching as InquiryTeaching as Inquiry
Teaching as Inquiry
 
HVTT14 Investigating Heavy Vehicle Rollover Crashes and the Influence of Road...
HVTT14 Investigating Heavy Vehicle Rollover Crashes and the Influence of Road...HVTT14 Investigating Heavy Vehicle Rollover Crashes and the Influence of Road...
HVTT14 Investigating Heavy Vehicle Rollover Crashes and the Influence of Road...
 
Ppt laju reaksi
Ppt laju reaksiPpt laju reaksi
Ppt laju reaksi
 
Jesica mafe
Jesica mafeJesica mafe
Jesica mafe
 
mafe
mafe mafe
mafe
 
HVTT14 Traffic Safety Risks with EU Tractor-Semitrailer Rigs on Slippery Roads
HVTT14 Traffic Safety Risks with EU Tractor-Semitrailer Rigs on Slippery RoadsHVTT14 Traffic Safety Risks with EU Tractor-Semitrailer Rigs on Slippery Roads
HVTT14 Traffic Safety Risks with EU Tractor-Semitrailer Rigs on Slippery Roads
 

Ähnlich wie Akram najjar exploiting your data (for printing)

Statistical quality__control_2
Statistical  quality__control_2Statistical  quality__control_2
Statistical quality__control_2Tech_MX
 
Normal Distribution
Normal DistributionNormal Distribution
Normal DistributionCIToolkit
 
7. logistics regression using spss
7. logistics regression using spss7. logistics regression using spss
7. logistics regression using spssDr Nisha Arora
 
Lesson 5 - Chebyshev and Normal.ppt
Lesson 5 - Chebyshev and Normal.pptLesson 5 - Chebyshev and Normal.ppt
Lesson 5 - Chebyshev and Normal.pptlokeshgupta130
 
Quality control program 05042018
Quality control program 05042018Quality control program 05042018
Quality control program 05042018Syed Basheer
 
Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data sciencepujashri1975
 
A-guide-to-creating-and-interpreting-run-and-control-charts
A-guide-to-creating-and-interpreting-run-and-control-chartsA-guide-to-creating-and-interpreting-run-and-control-charts
A-guide-to-creating-and-interpreting-run-and-control-chartsRohitLakhotia12
 
Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...Giannis Tsakonas
 
Data analysis ( Bio-statistic )
Data analysis ( Bio-statistic )Data analysis ( Bio-statistic )
Data analysis ( Bio-statistic )Amany Elsayed
 
Internal quality control
Internal quality controlInternal quality control
Internal quality controlSyed Basheer
 
SPC WithAdrian Adrian Beale
SPC WithAdrian Adrian BealeSPC WithAdrian Adrian Beale
SPC WithAdrian Adrian BealeAdrian Beale
 

Ähnlich wie Akram najjar exploiting your data (for printing) (20)

Statistical quality__control_2
Statistical  quality__control_2Statistical  quality__control_2
Statistical quality__control_2
 
Qc tools
Qc toolsQc tools
Qc tools
 
Qc tools
Qc toolsQc tools
Qc tools
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
7. logistics regression using spss
7. logistics regression using spss7. logistics regression using spss
7. logistics regression using spss
 
Lesson 5 - Chebyshev and Normal.ppt
Lesson 5 - Chebyshev and Normal.pptLesson 5 - Chebyshev and Normal.ppt
Lesson 5 - Chebyshev and Normal.ppt
 
Measures of dispersion discuss 2.2
Measures of dispersion discuss 2.2Measures of dispersion discuss 2.2
Measures of dispersion discuss 2.2
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Measure of Dispersion in statistics
Measure of Dispersion in statisticsMeasure of Dispersion in statistics
Measure of Dispersion in statistics
 
Quality control program 05042018
Quality control program 05042018Quality control program 05042018
Quality control program 05042018
 
Module-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data scienceModule-2_Notes-with-Example for data science
Module-2_Notes-with-Example for data science
 
TESCO Evaluation of Non-Normal Meter Data
TESCO Evaluation of Non-Normal Meter DataTESCO Evaluation of Non-Normal Meter Data
TESCO Evaluation of Non-Normal Meter Data
 
Spc training
Spc training Spc training
Spc training
 
A-guide-to-creating-and-interpreting-run-and-control-charts
A-guide-to-creating-and-interpreting-run-and-control-chartsA-guide-to-creating-and-interpreting-run-and-control-charts
A-guide-to-creating-and-interpreting-run-and-control-charts
 
Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...Back to the basics-Part2: Data exploration: representing and testing data pro...
Back to the basics-Part2: Data exploration: representing and testing data pro...
 
Cairo 02 Stat Inference
Cairo 02 Stat InferenceCairo 02 Stat Inference
Cairo 02 Stat Inference
 
Data analysis ( Bio-statistic )
Data analysis ( Bio-statistic )Data analysis ( Bio-statistic )
Data analysis ( Bio-statistic )
 
Internal quality control
Internal quality controlInternal quality control
Internal quality control
 
SPC WithAdrian Adrian Beale
SPC WithAdrian Adrian BealeSPC WithAdrian Adrian Beale
SPC WithAdrian Adrian Beale
 
Statistical Process Control
Statistical Process ControlStatistical Process Control
Statistical Process Control
 

Kürzlich hochgeladen

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Kürzlich hochgeladen (20)

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

Akram najjar exploiting your data (for printing)

  • 1. Quantified Self Exploiting Your Data 22 March 2012 Akram Najjar This talk is en “eye opener” We will Not discuss Techniques or “How” Data is Analyzed! We will Only talk about “What” such methods can give us
  • 2. What Methods can you Apply to Your Data? A. The Bell Shaped Curve (Normal Distribution) B. Correlation of two variables C. Forecasting using Simple Linear Regression (Best Line of Fit) D. Statistical Process Control 3 / 25 Other Tools that work directly on Data . . . .  Goodness of Fit testing  Independence Testing  Moving Averages and Exponential Smoothing  Non-Linear Regression (polynomial, exponential, logarithmic)  Weighted Index Scoring  Excel: The Pivot Table  Excel: Conditional Formatting 4 / 25
  • 3. A. The Bell Shaped Curve (The Gaussian or Normal Distribution)  Useful when you have a lot of data  Prepare a Bar Chart or a Frequency Table  Most likely, they will plot as a Bell Shaped Curve (Normal/Gauss Curve)  Example: Measurements of most natural variables  Example: Measurements of most manufactured items  Prepare a frequency table of your data  How many times did you get a specific value?  Out of 200 measurements, how many times was your Systolic Blood Pressure = 110,115, 120, 125, 130, 135, 140 . . 5 / 25 Here are 24 Systolic Blood Pressure Measurements – They Look like a Bell Curve Probability of Pressure > 125 = (4 + 2) / 24 = 1/4 = 25% How many times? Probability of Pressure > 125 = (4 + 2) / 24 = 1/4 = 25%
  • 4. If we had 201 measurements . . . . Total Count in Bars = Area of Bars = Probability > 122 = 15.83% The Bell Shaped Curve is completely defined by: a) Average (115) of the data b) Standard deviation (7) of the data. It indicates how spread is our data from the average. (Approx 70% of observations are between 115-7 and 115+7)
  • 5. What do we get if we use the Bell Shaped Curve (Normal Distribution)?  Benefit 1: measuring the spread of our data  Benefit 2: we can now compare specific scores in two different population (next slide)  Benefit 3: if we know the measure, we can compute the probability of it happening  Benefit 4: if we know the probability, we can work out the cut off measure that will give it 9 / 25 If I have the same score 78 in Courses A and B, can I say I am doing the same in both? 78 72 88
  • 6. Benefits 3 and 4  Given a specific measurement or range, what is the probability of their occurrence?  Probability I will get a fever of more than 38 degrees?  Probability flights will be more than 30 minutes late?  Probability my systolic is > 122  Given the probability, what is the cutoff measurement?  I want to remain at a sugar level representing the top 15% allowed, what is the level related to that?  If Human Resources want the top 15% results, what is the passing grade? 11 / 25 B. Correlation  If we have two sets of data, how are they related?  Example: Blood Pressure vs Intake of Salt  Example: Advertising Expenditure vs Sales Revenue  Example: Hours walked per day vs Weight in Kilograms  What is the direction of the relationship?  Direct or inverse?  What is the strength of the relationship?  Correlation  We use the Correlation Function (Demonstrate in Excel) 12 / 25
  • 7. C. Forecasting using Simple Linear Regression (Best Line of Fit)  If we have an independent variable (X): Sugar Intake  And a dependent variable (Y): Weight  What is the relationship that allows us to forecast Weight for different Sugar Intakes?  We need two columns: X and Y  Simple Linear Regression allows us to find the Best Line to fit our data 13 / 25 Regression finds the Best Line that Fits our Observations 5 Y 4 3 2 1 0,0 1 2 3 4 5 6 7 8
  • 8. Which Straight Line Best Fits our Observations? 5 Y 4 3 2 1 0,0 1 2 3 4 5 6 7 8 Multiple Regression: allows us to find the Equation Y = aX1 + bX2 + cX3 + d X2 X3 X1 Y 16 / 25
  • 9. D. Statistical Process Control (SPC)  The Purpose of SPC is to Monitor a Process  SPC allows us to Check if a variable is behaving properly  Over time  Over different locations/departments  Over different events  Over different samples  Control Charts were first used in Bell Labs (1924)  Although mostly used in industry SPC can be used in any sector 17 / 25 The General Form of a Control Chart: 4 Components 4) Process Data 1) UCL : Upper Control Limit Our Variable 2) AL : Average Line 3) LCL : Lower Lower Limit The IDs of the Samples - - - - - OR The Time Series
  • 10. This Process is “In control” 50 45 40 35 Upper Limit 30 25 20 15 10 Lower Limit 5 0 This Process is Regularly “Out of Control” Look for an explanation INSIDE the system
  • 11. This Process is Irregularly “Out of Control” Look for an explanation OUTSIDE the system This Process is Irregularly “Out of Control”. Trends in either direction of 5 or more points Look for an explanation OUTSIDE the system
  • 12. The 7 Point Rule: there is a problem if 7 points in a row (Or more) are above the average or below it Look for an explanation OUTSIDE the system Types of Control Charts
  • 13. Thank you for your kind attention