SlideShare a Scribd company logo
1 of 30
Presented by –Harsh Upadhyay
Module Objective


          Agenda
•Introduce the concept of Simple Linear Regression

•Walk through the process of plotting our data

•Apply regression Techniques

•Evaluate our model

•Interpret the result

   Expected Learning
•Understand key Simple Linear Regression terminology

•Evaluate the relationship between a continuous X and continuous Y

•Use regression analysis to make predictions about process
Historical Note


•Sir Francis Galton (1822-1911) used the term
Regression.


•To explain the relationship between the heights
(inches) of fathers and their sons.
Simple Linear Regression


Regression analysis is used to predict the value of one variable (the dependent
variable) on the basis of other variables (the independent variables).

Dependent variable: denoted Y
Independent variables: denoted X1, X2, …, Xk

   If we only have ONE independent variable, the model is
Simple Linear Regression Analysis



Variables:
         X = Independent Variable (we provide this)
         Y = Dependent Variable (we observe this)
Parameters:
         β0 = Intercept
                 The y-intercept of a line is the point at which the line crosses the y
                  axis. ( i.e. where the x value equals 0)
             β1 = Slope
                  Change in the mean of Y for a unit change in X
             ε ~ Normal Random Variable (με = 0, σε = ???) [Noise]
Simple Linear Regression Analysis


Meaning of       and

   > 0 [positive slope]   < 0 [negative slope]

             y



                                         rise

                               run
                                           =slope (=rise/run)


  =y-intercept

                                                    x
Effect of Larger Values of σ ε


                Lower vs. Higher
                Variability
 Y




25K$

                    Y= β0+ β1 +

                                   X
The least Square Method


                                                            s
                                                         nce
                                               dif   fere
these differences are                       ed
                                        quar
called residuals or                  hes
errors                           of t
                           e sum line…
                    ze s th d the
                 imi nts an
              min poi
           ne
       s li n the
    Thi wee
     bet
Exercise

A black belt is connected with optimize a call center in a retail bank where clients place
the inquiries and order over the phone during 6 am to 6 pm (Monday to Friday). The
current staffing plan is begin with about 4 associates at 6 am and increases to about 35
associate by 9 am. At 3.30 pm the no. of associate begin to drop to about 7 by 6 pm. The
black belt is anticipating an increasing in call volume and want to know how many call
can be answer in 30 min time interval for various staffing level to better staff the call
center.

The black belt obtain the data on the no. of associate and the no. of calls answered
for each 30 min interval for the last two weeks. You have a total of 240 Samples.




                                          Data sheet
Scatter Plot
Fitted Line Plot

      Regression Analysis: CallsAnswd
      versus Staff

      The regression equation is
      CallsAnswd = - 8.844 + 3.099 Staff


      S = 11.7311     R-Sq = 83.4%     R-Sq(adj) = 83.3%


      Analysis of Variance

      Source         DF       SS       MS         F       P
      Regression      1   164091   164091   1192.37   0.000
      Error         238    32753      138
      Total         239   196844



      Fitted Line: CallsAnswd versus Staff
Minitab Output-Session Window

CallsAnswd = - 8.844 + 3.099 Staff

CallsAnswd = - 8.844 + 3.099 (17)=43.839

CallsAnswd = - 8.844 + 3.099 (22)=59.334


                 How Confident are you
                 with this model?

                               59 Calls
   Staff of 22         ?       59 Calls
                               59 Calls
Assessing The Model

              Evaluate the strength of the regression model
1. Determine how much variation in our calls answered data is actually explained by
   staff.

2. Determine The strength of the relationship between Call Answered and Staff


Regression Analysis: CallsAnswd                                     R-sq
versus Staff
The regression equation is
CallsAnswd = - 8.844 + 3.099 Staff                         % of variation in the Y
S = 11.7311    R-Sq = 83.4%    R-Sq(adj) = 83.3%           values explained by the
                                                           linear relationship with X
Analysis of Variance
Source       DF      SS       MS         F       P
Regression    1 164091    164091   1192.37   0.000
Error       238   32753      138                         % of variation Calls Answered
Total       239 196844                                   explained by Staff
Fitted Line: CallsAnswd versus Staff
% of Variation-Explanation




                                                       Explained       Total
                                                       Variation (Y)   Variation (Y)




              Explained Variation
R Squared =      Total Variation    = Between 0 and 1 (0% to 100%)
Properties of R

                                                              The Correlation Coefficient
                                      √ R2 = R         Measure the strength of the linear relationship
                                                          (Pearson’s Correlation of coefficient)




                                                                    R
      Strong            Moderate                     Low                            Low                   Moderate           Moderate
-1 Relationship -0.8   Relationship   -0.5       Relationship        0          Relationship    0.5      Relationship   0.8 Relationship +1
Caution- Correlation and
                          Causation


                                          Calls Answered



                                                  Staff
                                              Correlated




   Correlation                                Causation
                                        Change in one variable causes
Two things vary together
                                             change in another
Regression Interpretation-
                               Graphical




•Residual is not normal
•Residual Vs Fits is not constant
Regression- Session Window

Regression Analysis: CallsAnswd versus Staff         Unusual Observations

The regression equation is                           Obs   Staff CallsAnswd Fit      SE Fit   Residual   St Resid
CallsAnswd = - 8.84 + 3.10 Staff                       1   3.0    6.000     0.454     1.995     5.546     0.48 X
                                                       3   4.0    7.000     3.554     1.913     3.446     0.30 X
                                                       6   4.0   11.000     3.554     1.913     7.446     0.64 X
Predictor Coef SE Coef      T    P                     7   4.0   12.000     3.554     1.913     8.446     0.73 X
Constant -8.844 2.247     -3.94 0.000                  8   3.0    5.000     0.454     1.995     4.546     0.39 X
Staff     3.09947 0.08976 34.53 0.000                  9   3.0    7.000     0.454     1.995     6.546     0.57 X
                                                      67   29.0 109.000     81.040    0.901    27.960     2.39R
                                                      78   32.0 127.000     90.339    1.071    36.661     3.14R
S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           141   30.0 134.000     84.140    0.952    49.860     4.26R
                                                     151   30.0 114.000     84.140    0.952    29.860     2.55R
                                                     161   33.0 128.000     93.438    1.136    34.562     2.96R
Analysis of Variance                                 171   33.0 126.000     93.438    1.136    32.562     2.79R
                                                     183   32.0    59.000   90.339    1.071   -31.339    -2.68R
Source         DF       SS     MS       F       P    199   29.0    48.000   81.040    0.901   -33.040    -2.82R
Regression     1       164091 164091 1192.37 0.000   202   25.0    35.000   68.643    0.768   -33.643    -2.87R
Residual Error 238     32753 138                     235    4.0   14.000     3.554    1.913    10.446     0.90 X
Total          239     196844                        236    4.0    7.000     3.554    1.913     3.446     0.30 X

                                                     R denotes an observation with a large standardized residual (>2)
                                                     X denotes an observation whose X value gives it large leverage
Unusual Observations
 Unusual Observations                                               Unusual Observations

 Obs   Staff CallsAnswd Fit      SE Fit   Residual      St Resid    Obs   Staff CallsAnswd Fit      SE Fit   Residual   St Resid
   1   3.0    6.000     0.454     1.995     5.546        0.48 X       1   3.0    6.000     0.454     1.995     5.546     0.48 X
   3   4.0    7.000     3.554     1.913     3.446        0.30 X       3   4.0    7.000     3.554     1.913     3.446     0.30 X
   6   4.0   11.000     3.554     1.913     7.446        0.64 X       6   4.0   11.000     3.554     1.913     7.446     0.64 X
   7   4.0   12.000     3.554     1.913     8.446        0.73 X       7   4.0   12.000     3.554     1.913     8.446     0.73 X
   8   3.0    5.000     0.454     1.995     4.546        0.39 X       8   3.0    5.000     0.454     1.995     4.546     0.39 X
   9   3.0    7.000     0.454     1.995     6.546        0.57 X       9   3.0    7.000     0.454     1.995     6.546     0.57 X
  67   29.0 109.000     81.040    0.901    27.960        2.39R       67   29.0 109.000     81.040    0.901    27.960     2.39R
  78   32.0 127.000     90.339    1.071    36.661        3.14R       78   32.0 127.000     90.339    1.071    36.661     3.14R
 141   30.0 134.000     84.140    0.952    49.860        4.26R      141   30.0 134.000     84.140    0.952    49.860     4.26R
 151   30.0 114.000     84.140    0.952    29.860        2.55R      151   30.0 114.000     84.140    0.952    29.860     2.55R
 161   33.0 128.000     93.438    1.136    34.562        2.96R      161   33.0 128.000     93.438    1.136    34.562     2.96R
 171   33.0 126.000     93.438    1.136    32.562        2.79R      171   33.0 126.000     93.438    1.136    32.562     2.79R
 183   32.0    59.000   90.339    1.071   -31.339       -2.68R      183   32.0    59.000   90.339    1.071   -31.339    -2.68R
 199   29.0    48.000   81.040    0.901   -33.040       -2.82R      199   29.0    48.000   81.040    0.901   -33.040    -2.82R
 202   25.0    35.000   68.643    0.768   -33.643       -2.87R      202   25.0    35.000   68.643    0.768   -33.643    -2.87R
 235    4.0   14.000     3.554    1.913    10.446        0.90 X     235    4.0   14.000     3.554    1.913    10.446     0.90 X
 236    4.0    7.000     3.554    1.913     3.446        0.30 X     236    4.0    7.000     3.554    1.913     3.446     0.30 X

 R denotes an observation with a large standardized residual (>2)   X denotes an observation whose X value gives it large leverage

                                          240 Data points

Unusual observation                       4% = 9 points


Residual values (R) ≤
5% of total observation
Original Vs Altered

Original Graph               Altered Graph
Interpreting The Model

Regression Analysis: CallsAnswd versus Staff

The regression equation is
CallsAnswd = - 8.84 + 3.10 Staff


Predictor    Coef    SE Coef         T     P
Constant    -8.844    2.247        -3.94 0.000         Predictor Table
Staff        3.09947 0.08976        34.53 0.000



S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           Additional Statistics


Analysis of Variance

Source         DF       SS     MS       F       P
Regression     1       164091 164091 1192.37 0.000
                                                        ANOVA Table
Residual Error 238     32753 138
Total          239     196844
Interpreting The Model

Regression Analysis: CallsAnswd versus Staff

The regression equation is
CallsAnswd = - 8.84 + 3.10 Staff
                                                           Predictor Table
Predictor    Coef    SE Coef         T     P
Constant    -8.844    2.247        -3.94 0.000
Staff        3.09947 0.08976        34.53 0.000      Ho: Slope = 0
                                                     •No difference in Y when X
                                                     changes
S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           •X has no impact on y


Analysis of Variance                                 Ha: Slope ≠ 0
Source         DF       SS     MS       F       P
Regression     1       164091 164091 1192.37 0.000
                                                     •Y changes as X changes
Residual Error 238     32753 138                     •X has impact on y
Total          239     196844
Interpreting The Model

Regression Analysis: CallsAnswd versus Staff

The regression equation is
CallsAnswd = - 8.84 + 3.10 Staff
                                                         Additional Statistics
Predictor    Coef    SE Coef         T     P
Constant    -8.844    2.247        -3.94 0.000
                                                     R-squared
Staff        3.09947 0.08976        34.53 0.000      •The amount of variation
                                                     explained by this variation

S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           R-squared (Adjusted)
                                                     •Account for the no. of X’s used in
                                                     model (Used in multiple
Analysis of Variance                                 regression)

Source         DF       SS     MS       F       P    S
Regression     1       164091 164091 1192.37 0.000
Residual Error 238     32753 138
                                                     •Standard deviation for the
Total          239     196844                        leftover or unexplained variation.
                                                     It is used for designing the
                                                     confidence interval
Interpreting The Model

Regression Analysis: CallsAnswd versus Staff

The regression equation is
CallsAnswd = - 8.84 + 3.10 Staff
                                                             ANOVA Table
Predictor    Coef    SE Coef         T     P
Constant    -8.844    2.247        -3.94 0.000
                                                     Regression
Staff        3.09947 0.08976        34.53 0.000
                                                     •The explained variation


S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3%           Residual Error
                                                     •Unexplained Variation
Analysis of Variance

Source         DF       SS     MS       F       P
                                                     Ho:
Regression     1       164091 164091 1192.37 0.000   •The model does not explain the
Residual Error 238     32753 138                     observed variation
Total          239     196844

                                                     Ha:
                                                     •The model does explain the
                                                     observed variation
Confidence on Output

 CallsAnswd = -8.844 + 3.099 Staff



If Staff 10 then, - 8.844 + 3.099 (10) = 22

If Staff 20 then, - 8.844 + 3.099 (20) = 53

If Staff 30 then, - 8.844 + 3.099 (30) = 84


               Confidence = ???%

               R-squared = 83.4%
                                              Report result using
         Unexplained Variation = 16.6%
                                              confidence intervals
Confidence on Output



 95% confidence the true
regression line lies within




                        95% prediction interval expect 95%
                          of the data points to fall within
Confidence on Output



Confidence interval report
  average performance




                         Prediction intervals
                        predict actual values
Predicted Value for Y
Let management decide max 20 staff, then how may call answer by these
20 executive



Predicted Values for New Observations

New Obs  Fit   SE Fit          95% CI              95% PI
   1    53.145 0.822       (51.526, 54.765)   (29.979, 76.312)


Values of Predictors for New Observations

New Obs Staff
  1 20.0
Review Methodology

1. Get Familiar with the data                     3. Check the model & the assumption

• Identify the output or Y variable               • Look at the residual plot
• Identify the X variable or predictor                   Are assumption valid?
• Look at the time series, dot plots, histogram          Do residual look ok?
and scatter plot                                         Are there unusual observation? If yes,
• Run descriptive statistics                            address, if possible and rerun the analysis
        Check for outliers                       • What is r-squared?
        Check for gaps in the data               • How much variation can be explained by the
                                                  model?
                                                  • Look at P –values
                                                         Does X have significant impact on Y?



2. Fit the model to the data                      4. Report Result & Use Equation
• Run regression                                  • Summarize the results for your stakeholders
• Use the regression                              • If you have excluded data, explain why
                                                  • If you kept outlier in the data, explain why
                                                  • How much variation in Y can be explained by X?
                                                  • Remember causation vs. correlation
                                                  • Make a predictions using confidence and
                                                  prediction intervals.
Simple linear regression (final)

More Related Content

What's hot

Regression Analysis
Regression AnalysisRegression Analysis
Regression AnalysisASAD ALI
 
Simple linear regression and correlation
Simple linear regression and correlationSimple linear regression and correlation
Simple linear regression and correlationShakeel Nouman
 
Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentationCarlo Magno
 
Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis pptElkana Rorio
 
Regression
Regression Regression
Regression Ali Raza
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionAbdelaziz Tayoun
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression AnalysisSalim Azad
 
Mpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationMpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationVasant Kothari
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionKhalid Aziz
 
Basics of Regression analysis
 Basics of Regression analysis Basics of Regression analysis
Basics of Regression analysisMahak Vijayvargiya
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysisRabin BK
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regressionUnsa Shakir
 
Linear regression
Linear regressionLinear regression
Linear regressionSreerajVA
 
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
Regression Analysis presentation by Al Arizmendez and Cathryn LottierRegression Analysis presentation by Al Arizmendez and Cathryn Lottier
Regression Analysis presentation by Al Arizmendez and Cathryn LottierAl Arizmendez
 

What's hot (20)

Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Simple linear regression and correlation
Simple linear regression and correlationSimple linear regression and correlation
Simple linear regression and correlation
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentation
 
Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
 
Regression
Regression Regression
Regression
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Regression ppt
Regression pptRegression ppt
Regression ppt
 
Regression
RegressionRegression
Regression
 
Correlation and Simple Regression
Correlation  and Simple RegressionCorrelation  and Simple Regression
Correlation and Simple Regression
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Mpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationMpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlation
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Basics of Regression analysis
 Basics of Regression analysis Basics of Regression analysis
Basics of Regression analysis
 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regression
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
Regression Analysis presentation by Al Arizmendez and Cathryn LottierRegression Analysis presentation by Al Arizmendez and Cathryn Lottier
Regression Analysis presentation by Al Arizmendez and Cathryn Lottier
 

Viewers also liked

Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regressionalok tiwari
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variabilityMark Santos
 
Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Upama Dwivedi
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysisnadiazaheer
 
Normal Curve and Standard Scores
Normal Curve and Standard ScoresNormal Curve and Standard Scores
Normal Curve and Standard ScoresJenewel Azuelo
 
Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)Ramnath Takiar
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionAzmi Mohd Tamil
 
Statistics
StatisticsStatistics
Statisticspikuoec
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Sneh Kumari
 

Viewers also liked (16)

Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
 
An Overview of Simple Linear Regression
An Overview of Simple Linear RegressionAn Overview of Simple Linear Regression
An Overview of Simple Linear Regression
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variability
 
Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Normal Curve and Standard Scores
Normal Curve and Standard ScoresNormal Curve and Standard Scores
Normal Curve and Standard Scores
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
 
Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear Regression
 
Pearson Correlation
Pearson CorrelationPearson Correlation
Pearson Correlation
 
Statistics
StatisticsStatistics
Statistics
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
 

Similar to Simple linear regression (final)

Regression and Co-Relation
Regression and Co-RelationRegression and Co-Relation
Regression and Co-Relationnuwan udugampala
 
Linear Regression
Linear Regression Linear Regression
Linear Regression Rupak Roy
 
Correlation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxCorrelation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxUnfold1
 
Cost Behavior
Cost BehaviorCost Behavior
Cost BehaviorAIS_USU
 
Corr-and-Regress (1).ppt
Corr-and-Regress (1).pptCorr-and-Regress (1).ppt
Corr-and-Regress (1).pptMuhammadAftab89
 
Cr-and-Regress.ppt
Cr-and-Regress.pptCr-and-Regress.ppt
Cr-and-Regress.pptRidaIrfan10
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.pptkrunal soni
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.pptMoinPasha12
 
Correlation & Regression for Statistics Social Science
Correlation & Regression for Statistics Social ScienceCorrelation & Regression for Statistics Social Science
Correlation & Regression for Statistics Social Sciencessuser71ac73
 
Numerical approximation
Numerical approximationNumerical approximation
Numerical approximationMileacre
 
CORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resourceCORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resourceSharon517605
 
regression and correlation
regression and correlationregression and correlation
regression and correlationPriya Sharma
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersionWaqar Abbasi
 
Physics_150612_01
Physics_150612_01Physics_150612_01
Physics_150612_01Art Traynor
 
Regression Analysis.pptx
Regression Analysis.pptxRegression Analysis.pptx
Regression Analysis.pptxShivankAggatwal
 

Similar to Simple linear regression (final) (20)

Regression and Co-Relation
Regression and Co-RelationRegression and Co-Relation
Regression and Co-Relation
 
Linear Regression
Linear Regression Linear Regression
Linear Regression
 
UNIT 4.pptx
UNIT 4.pptxUNIT 4.pptx
UNIT 4.pptx
 
Correlation and Regression Analysis.pptx
Correlation and Regression Analysis.pptxCorrelation and Regression Analysis.pptx
Correlation and Regression Analysis.pptx
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Cost Behavior
Cost BehaviorCost Behavior
Cost Behavior
 
Corr-and-Regress (1).ppt
Corr-and-Regress (1).pptCorr-and-Regress (1).ppt
Corr-and-Regress (1).ppt
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
 
Cr-and-Regress.ppt
Cr-and-Regress.pptCr-and-Regress.ppt
Cr-and-Regress.ppt
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
 
Corr-and-Regress.ppt
Corr-and-Regress.pptCorr-and-Regress.ppt
Corr-and-Regress.ppt
 
Correlation & Regression for Statistics Social Science
Correlation & Regression for Statistics Social ScienceCorrelation & Regression for Statistics Social Science
Correlation & Regression for Statistics Social Science
 
Numerical approximation
Numerical approximationNumerical approximation
Numerical approximation
 
CORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resourceCORRELATION-AND-REGRESSION.pdf for human resource
CORRELATION-AND-REGRESSION.pdf for human resource
 
regression and correlation
regression and correlationregression and correlation
regression and correlation
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersion
 
Physics_150612_01
Physics_150612_01Physics_150612_01
Physics_150612_01
 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
 
Regression Analysis.pptx
Regression Analysis.pptxRegression Analysis.pptx
Regression Analysis.pptx
 

Recently uploaded

How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxUmeshTimilsina1
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxCeline George
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Pooja Bhuva
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxannathomasp01
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 

Recently uploaded (20)

How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 

Simple linear regression (final)

  • 2. Module Objective Agenda •Introduce the concept of Simple Linear Regression •Walk through the process of plotting our data •Apply regression Techniques •Evaluate our model •Interpret the result Expected Learning •Understand key Simple Linear Regression terminology •Evaluate the relationship between a continuous X and continuous Y •Use regression analysis to make predictions about process
  • 3. Historical Note •Sir Francis Galton (1822-1911) used the term Regression. •To explain the relationship between the heights (inches) of fathers and their sons.
  • 4. Simple Linear Regression Regression analysis is used to predict the value of one variable (the dependent variable) on the basis of other variables (the independent variables). Dependent variable: denoted Y Independent variables: denoted X1, X2, …, Xk If we only have ONE independent variable, the model is
  • 5. Simple Linear Regression Analysis Variables: X = Independent Variable (we provide this) Y = Dependent Variable (we observe this) Parameters: β0 = Intercept The y-intercept of a line is the point at which the line crosses the y axis. ( i.e. where the x value equals 0) β1 = Slope Change in the mean of Y for a unit change in X ε ~ Normal Random Variable (με = 0, σε = ???) [Noise]
  • 6. Simple Linear Regression Analysis Meaning of and > 0 [positive slope] < 0 [negative slope] y rise run =slope (=rise/run) =y-intercept x
  • 7. Effect of Larger Values of σ ε Lower vs. Higher Variability Y 25K$ Y= β0+ β1 + X
  • 8. The least Square Method s nce dif fere these differences are ed quar called residuals or hes errors of t e sum line… ze s th d the imi nts an min poi ne s li n the Thi wee bet
  • 9. Exercise A black belt is connected with optimize a call center in a retail bank where clients place the inquiries and order over the phone during 6 am to 6 pm (Monday to Friday). The current staffing plan is begin with about 4 associates at 6 am and increases to about 35 associate by 9 am. At 3.30 pm the no. of associate begin to drop to about 7 by 6 pm. The black belt is anticipating an increasing in call volume and want to know how many call can be answer in 30 min time interval for various staffing level to better staff the call center. The black belt obtain the data on the no. of associate and the no. of calls answered for each 30 min interval for the last two weeks. You have a total of 240 Samples. Data sheet
  • 11. Fitted Line Plot Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.844 + 3.099 Staff S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% Analysis of Variance Source DF SS MS F P Regression 1 164091 164091 1192.37 0.000 Error 238 32753 138 Total 239 196844 Fitted Line: CallsAnswd versus Staff
  • 12. Minitab Output-Session Window CallsAnswd = - 8.844 + 3.099 Staff CallsAnswd = - 8.844 + 3.099 (17)=43.839 CallsAnswd = - 8.844 + 3.099 (22)=59.334 How Confident are you with this model? 59 Calls Staff of 22 ? 59 Calls 59 Calls
  • 13. Assessing The Model Evaluate the strength of the regression model 1. Determine how much variation in our calls answered data is actually explained by staff. 2. Determine The strength of the relationship between Call Answered and Staff Regression Analysis: CallsAnswd R-sq versus Staff The regression equation is CallsAnswd = - 8.844 + 3.099 Staff % of variation in the Y S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% values explained by the linear relationship with X Analysis of Variance Source DF SS MS F P Regression 1 164091 164091 1192.37 0.000 Error 238 32753 138 % of variation Calls Answered Total 239 196844 explained by Staff Fitted Line: CallsAnswd versus Staff
  • 14. % of Variation-Explanation Explained Total Variation (Y) Variation (Y) Explained Variation R Squared = Total Variation = Between 0 and 1 (0% to 100%)
  • 15. Properties of R The Correlation Coefficient √ R2 = R Measure the strength of the linear relationship (Pearson’s Correlation of coefficient) R Strong Moderate Low Low Moderate Moderate -1 Relationship -0.8 Relationship -0.5 Relationship 0 Relationship 0.5 Relationship 0.8 Relationship +1
  • 16. Caution- Correlation and Causation Calls Answered Staff Correlated Correlation Causation Change in one variable causes Two things vary together change in another
  • 17. Regression Interpretation- Graphical •Residual is not normal •Residual Vs Fits is not constant
  • 18. Regression- Session Window Regression Analysis: CallsAnswd versus Staff Unusual Observations The regression equation is Obs Staff CallsAnswd Fit SE Fit Residual St Resid CallsAnswd = - 8.84 + 3.10 Staff 1 3.0 6.000 0.454 1.995 5.546 0.48 X 3 4.0 7.000 3.554 1.913 3.446 0.30 X 6 4.0 11.000 3.554 1.913 7.446 0.64 X Predictor Coef SE Coef T P 7 4.0 12.000 3.554 1.913 8.446 0.73 X Constant -8.844 2.247 -3.94 0.000 8 3.0 5.000 0.454 1.995 4.546 0.39 X Staff 3.09947 0.08976 34.53 0.000 9 3.0 7.000 0.454 1.995 6.546 0.57 X 67 29.0 109.000 81.040 0.901 27.960 2.39R 78 32.0 127.000 90.339 1.071 36.661 3.14R S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% 141 30.0 134.000 84.140 0.952 49.860 4.26R 151 30.0 114.000 84.140 0.952 29.860 2.55R 161 33.0 128.000 93.438 1.136 34.562 2.96R Analysis of Variance 171 33.0 126.000 93.438 1.136 32.562 2.79R 183 32.0 59.000 90.339 1.071 -31.339 -2.68R Source DF SS MS F P 199 29.0 48.000 81.040 0.901 -33.040 -2.82R Regression 1 164091 164091 1192.37 0.000 202 25.0 35.000 68.643 0.768 -33.643 -2.87R Residual Error 238 32753 138 235 4.0 14.000 3.554 1.913 10.446 0.90 X Total 239 196844 236 4.0 7.000 3.554 1.913 3.446 0.30 X R denotes an observation with a large standardized residual (>2) X denotes an observation whose X value gives it large leverage
  • 19. Unusual Observations Unusual Observations Unusual Observations Obs Staff CallsAnswd Fit SE Fit Residual St Resid Obs Staff CallsAnswd Fit SE Fit Residual St Resid 1 3.0 6.000 0.454 1.995 5.546 0.48 X 1 3.0 6.000 0.454 1.995 5.546 0.48 X 3 4.0 7.000 3.554 1.913 3.446 0.30 X 3 4.0 7.000 3.554 1.913 3.446 0.30 X 6 4.0 11.000 3.554 1.913 7.446 0.64 X 6 4.0 11.000 3.554 1.913 7.446 0.64 X 7 4.0 12.000 3.554 1.913 8.446 0.73 X 7 4.0 12.000 3.554 1.913 8.446 0.73 X 8 3.0 5.000 0.454 1.995 4.546 0.39 X 8 3.0 5.000 0.454 1.995 4.546 0.39 X 9 3.0 7.000 0.454 1.995 6.546 0.57 X 9 3.0 7.000 0.454 1.995 6.546 0.57 X 67 29.0 109.000 81.040 0.901 27.960 2.39R 67 29.0 109.000 81.040 0.901 27.960 2.39R 78 32.0 127.000 90.339 1.071 36.661 3.14R 78 32.0 127.000 90.339 1.071 36.661 3.14R 141 30.0 134.000 84.140 0.952 49.860 4.26R 141 30.0 134.000 84.140 0.952 49.860 4.26R 151 30.0 114.000 84.140 0.952 29.860 2.55R 151 30.0 114.000 84.140 0.952 29.860 2.55R 161 33.0 128.000 93.438 1.136 34.562 2.96R 161 33.0 128.000 93.438 1.136 34.562 2.96R 171 33.0 126.000 93.438 1.136 32.562 2.79R 171 33.0 126.000 93.438 1.136 32.562 2.79R 183 32.0 59.000 90.339 1.071 -31.339 -2.68R 183 32.0 59.000 90.339 1.071 -31.339 -2.68R 199 29.0 48.000 81.040 0.901 -33.040 -2.82R 199 29.0 48.000 81.040 0.901 -33.040 -2.82R 202 25.0 35.000 68.643 0.768 -33.643 -2.87R 202 25.0 35.000 68.643 0.768 -33.643 -2.87R 235 4.0 14.000 3.554 1.913 10.446 0.90 X 235 4.0 14.000 3.554 1.913 10.446 0.90 X 236 4.0 7.000 3.554 1.913 3.446 0.30 X 236 4.0 7.000 3.554 1.913 3.446 0.30 X R denotes an observation with a large standardized residual (>2) X denotes an observation whose X value gives it large leverage 240 Data points Unusual observation 4% = 9 points Residual values (R) ≤ 5% of total observation
  • 20. Original Vs Altered Original Graph Altered Graph
  • 21. Interpreting The Model Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.84 + 3.10 Staff Predictor Coef SE Coef T P Constant -8.844 2.247 -3.94 0.000 Predictor Table Staff 3.09947 0.08976 34.53 0.000 S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% Additional Statistics Analysis of Variance Source DF SS MS F P Regression 1 164091 164091 1192.37 0.000 ANOVA Table Residual Error 238 32753 138 Total 239 196844
  • 22. Interpreting The Model Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.84 + 3.10 Staff Predictor Table Predictor Coef SE Coef T P Constant -8.844 2.247 -3.94 0.000 Staff 3.09947 0.08976 34.53 0.000 Ho: Slope = 0 •No difference in Y when X changes S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% •X has no impact on y Analysis of Variance Ha: Slope ≠ 0 Source DF SS MS F P Regression 1 164091 164091 1192.37 0.000 •Y changes as X changes Residual Error 238 32753 138 •X has impact on y Total 239 196844
  • 23. Interpreting The Model Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.84 + 3.10 Staff Additional Statistics Predictor Coef SE Coef T P Constant -8.844 2.247 -3.94 0.000 R-squared Staff 3.09947 0.08976 34.53 0.000 •The amount of variation explained by this variation S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% R-squared (Adjusted) •Account for the no. of X’s used in model (Used in multiple Analysis of Variance regression) Source DF SS MS F P S Regression 1 164091 164091 1192.37 0.000 Residual Error 238 32753 138 •Standard deviation for the Total 239 196844 leftover or unexplained variation. It is used for designing the confidence interval
  • 24. Interpreting The Model Regression Analysis: CallsAnswd versus Staff The regression equation is CallsAnswd = - 8.84 + 3.10 Staff ANOVA Table Predictor Coef SE Coef T P Constant -8.844 2.247 -3.94 0.000 Regression Staff 3.09947 0.08976 34.53 0.000 •The explained variation S = 11.7311 R-Sq = 83.4% R-Sq(adj) = 83.3% Residual Error •Unexplained Variation Analysis of Variance Source DF SS MS F P Ho: Regression 1 164091 164091 1192.37 0.000 •The model does not explain the Residual Error 238 32753 138 observed variation Total 239 196844 Ha: •The model does explain the observed variation
  • 25. Confidence on Output CallsAnswd = -8.844 + 3.099 Staff If Staff 10 then, - 8.844 + 3.099 (10) = 22 If Staff 20 then, - 8.844 + 3.099 (20) = 53 If Staff 30 then, - 8.844 + 3.099 (30) = 84 Confidence = ???% R-squared = 83.4% Report result using Unexplained Variation = 16.6% confidence intervals
  • 26. Confidence on Output 95% confidence the true regression line lies within 95% prediction interval expect 95% of the data points to fall within
  • 27. Confidence on Output Confidence interval report average performance Prediction intervals predict actual values
  • 28. Predicted Value for Y Let management decide max 20 staff, then how may call answer by these 20 executive Predicted Values for New Observations New Obs Fit SE Fit 95% CI 95% PI 1 53.145 0.822 (51.526, 54.765) (29.979, 76.312) Values of Predictors for New Observations New Obs Staff 1 20.0
  • 29. Review Methodology 1. Get Familiar with the data 3. Check the model & the assumption • Identify the output or Y variable • Look at the residual plot • Identify the X variable or predictor  Are assumption valid? • Look at the time series, dot plots, histogram  Do residual look ok? and scatter plot  Are there unusual observation? If yes, • Run descriptive statistics address, if possible and rerun the analysis  Check for outliers • What is r-squared?  Check for gaps in the data • How much variation can be explained by the model? • Look at P –values  Does X have significant impact on Y? 2. Fit the model to the data 4. Report Result & Use Equation • Run regression • Summarize the results for your stakeholders • Use the regression • If you have excluded data, explain why • If you kept outlier in the data, explain why • How much variation in Y can be explained by X? • Remember causation vs. correlation • Make a predictions using confidence and prediction intervals.