SlideShare a Scribd company logo
1 of 38
Chapter 4
Describing the Relation
Between Two Variables
4.1
Scatter Diagrams; Correlation
Bivariate data is data in which two
variables are measured on an individual.
The response variable is the variable
whose value can be explained or
determined based upon the value of the
predictor variable.
A lurking variable is one that is related to
the response and/or predictor variable, but
is excluded from the analysis
A scatter diagram shows the relationship
between two quantitative variables
measured on the same individual. Each
individual in the data set is represented by a
point in the scatter diagram. The predictor
variable is plotted on the horizontal axis and
the response variable is plotted on the
vertical axis. Do not connect the points
when drawing a scatter diagram.
EXAMPLE Drawing a Scatter Diagram
The following data are based on a study for
drilling rock. The researchers wanted to
determine whether the time it takes to dry drill
a distance of 5 feet in rock increases with the
depth at which the drilling begins. So, depth
at which drilling begins is the predictor
variable, x, and time (in minutes) to drill five
feet is the response variable, y. Draw a
scatter diagram of the data.
Source: Penner, R., and Watts, D.G. “Mining Information.” The American Statistician, Vol.
45, No. 1, Feb. 1991, p. 6.
Two variables that are linearly related are said to
be positively associated when above average
values of one variable are associated with above
average values of the corresponding variable.
That is, two variables are positively associated
when the values of the predictor variable increase,
the values of the response variable also increase.
Two variables that are linearly related are said to
be negatively associated when above average
values of one variable are associated with below
average values of the corresponding variable.
That is, two variables are negatively associated
when the values of the predictor variable increase,
the values of the response variable decrease
The linear correlation coefficient or Pearson
product moment correlation coefficient is a
measure of the strength of linear relation between
two quantitative variables. We use the Greek letter
(rho) to represent the population correlation
coefficient and r to represent the sample correlation
coefficient. We shall only present the formula for
the sample correlation coefficient.
1. The linear correlation coefficient is always
between -1 and 1, inclusive. That is, -1 < r < 1.
2. If r = +1, there is a perfect positive linear relation
between the two variables.
3. If r = -1, there is a perfect negative linear relation
between the two variables.
4. The closer r is to +1, the stronger the evidence of
positive association between the two variables.
5. The closer r is to -1, the stronger the evidence of
negative association between the two variables.
Properties of the Linear Correlation CoefficientProperties of the Linear Correlation Coefficient
6. If r is close to 0, there is evidence of no linear
relation between the two variables. Because the
linear correlation coefficient is a measure of
strength of linear relation, r close to 0 does not
imply no relation, just no linear relation.
7. It is a unitless measure of association. So, the
unit of measure for x and y plays no role in the
interpretation of r.
Properties of the Linear Correlation CoefficientProperties of the Linear Correlation Coefficient
EXAMPLE Drawing a Scatter Diagram and
Computing the Correlation Coefficient
For the following data
(a)Draw a scatter diagram and comment on the
type of relation that appears to exist between x
and y.
(b) By hand, compute the linear correlation
coefficient.
EXAMPLE Determining the Linear
Correlation Coefficient
Determine the linear correlation coefficient
of the drilling data.
i
x
x x
s
− i
y
y y
s
−
i i
x y
x x y y
s s
  − −
   
  
x =
y =
A linear correlation coefficient that implies
a strong positive or negative association
that is computed using observational data
does not imply causation among the
variables.
Chapter 4
Describing the Relation
Between Two Variables
4.2
Least-squares Regression
EXAMPLE Finding an Equation that Describes
a Linear Relation
(a) Find a linear equation that relates x (the
predictor variable) and y (the response variable)
by selecting two points and finding the equation
of the line containing the points.
(b) Graph the equation on the scatter diagram.
(c) Use the equation to predict y if x = 5.
Using the following sample data:
The difference between the observed value
of y and the predicted value of y is the error
or residual. That is
residual = observed - predicted
Compute the residual for the prediction
corresponding to x = 5.
EXAMPLE Finding the Least-squares
Regression Line
Using the sample data:
(a) Find the least-squares regression line.
(b) Interpret the slope and intercept.
(c) Predict y if x = 5.
(d) Compute the residual for x = 5.
(e) Draw the least-squares regression line on the
scatter diagram of the data.
EXAMPLE Computing the Sum of Squared
Residuals
Compute the sum of squared residuals for
the line describing the relation between x
and y that was obtained using two points.
Compute the sum of squared residuals for
the least-squares regression line. Which is
smaller?
EXAMPLE Finding the Least-squares
Regression Line
(a) Find the least-squares regression line
for the drilling data.
(b) Use the line to predict the drilling time
at x = 130 feet.
(c) Should the line be used to predict the
drilling time at x = 400 feet? Why?
(d) Interpret the slope and y-intercept.
Math n Statistic
Math n Statistic

More Related Content

What's hot

Regression analysis
Regression analysisRegression analysis
Regression analysissaba khan
 
Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Lauren Crosby
 
Sumit presentation
Sumit presentationSumit presentation
Sumit presentationSumit Bharti
 
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Neeraj Bhandari
 
Regression & correlation
Regression & correlationRegression & correlation
Regression & correlationAtiq Rehman
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regressionalok tiwari
 
Correlation and Regression ppt
Correlation and Regression pptCorrelation and Regression ppt
Correlation and Regression pptSantosh Bhaskar
 
9.2 lin reg coeff of det
9.2 lin reg coeff of det9.2 lin reg coeff of det
9.2 lin reg coeff of detleblance
 
Regression analysis
Regression analysisRegression analysis
Regression analysisSrikant001p
 
Least Squares Regression Method | Edureka
Least Squares Regression Method | EdurekaLeast Squares Regression Method | Edureka
Least Squares Regression Method | EdurekaEdureka!
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inferenceKemal İnciroğlu
 
Kendall's ,partial correlation and scatter plot
Kendall's ,partial correlation and scatter plotKendall's ,partial correlation and scatter plot
Kendall's ,partial correlation and scatter plotBharath kumar Karanam
 
4. regression analysis1
4. regression analysis14. regression analysis1
4. regression analysis1Karan Kukreja
 

What's hot (19)

Linear regression
Linear regressionLinear regression
Linear regression
 
Coefficient of correlation
Coefficient of correlationCoefficient of correlation
Coefficient of correlation
 
Spearman Rank
Spearman RankSpearman Rank
Spearman Rank
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011Stats 3000 Week 2 - Winter 2011
Stats 3000 Week 2 - Winter 2011
 
Sumit presentation
Sumit presentationSumit presentation
Sumit presentation
 
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )Correlation by Neeraj Bhandari ( Surkhet.Nepal )
Correlation by Neeraj Bhandari ( Surkhet.Nepal )
 
Scatter diagram
Scatter diagramScatter diagram
Scatter diagram
 
Regression & correlation
Regression & correlationRegression & correlation
Regression & correlation
 
Regression
RegressionRegression
Regression
 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
 
correlation
correlation correlation
correlation
 
Correlation and Regression ppt
Correlation and Regression pptCorrelation and Regression ppt
Correlation and Regression ppt
 
9.2 lin reg coeff of det
9.2 lin reg coeff of det9.2 lin reg coeff of det
9.2 lin reg coeff of det
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Least Squares Regression Method | Edureka
Least Squares Regression Method | EdurekaLeast Squares Regression Method | Edureka
Least Squares Regression Method | Edureka
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inference
 
Kendall's ,partial correlation and scatter plot
Kendall's ,partial correlation and scatter plotKendall's ,partial correlation and scatter plot
Kendall's ,partial correlation and scatter plot
 
4. regression analysis1
4. regression analysis14. regression analysis1
4. regression analysis1
 

Viewers also liked

Scatter diagrams and correlation
Scatter diagrams and correlationScatter diagrams and correlation
Scatter diagrams and correlationkeithpeter
 
scatter diagram
 scatter diagram scatter diagram
scatter diagramshrey8916
 
Coefficient of correlation...ppt
Coefficient of correlation...pptCoefficient of correlation...ppt
Coefficient of correlation...pptRahul Dhaker
 
Lesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionLesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionSumit Prajapati
 
Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Namrata Saxena
 
7 c's of marketing.
7 c's of marketing.7 c's of marketing.
7 c's of marketing.ssagar88
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis pptAnil Mishra
 
Mpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationMpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationVasant Kothari
 

Viewers also liked (16)

Scatter Diagrams
Scatter DiagramsScatter Diagrams
Scatter Diagrams
 
9.1
9.19.1
9.1
 
7 qc tools
7 qc tools7 qc tools
7 qc tools
 
Pearson Correlation
Pearson CorrelationPearson Correlation
Pearson Correlation
 
Scatter diagrams and correlation
Scatter diagrams and correlationScatter diagrams and correlation
Scatter diagrams and correlation
 
scatter diagram
 scatter diagram scatter diagram
scatter diagram
 
Scatter diagram in tqm
Scatter diagram in tqmScatter diagram in tqm
Scatter diagram in tqm
 
Coefficient of correlation...ppt
Coefficient of correlation...pptCoefficient of correlation...ppt
Coefficient of correlation...ppt
 
Lesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And RegressionLesson 8 Linear Correlation And Regression
Lesson 8 Linear Correlation And Regression
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)
 
7 c's of marketing.
7 c's of marketing.7 c's of marketing.
7 c's of marketing.
 
Correlation analysis ppt
Correlation analysis pptCorrelation analysis ppt
Correlation analysis ppt
 
Correlation ppt...
Correlation ppt...Correlation ppt...
Correlation ppt...
 
Mpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlationMpc 006 - 02-01 product moment coefficient of correlation
Mpc 006 - 02-01 product moment coefficient of correlation
 

Similar to Math n Statistic (20)

9. parametric regression
9. parametric regression9. parametric regression
9. parametric regression
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Exploring bivariate data
Exploring bivariate dataExploring bivariate data
Exploring bivariate data
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Correlation
CorrelationCorrelation
Correlation
 
Correlation and regression impt
Correlation and regression imptCorrelation and regression impt
Correlation and regression impt
 
Correlation
CorrelationCorrelation
Correlation
 
REGRESSION ANALYSIS THEORY EXPLAINED HERE
REGRESSION ANALYSIS THEORY EXPLAINED HEREREGRESSION ANALYSIS THEORY EXPLAINED HERE
REGRESSION ANALYSIS THEORY EXPLAINED HERE
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
2-20-04.ppt
2-20-04.ppt2-20-04.ppt
2-20-04.ppt
 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
 
Stat
StatStat
Stat
 
Regression -Linear.pptx
Regression -Linear.pptxRegression -Linear.pptx
Regression -Linear.pptx
 
Scatterplots, Correlation, and Regression
Scatterplots, Correlation, and RegressionScatterplots, Correlation, and Regression
Scatterplots, Correlation, and Regression
 
Study of Correlation
Study of Correlation Study of Correlation
Study of Correlation
 
Measure of Association
Measure of AssociationMeasure of Association
Measure of Association
 
RMBS - CORRELATION.pptx
RMBS - CORRELATION.pptxRMBS - CORRELATION.pptx
RMBS - CORRELATION.pptx
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Recently uploaded (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Math n Statistic

  • 1. Chapter 4 Describing the Relation Between Two Variables 4.1 Scatter Diagrams; Correlation
  • 2. Bivariate data is data in which two variables are measured on an individual. The response variable is the variable whose value can be explained or determined based upon the value of the predictor variable. A lurking variable is one that is related to the response and/or predictor variable, but is excluded from the analysis
  • 3. A scatter diagram shows the relationship between two quantitative variables measured on the same individual. Each individual in the data set is represented by a point in the scatter diagram. The predictor variable is plotted on the horizontal axis and the response variable is plotted on the vertical axis. Do not connect the points when drawing a scatter diagram.
  • 4. EXAMPLE Drawing a Scatter Diagram The following data are based on a study for drilling rock. The researchers wanted to determine whether the time it takes to dry drill a distance of 5 feet in rock increases with the depth at which the drilling begins. So, depth at which drilling begins is the predictor variable, x, and time (in minutes) to drill five feet is the response variable, y. Draw a scatter diagram of the data. Source: Penner, R., and Watts, D.G. “Mining Information.” The American Statistician, Vol. 45, No. 1, Feb. 1991, p. 6.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. Two variables that are linearly related are said to be positively associated when above average values of one variable are associated with above average values of the corresponding variable. That is, two variables are positively associated when the values of the predictor variable increase, the values of the response variable also increase.
  • 11. Two variables that are linearly related are said to be negatively associated when above average values of one variable are associated with below average values of the corresponding variable. That is, two variables are negatively associated when the values of the predictor variable increase, the values of the response variable decrease
  • 12. The linear correlation coefficient or Pearson product moment correlation coefficient is a measure of the strength of linear relation between two quantitative variables. We use the Greek letter (rho) to represent the population correlation coefficient and r to represent the sample correlation coefficient. We shall only present the formula for the sample correlation coefficient.
  • 13. 1. The linear correlation coefficient is always between -1 and 1, inclusive. That is, -1 < r < 1. 2. If r = +1, there is a perfect positive linear relation between the two variables. 3. If r = -1, there is a perfect negative linear relation between the two variables. 4. The closer r is to +1, the stronger the evidence of positive association between the two variables. 5. The closer r is to -1, the stronger the evidence of negative association between the two variables. Properties of the Linear Correlation CoefficientProperties of the Linear Correlation Coefficient
  • 14. 6. If r is close to 0, there is evidence of no linear relation between the two variables. Because the linear correlation coefficient is a measure of strength of linear relation, r close to 0 does not imply no relation, just no linear relation. 7. It is a unitless measure of association. So, the unit of measure for x and y plays no role in the interpretation of r. Properties of the Linear Correlation CoefficientProperties of the Linear Correlation Coefficient
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. EXAMPLE Drawing a Scatter Diagram and Computing the Correlation Coefficient For the following data (a)Draw a scatter diagram and comment on the type of relation that appears to exist between x and y. (b) By hand, compute the linear correlation coefficient.
  • 25. EXAMPLE Determining the Linear Correlation Coefficient Determine the linear correlation coefficient of the drilling data.
  • 26.
  • 27. i x x x s − i y y y s − i i x y x x y y s s   − −        x = y =
  • 28. A linear correlation coefficient that implies a strong positive or negative association that is computed using observational data does not imply causation among the variables.
  • 29. Chapter 4 Describing the Relation Between Two Variables 4.2 Least-squares Regression
  • 30. EXAMPLE Finding an Equation that Describes a Linear Relation (a) Find a linear equation that relates x (the predictor variable) and y (the response variable) by selecting two points and finding the equation of the line containing the points. (b) Graph the equation on the scatter diagram. (c) Use the equation to predict y if x = 5. Using the following sample data:
  • 31. The difference between the observed value of y and the predicted value of y is the error or residual. That is residual = observed - predicted Compute the residual for the prediction corresponding to x = 5.
  • 32.
  • 33.
  • 34. EXAMPLE Finding the Least-squares Regression Line Using the sample data: (a) Find the least-squares regression line. (b) Interpret the slope and intercept. (c) Predict y if x = 5. (d) Compute the residual for x = 5. (e) Draw the least-squares regression line on the scatter diagram of the data.
  • 35. EXAMPLE Computing the Sum of Squared Residuals Compute the sum of squared residuals for the line describing the relation between x and y that was obtained using two points. Compute the sum of squared residuals for the least-squares regression line. Which is smaller?
  • 36. EXAMPLE Finding the Least-squares Regression Line (a) Find the least-squares regression line for the drilling data. (b) Use the line to predict the drilling time at x = 130 feet. (c) Should the line be used to predict the drilling time at x = 400 feet? Why? (d) Interpret the slope and y-intercept.