data:image/s3,"s3://crabby-images/00bcd/00bcd85c34eea0d0fca11425363c58e41ddc6c57" alt=""
If we look at the errors, we see that they are dependent on the value of x! Naturally (b/c of mean=x[i]).
data:image/s3,"s3://crabby-images/778a1/778a15d64ece2fcd9edc1f66950439f264c97b96" alt=""
What I should have done is something like this:
data:image/s3,"s3://crabby-images/241b6/241b62cac5630da030eaeacb44d9fdb116ad6e40" alt=""
data:image/s3,"s3://crabby-images/dbf62/dbf625d4467027f1a643d4974e0d94c0dc7a158f" alt=""
Nevertheless, I hope the essential points are clear:
• covariance is related to variance: cov(x,x) = var(x)
• correlation is the covariance of z-scores
• the slope of the regression line is: cov(x,y) / var(x)
• the regression line goes through x, y
• r ≈ cov(x,y) / sqrt(var(x)*var(y))
• the call to plot the line is abline(lm(y~x))
The proportionality for r is b/c we are not matching R's output for this. I think it is because we are missing a correction factor of n-1/n. I will have to look into that when I get back to my library at home.
With the change, we do a little better on guessing the slope: