More on Regression

Suppose we are given the set of data points

(x₁, y₁), (x₂, y₂),..., (x_n, y_n)

and that we are interested in finding a straigt line that "best" fits that data. We will begin with the linear model

Y_i = α-β(x_i - x̄) + ε_i

where we assume that for a partiular value of x, that the value of Y will differ from its mean by some random ammount ε and that the distribution of ε is N(0,σ). It can be show that the estimate for α is

α̂ =ȳ

and that the estimate for β is

̂β = ∑_iy_i(x_i - x̄)²/ ∑_i(x_i - x̄)²

x:
y: