Monday, June 6, 2011

CHAPTER 7: MULTIPLE REGRESSION ANALYSIS: THE PROBLEM OF ESTIMATION

ü  The Three-Variable Model Notation and Assumptions

Yi = β1 + β2X2i + β3X3i + ui
β2 and β3 are called the partial regression coefficients.
Assumptions:
1.      Zero mean value of ui or
       E(ui  X2i, X3i) = 0 for each i
2.      No serial correlation, or
cov(ui, uj) = 0         i   j
3.      Homoscedasticity, or
var(ui) = 
4.      Zero covariance between ui and each X variable, or
cov(ui, X2i) = cov(ui, X3i) = 0
5.      No specification bias, or
The model is correctly specified
6.      No exact collinearity between the X variables, or
No exact linear relationship between X2 and X3.
ü  Interpretation of Multiple Regression Equation

E(Yi  X2i, X3i) = β1 + β2X2i + β3iX3i
The conditional mean or expected value of Y conditional upon the given or fixed values of X2 and X3.
ü  The Meaning of Partial Regression Coefficients
β2 and β3 are known as partial regression or partial slope coefficients. β2 measures the change in the mean value of Y, E(Y), per unit change in X2, holding the value of X3 constant. Put differently, it gives the “direct” or “net” effect of a unit change in X2 on the mean value of Y, net of any effect that X3 may have on mean Y. Likewise, β3 measures the change in the mean value of Y per unit change in X3, holding the value of X2 constant. That is, it gives the “direct” or “net” effect of a unit change in X3 on the mean value of Y, net of any effect that X2 may have on mean Y.
ü  OLS and ML Estimation of the Partial Regression Coefficients
ü  OLS Estimators
OLS  estimator of the population intercept β1

β1 = Y – β2X2 – β3X3
OLS  estimator of the population intercept β2

β2 = (∑YiX2i)(∑X3i) – (∑YiX3i)(∑X2iX3i)
                    (∑X2i)(∑X3i) – (∑X2i3i)2




OLS  estimator of the population intercept β3

β3 = (∑YiX3i)(∑X2i) – (∑YiX2i)(∑X2iX3i)
                     (∑X2i)(∑X3i) – (∑X2i3i)2

Notation:
1)      Equations of β2 and β3 are symmetrical in nature because one can be obtained from the other by interchanging the roles of X2 and X3.
2)      The denominators of these two equations are identical.
3)      The three-variable case is a natural extension of the two-variable case.
ü  Variances and Standard Errors of OLS Estimators

var(β1) =  1   +     X2∑X3i + X3∑X2i - 2X2X3∑X2iX3i
                   n               ∑X2i∑X3i – (∑X2iX3i)2

se(β1) =   +   var(β1)

var(β2) =                ∑X3i
                  (∑X2i)(∑X3i) – (∑X2iX3i)2

or equivalently,

var(β2) =     
                 ∑X2i(1 – r23)

var(β3) =         ∑X2i
                 (∑X2i)(∑X3i) – (∑X2iX3i)2

se(β2) =   +    var(β2)

se(β3) =   +    var(β3)

cov(β2, β3) =        -r23
                        (1 – r23)  X2i   X3i
ü  Properties of OLS Estimators
1)      The three-variable regression line passes through the means Y, X2 and X3.
2)      The mean value of the estimated Yi(= Yi) is equal to the mean value of the actual Yi.
3)      ∑ui = u =0
4)      The residuals ui are uncorrelated with X2i and X3i, that is, ∑uiX2i = ∑uiX3i = 0
5)      The residuals ui are uncorrelated with Yi, that is, ∑uiYi = 0
6)      The correlation coefficient between X2 and X3 increases toward 1, the variances of β2 and β3 increase for given values of        and ∑X2i or ∑X3i.
ü  The Multiple Coefficient of Determination R2 and the Multiple Coefficient of Correlation R
r2 = measure the goodness of fit of the regression equation; that is, it gives the proportion or percentage, of the total variation in the dependent variable Y explained by the explanatory variable X.

R2 = RSS
        TSS

R2 = β2∑YiX2i + β3YiX3i
        ∑Yi2
ü  Simple Regression in the Context of Multiple Regression: Introduction to Specification Bias
Observe several things about this regression compared to the “true” multiple regression.
1.      In absolute terms, the PGNP coefficient has increased.
2.      The standard errors are different.
3.      The intercept values are different.
4.      The r2 values are dramatically different.
ü  Interpretation of Simple and Partial Correlation Coefficients
1.      Even if r12 = 0, r123 will not be zero unless r13 or r23 or both are zero.
2.      If r12 = 0 and r13 and r23 are nonzero and are of the same sign, r123 will be negative, whereas if they are of the opposite signs, it will be positive.
3.      In terms of r123 and r12 need not have the same sign.
4.      In the two-variable case we have seen that r2 lies between 0 and 1.
5.      The coefficient of partial determination may be interpreted as the proportion of the variation in Y not explained by the variable X3 that has been explained by the inclusion of X3 into the model.









No comments:

Post a Comment