3. Outline Introduction Production, cost and profit functions Single equation estimation Imposing equality restrictions Hypothesis testing Systems estimation ECON377/477 Topic 1.4 3
4. Introduction This section provides an overview of econometric methods for estimating economic relationships We focus mainly on the case where we have access to cross-sectional data (observations on several firms in a single period) but panel data (observations on several firms in several periods) are used in some assignment work The econometric concepts and methods we discuss carry over, often with little or no modification, to cases where we have time-series data (observations on a single firm in several periods) or panel data ECON377/477 Topic 1.4 4
5. Introduction Some of the more common functional forms found in the applied economics literature include the linear, Cobb-Douglas, normalised quadratic and translog functional forms In the next section, we describe how the parameters of different models can be estimated using the methods of ordinary least squares (OLS), maximum likelihood (ML) and nonlinear least squares In the following section, we show how these methods can be adapted to ensure the parameters of our models satisfy equality constraints implied by economic theory ECON377/477 Topic 1.4 5
6. Introduction In the final section, we show how to test the validity of these constraints using F-tests and likelihood-ratio (LR) tests At this point, students will have developed enough econometrics background to understand the basic stochastic frontier models 6 ECON377/477 Topic 1.4
7. Production, cost and profit functions Production, cost and profit functions express a single dependent variable as a function of one or more explanatory variables Mathematically, all these different functions can be written in the form: where y is the dependent variable; the xn (n = 1, …, N) are explanatory variables; and f(.) is a mathematical function The first step in estimating the relationship between the dependent and explanatory variables is to specify the algebraic form of f(.) ECON377/477 Topic 1.4 7
12. Common functional forms The βmn parameters satisfy the identifying condition βmn = βnm for all n and m This condition is sometimes known as a symmetrycondition Note that some functional forms are special cases of others For example, the Cobb-Douglas can be obtained from the translog by setting all βmn= 0 ECON377/477 Topic 1.4 9
13. Accounting for technological change Technological advances often cause economic relationships (especially production functions) to change over time If we have observations over time, we usually account for technological change by including a time trend in our model See the examples in CROB (p. 213) for the linear, Cobb-Douglas and translog functions The way we introduce a time trend into the model should reflect industry-specific knowledge of technological developments and how they affect important characteristics of economic behaviour ECON377/477 Topic 1.4 10
14. Single equation estimation We can account for the combined effects of errors (known as statistical noise) by including random variables in our models For the time being, we focus on models that are linear in the parameters and write: , i = 1, …, I where yi denotes the i-th observation on the dependent variable; xi is a Kx1 vector containing the explanatory variables; β is an associated Kx1 vector of unknown parameters; vi is a random error term representing statistical noise; and I denotes the number of observations in the data set ECON377/477 Topic 1.4 11
15. Single equation estimation Depending on the functional form chosen and the type of economic relationship being estimated, the dependent and explanatory variables in the model may be different functions of input and output prices and quantities Examples are logarithms, square roots and ratios If we choose a translog model to explain variations in an output qi as a function of inputs x1i, x2i and x3i, then the model for yi = lnqi can be expressed in compact form as shown on the next slide ECON377/477 Topic 1.4 12
17. Single equation estimation Our next task is to estimate the unknown parameter vector β The two main estimation methods are OLS and ML Both estimation methods are underpinned by important assumptions concerning the error terms ECON377/477 Topic 1.4 14
18. Ordinary least squares The most common assumptions made in OLS concerning the errors are: zero mean homoskedastic for all uncorrelated ECON377/477 Topic 1.4 15
19. Single equation estimation The least squares approach to estimating β involves minimising the sum of squared deviations between the yis and their means The function that expresses this sum of squares as a function of β is: ECON377/477 Topic 1.4 16
20. Single equation estimation The OLS estimator has a mean of β (unbiased) It is also a linear function of the yi observations and can be shown to have a variance that is no larger than the variance of any other linear unbiased estimator (efficient) We summarise these properties by saying that the OLS estimator is the best linear unbiased estimator (BLUE) of β ECON377/477 Topic 1.4 17
21. Maximum likelihood estimation The concept of ML estimation is underpinned by the idea that a particular sample of observations is more likely to have been generated from some distributions than from others The MLestimate of an unknown parameter is defined to be the value of the parameter that maximises the probability (or likelihood) of randomly drawing a particular sample of observations To use the ML principle to estimate the parameters of the classical linear regression model, we first need to make an assumption concerning the distributions of the error terms ECON377/477 Topic 1.4 18
22. Maximum likelihood estimation The most common assumption is that the errors are normally distributed. Combining this assumption with the three OLS assumptions, we write which says the errors are independently and identically distributed normal random variables with zero means and variances, σ2 Using the relationship between yiand vi together with well-known properties of normal random variables, we can then write: ECON377/477 Topic 1.4 19
23. Maximum likelihood estimation The joint density function for the vector of observations is This joint probability density function (pdf) is known as the likelihood function It expresses the likelihood of observing the sample observations as a function of the unknown parameters β and σ2 ECON377/477 Topic 1.4 20
24. Maximum likelihood estimation The ML estimator of β is obtained by maximising this function with respect to β Equivalently, it can be obtained by maximising the logarithm of the likelihood function: In the special case of the classical linear regression model with normally distributed errors, the ML estimator for β is identical to the OLS estimator ECON377/477 Topic 1.4 21
25. Maximum likelihood estimation The ML estimator has several desirable large-sample (i.e., asymptotic) properties It can be shown to be consistent and asymptotically normally distributed (CAN) with variances that are no larger than the variances of any other CAN estimator That is, the ML estimator is asymptotically efficient An estimator is consistent, for a scalar parameter, if its values approach the true parameter value and if its variances get smaller as the sample size increases indefinitely ECON377/477 Topic 1.4 22
26. Estimation of non-linear models The linear regression framework discussed above can be generalised to models that are nonlinear in the parameters Computations using a non-linear estimator can be done easily using well-known econometrics software packages Iterative optimisation procedures are not foolproof – sometimes they take us to a local rather than a global maximum, and sometimes they do not converge (to a maximum) at all ECON377/477 Topic 1.4 23
27. Estimation of non-linear models To confirm that the procedure has converged, it is important to check that the first derivatives of the log-likelihood function (i.e., the gradients) are close to zero To confirm that the maximum is a global maximum, we should establish that the log-likelihood function is globally concave Alternatively, we can simply re-estimate the model using different sets of starting values If very different starting values yield the same ML estimates, it is likely that a global maximum has been reached ECON377/477 Topic 1.4 24
28. Imposing equality constraints Specialist knowledge of an industry may give rise to an analyst imposing equality constraints For example, knowledge of smallholder rice production may lead the analyst to believe the production technology exhibits constant returns to scale (CRS) In the case of the translog production function model, this property implies: β1 + β2 + β3 = 1 β11 + β12 + β13 = 0 β12 + β22 + β23 = 0 β13 + β23 + β33 = 0 ECON377/477 Topic 1.4 25
29. Imposing equality constraints OLS and ML estimation methods can both be adapted to enforce these types of constraints on estimates for β We minimise the sum of squares function or maximise the likelihood function subject to the constraints Both of these constrained optimisation problems involve setting up a Lagrangean function, equating the first-order derivatives to zero and solving for β ECON377/477 Topic 1.4 26
30. Imposing equality constraints Consider the problem of imposing the constraints on the parameters of the translog model This model can be written as: ECON377/477 Topic 1.4 27
31. Imposing equality constraints The constraints can also be rewritten as β1 = 1 – β2 – β3 β11 = – β12 – β13 β22 = – β12 – β23 β33 = – β13 – β23 Substituting the constraints into the model yields (after some re-arrangement): where: ECON377/477 Topic 1.4 28
32. Imposing equality constraints This so-called restricted model is linear in the parameters and can be estimated by OLS Following estimation, estimates of β1, β11, β22 and β33 can be computed by substituting the estimates of β2, β3, β12, β13 and β23 into the right-hand sides of the constraints ECON377/477 Topic 1.4 29
33. Imposing equality constraints CROB illustrate these procedures, presenting in Table 8.4 annotated SHAZAM output from estimating the constant-returns-to-scale translog model Results from estimating the unrestricted version of this model are presented in Table 8.2 We consider methods for testing the validity of the constant returns to scale assumption in the next section ECON377/477 Topic 1.4 30
34. Hypothesis testing If the errors are normally distributed, or if the sample size is large, we can test hypotheses concerning a single coefficient using a t-test Let βk denote the k-th element of the vector β and let c be a known constant To test H0: βk = c against H1: βk ≠ c, we use the test statistic: where bk is the estimator for βk and se(bk) is the estimator for its standard error ECON377/477 Topic 1.4 31
35. Hypothesis testing Thus, we reject H0 if the absolute value of the test statistic is greater than the critical value If the alternative hypothesis is H1: βk < c, then we reject H0at the 100α% level of significance if the t-statistic is less thantα(I – K), the critical value If the alternative hypothesis is H1: βk > c, we reject H0 if the t-statistic is greater than t1–α(I–K) Sometimes we wish to conduct a joint test of several hypotheses concerning the coefficients, such as the joint null hypothesis that a translog production function exhibits constant returns to scale ECON377/477 Topic 1.4 32
36. Hypothesis testing Several procedures are available for testing hypotheses of this form All are underpinned by the idea that if the restrictions specified under the null hypothesis are true, then our restricted and unrestricted estimates of β should be very close This is because the restricted and unrestricted estimators are both consistent The F-statistic can be used, as a measure of the difference between the sums of squared residuals obtained from the restricted and unrestricted models ECON377/477 Topic 1.4 33
37. Hypothesis testing The value of the log-likelihood function evaluated at the unrestricted estimates should be close to the value of the log-likelihood function evaluated at the restricted estimates The likelihood ratio statistic is a measure of this closeness: where lnLR and lnLU denote the maximised values of the restricted and unrestricted log-likelihood functions and J is the number of restrictions ECON377/477 Topic 1.4 34
38. Hypothesis testing We reject H0 at the 100α% level of significance if the LR statistic exceeds the critical chi-squared value The F and LR tests are simple to construct but require estimation of both the restricted and unrestricted models Alternative testing procedures that require estimation of only one model are the Wald and Lagrange Multiplier (LM) tests ECON377/477 Topic 1.4 35
39. Hypothesis testing The Wald statistic measures how well the restrictions are satisfied when evaluated at the unrestricted estimates, so it only requires estimation of the unrestricted model The LM statistic looks at the first-order conditions for a maximum of the log-likelihood function when evaluated at the restricted estimates, so it only requires estimation of the restricted model All the tests discussed in this section are asymptotically justified (i.e., we can use them if the sample size is large) ECON377/477 Topic 1.4 36
40. Systems estimation We often want to estimate the parameters of several economic relationships Consider the single-output three-input translog cost function: where ci is the i-th observation on cost, the wniare input prices and qi is output We can obtain estimates of the parameters using OLS or ML ECON377/477 Topic 1.4 37
41. Systems estimation We can also use Shephard’s Lemma to derive the cost-share equations for n = 1, 2, 3 where sni is the n-th cost share and the vnis are random errors that are likely to be correlated with each other The correlations between these errors provide information that can be used in the estimation process to obtain more efficient estimates ECON377/477 Topic 1.4 38
42. Systems estimation In the present context, this means estimating a system of N = 3 equations comprising the cost function and, for reasons given on the next slide, only N – 1 = 2 of the cost-share equations Such systems of equations, where the errors in the equations are correlated, are known as seemingly unrelated regression (SUR) models Again, the parameters of SUR models can be estimated using OLS or ML techniques ECON377/477 Topic 1.4 39
43. Systems estimation If the dependent variables in a set of SUR equations sum to the value of a variable appearing on the right-hand sides of those equations, the covariance matrix of the errors is usually singular In this case, the estimation breaks down unless one equation is deleted from the system ECON377/477 Topic 1.4 40