{smcl}
{* 05Aug2010}{...}
{* *! Version 11 }
{hline}
help for {hi:gmmcovearn}{right:(update August 2010)}
{hline}

{title:Title}

{phang}
{bf: GMM estimator for covariance structure of earnings}


{title:Syntax}
{p 8 17 2}
{cmdab: gmmcovearn}
{it:earningsvar} {ifin}, {it:required options}
[{it:other options}]

{synoptset 20 tabbed}
{marker required_opts}{...}
{synopthdr :required_opts}
{synoptline}
{synopt:{opt modeln}} specifies the type of model to be estimated. The default is {cmd:modeln(1)}{p_end}

{tab}{tab}{tab}{tab}AR, no heterogeneity: {cmd:modeln(1)}
{tab}{tab}{tab}{tab}ARMA, no heterogeneity: {cmd:modeln(2)}
{tab}{tab}{tab}{tab}AR, random growth: {cmd:modeln(3)}
{tab}{tab}{tab}{tab}ARMA, random growth: {cmd:modeln(4)}
{tab}{tab}{tab}{tab}AR, random walk: {cmd:modeln(5)}
{tab}{tab}{tab}{tab}ARMA, random walk: {cmd:modeln(6)}
{tab}{tab}{tab}{tab}AR, combined random growth and random walk: {cmd:modeln(7)}
{tab}{tab}{tab}{tab}ARMA, combined random growth and random walk: {cmd:modeln(8)}

{tab}{tab}{tab}{tab}All models include time factor loadings on the permanent and transitory components. 

{synopt:{opt yearn}} specifies the number of years used for the analysis{p_end}

{synoptset 20 tabbed}
{marker other_opts}{...}
{synopthdr :other_opts}
{synoptline}
{synopt:{opt expvar}} specifies the name of the experience variable to be used for models that allow for heterogeneity in the life-cycle earnings profile ({cmd: modeln(3)} to {cmd: modeln(8)}).{p_end}
{synopt:{opt firstyr}} specifies the numeric year indicator attached to the first wave of earnings. The default value is 1. {it:earningsvar} and {it:expvar} are assumed to be indexed in consecutive integers from {it:firstyr} to ({it:firstyr} +              {it:yearn} -1).{p_end}
{synopt:{opt cohortn}} specifies the number of cohorts used for the analysis. The default is 1.{p_end}
{synopt:{opt cohortvar}} specifies the name of the cohort indicator variable. The default is {it: cohort}{p_end}
{synopt:{opt firstcohort}} specifies the numeric indicator of the first cohort. The default value is 1. Cohorts are assumed to be coded in consecutive integers from {it:firstcohort} to ({it:firstcohort} + {it:cohortn} -1). [e.g. In a model with                                  four                                cohorts {it: cohortvar} could contain values such as 1 to 4 or 1994 to 1997. However values such as 1960, 1970, 1980 and 1990 would have to be recoded before being used.]
{p_end}
{synopt:{opt stvalue}} specifies the starting values for the estimation. For T years of data and C cohorts,  values are entered in the following order, separated by commas: sigalpha, rho, sigv1, sige, l2-lT, p2-pT, q2-qC, s2-sC, sigbeta,                                     covalphabeta, sigw, theta.                            The user should specify starting values only for the parameters estimated in the chosen model.The default values for the l’s, p’s, q’s and s’s are 1; for sigalpha and rho                       they are 0.5; for sigv1 and sige they are 0.1; for sigbeta, covalphabeta and sigw they are 0; and for theta it is -0.5.{p_end}
{synopt:{opt newdataname}} allows the user to create a dataset called {it:newdataname} containing the sample moments used in the estimation and the number of observations used in calculating each of these moments. If a heterogeneous model is                   specified the dataset will also contain the average of {it:expvar} and the                    average of squared {it:expvar}.{p_end}
{synopt:{opt graph}} 1 if the user wants a graphical display of the estimated variance decomposition, 0 otherwise. The default value is 0.{p_end}





{synoptline}


{title:Description}

{pstd}{cmd:gmmcovearn} provides GMM estimates of the parameters of the covariance structure of earnings using the earnings variable specified in {it: earningsvar}. 

{tab}{tab}{tab}{tab}{pstd}The general earnings dynamics model ({cmd:modeln(8)}) for individual i, belonging to cohort c, with x years of experience, at time t, y_icxt, is specified as

{tab}{tab}{tab}{tab}y_icxt=q_c*p_t*(alpha_ix)+s_c*l_t*v_it

{tab}{tab}{tab}{tab}alpha_ix=alpha_i(x-1)+b_i+w_ix

{tab}{tab}{tab}{tab}And 

{tab}{tab}{tab}{tab}v_it=rho*v_i(t-1)+theta*e_i(t-1)+e_it

{tab}{tab}{tab}{tab}The parameters estimated are sigalpha, rho, sigv1, sige, l2-lT, 
{tab}{tab}{tab}{tab}p2-pT, q2-qC, s2-sC, sigbeta, covalphabeta, sigw, theta. 

{tab}{tab}{tab}{tab}The model is described in more detail in Doris et al (2010a).


{pstd} The command makes use of an additional program {cmd:nlgmmcovearn} that must be downloaded along with {cmd:gmmcovearn}. 
A detailed analysis of this approach to estimating the covariance structure of earnings  can be found in Doris et al (2010b).

{pstd} The program requires that the data be in wide format. It must contain an earnings (or earnings residual) variable. If using a model with                                                   heterogeneous profiles, it must also include a                  labour market experience variable. If using a model with cohort effects, it must also include a cohort indicator variable. 

{pstd}The Identity matrix is used as the GMM weighting matrix and standard errors are adjusted for unbalanced data using the approach reported in Haider(2001)





{title:Examples}
{bf: Example 1: NLS Earnings Data}
{pstd} In this example we make use of the NLS panel data set used in Wooldridge (2002) and available for download from within Stata. The dataset provides an unbalanced panel of data on earnings, schooling, and demographic information for 530                    individuals from the National Longitudinal Survey for the years 1981-1987.

. use http://www.stata.com/data/jwooldridge/eacsap/nls81_87.dta

{pstd}Since the data is in long format we must first reshape it prior to using gmmcovearn.

. keep id year exper lwage

. reshape wide lwage exper, i(id) j(year)

{pstd}We estimate a earnings covariance structure model on log wages without heterogeneous profiles and an AR(1) model for the transitory component as follows:

. gmmcovearn lwage, yearn(7) modeln(1) cohortn(1) firstyr(81)
(obs = 28)

Iteration 0:  residual SS =  .1643298
Iteration 1:  residual SS =    .05161
Iteration 2:  residual SS =  .0017779
Iteration 3:  residual SS =  .0016158
Iteration 4:  residual SS =   .001615
Iteration 5:  residual SS =   .001615
Iteration 6:  residual SS =   .001615
Iteration 7:  residual SS =   .001615
Iteration 8:  residual SS =   .001615
Iteration 9:  residual SS =   .001615

      Source |       SS       df       MS
-------------+------------------------------         Number of obs =        28
       Model |  .796840451    16  .049802528         R-squared     =    0.9980
    Residual |  .001614961    12   .00013458         Adj R-squared =    0.9953
-------------+------------------------------         Root MSE      =  .0116009
       Total |  .798455413    28  .028516265         Res. dev.     = -193.8376

------------------------------------------------------------------------------
      moment |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
   /sigalpha |   .0683058   .0088071     7.76   0.000     .0491168    .0874948
        /rho |   .3130349   .0608686     5.14   0.000     .1804137    .4456561
      /sigv1 |    .201089   .0144893    13.88   0.000     .1695194    .2326586
       /sige |   .0588356   .0375667     1.57   0.143    -.0230152    .1406864
         /l2 |   1.209775   .3380594     3.58   0.004     .4732069    1.946343
         /l3 |   1.497133    .496198     3.02   0.011     .4160105    2.578256
         /l4 |   1.142064   .3835471     2.98   0.012     .3063868    1.977742
         /l5 |   1.317238   .4160183     3.17   0.008     .4108118    2.223664
         /l6 |   1.438042    .464473     3.10   0.009     .4260424    2.450042
         /l7 |   1.706241   .5657103     3.02   0.011     .4736643    2.938818
         /p2 |   .9159306   .0803285    11.40   0.000     .7409098    1.090951
         /p3 |   1.112308   .0998032    11.15   0.000      .894856    1.329761
         /p4 |   1.307378   .1214745    10.76   0.000     1.042708    1.572048
         /p5 |   1.449588   .1411007    10.27   0.000     1.142156     1.75702
         /p6 |   1.466273   .1388268    10.56   0.000     1.163796    1.768751
         /p7 |   1.470464   .1267276    11.60   0.000     1.194348     1.74658
------------------------------------------------------------------------------
 coefficients and corrected standard errors below
------------------------------------------------------------------------------
             |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    sigalpha |   .0683058    .024722     2.76   0.006     .0198516    .1167599
         rho |   .3130349    .077011     4.06   0.000     .1620961    .4639736
       sigv1 |    .201089   .0402344     5.00   0.000      .122231     .279947
        sige |   .0588356   .0456725     1.29   0.198    -.0306809    .1483521
          l2 |   1.209775   .4187484     2.89   0.004     .3890433    2.030507
          l3 |   1.497133   .6406559     2.34   0.019     .2414706    2.752796
          l4 |   1.142064   .4497494     2.54   0.011     .2605716    2.023557
          l5 |   1.317238   .5380952     2.45   0.014     .2625906    2.371885
          l6 |   1.438042   .5672408     2.54   0.011     .3262706    2.549814
          l7 |   1.706241   .6850775     2.49   0.013     .3635139    3.048968
          p2 |   .9159306   .1452523     6.31   0.000     .6312413     1.20062
          p3 |   1.112308   .2170894     5.12   0.000     .6868211    1.537796
          p4 |   1.307378    .237724     5.50   0.000     .8414475    1.773308
          p5 |   1.449588   .2657075     5.46   0.000     .9288111    1.970366
          p6 |   1.466273    .270121     5.43   0.000     .9368458    1.995701
          p7 |   1.470464   .3122724     4.71   0.000     .8584214    2.082507
-------------+----------------------------------------------------------------



{pstd}Hypothesis tests can be carried out using {cmd:test} after running the gmmcovearn command. For example a test that the permanent factor loadings, the p_t’s, are constant over time can be carried out using a Wald test as follows:

. test _b[p2]=_b[p3]=_b[p4]=_b[p5]=_b[p6]=_b[p7]=1

 ( 1)  p2 - p3 = 0
 ( 2)  p2 - p4 = 0
 ( 3)  p2 - p5 = 0
 ( 4)  p2 - p6 = 0
 ( 5)  p2 - p7 = 0
 ( 6)  p2 = 1

           chi2(  6) =   12.14
         Prob > chi2 =    0.0588


{pstd}In this example we reject constant permanent factor loadings at the 10% significance level. 


{bf: Example 2: German Earnings Data}
{pstd}To illustrate the use of gmmcovearn for a more complicated model including cohort effects we use data taken from the eight waves of the European Community Household Panel for Germany. This data set has earnings for 8 years denoted yi1994                             to yi2001, a potential experience variable denoted potexp1994 to potexp2001 and 4 cohorts labeled 1 to 4. To estimate a random growth model of the covariance structure with these data we type:

{pstd} {cmd:gmmcovearn} yi, yearn(8) modeln(3) cohortn(4) expvar(potexp) firstyr(1994){p_end}


Iteration 0:  residual SS =  .4483834
Iteration 1:  residual SS =  .0186083
Iteration 2:  residual SS =  .0086681
Iteration 3:  residual SS =  .0070441
Iteration 4:  residual SS =  .0070062
Iteration 5:  residual SS =  .0070061
Iteration 6:  residual SS =  .0070061
Iteration 7:  residual SS =  .0070061
Iteration 8:  residual SS =  .0070061

      Source |       SS       df       MS
-------------+------------------------------         Number of obs =       144
       Model |  2.16612446    26  .083312479         R-squared     =    0.9968
    Residual |  .007006143   118  .000059374         Adj R-squared =    0.9961
-------------+------------------------------         Root MSE      =  .0077055
       Total |  2.17313061   144  .015091185         Res. dev.     = -1021.378

------------------------------------------------------------------------------
      moment |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
   /sigalpha |   .4386704   .0605217     7.25   0.000      .318821    .5585198
        /rho |   .3417541   .0361788     9.45   0.000     .2701102    .4133979
      /sigv1 |   .0742456   .0062163    11.94   0.000     .0619355    .0865556
       /sige |   .0301694   .0112735     2.68   0.009     .0078448    .0524939
         /l2 |   1.498371   .2427683     6.17   0.000     1.017624    1.979118
         /l3 |   1.312379   .2569202     5.11   0.000     .8036072    1.821151
         /l4 |   1.162531   .2301328     5.05   0.000     .7068056    1.618257
         /l5 |   1.193413   .2324563     5.13   0.000     .7330861     1.65374
         /l6 |   1.297056   .2507765     5.17   0.000     .8004506    1.793662
         /l7 |   1.279366   .2507241     5.10   0.000     .7828636    1.775868
         /l8 |   1.428739   .2791012     5.12   0.000     .8760424    1.981435
         /p2 |   .9842998   .0222086    44.32   0.000     .9403208    1.028279
         /p3 |    1.07534   .0254406    42.27   0.000     1.024961    1.125719
         /p4 |    1.08418   .0279634    38.77   0.000     1.028804    1.139555
         /p5 |   1.148706   .0318299    36.09   0.000     1.085674    1.211738
         /p6 |   1.168077    .033754    34.61   0.000     1.101235    1.234919
         /p7 |   1.205651   .0364914    33.04   0.000     1.133388    1.277914
         /p8 |   1.219382   .0372949    32.70   0.000     1.145528    1.293236
         /q2 |    .989772   .0419597    23.59   0.000     .9066803    1.072864
         /q3 |   .7322496   .0536754    13.64   0.000     .6259578    .8385415
         /q4 |   .4866869   .0387748    12.55   0.000     .4099022    .5634715
         /s2 |   .6293472   .0447309    14.07   0.000     .5407679    .7179265
         /s3 |   .8336288   .0377501    22.08   0.000     .7588734    .9083842
         /s4 |   1.214287   .0390211    31.12   0.000     1.137014    1.291559
    /sigbeta |   .0003872   .0000489     7.92   0.000     .0002904    .0004841
/covalphab~a |   -.012158    .001729    -7.03   0.000    -.0155819   -.0087341
------------------------------------------------------------------------------
 coefficients and corrected standard errors below
------------------------------------------------------------------------------
             |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
    sigalpha |   .4386704   .1706614     2.57   0.010     .1041801    .7731606
         rho |   .3417541   .0361377     9.46   0.000     .2709254    .4125827
       sigv1 |   .0742456   .0118533     6.26   0.000     .0510135    .0974776
        sige |   .0301694   .0104324     2.89   0.004     .0097223    .0506164
          l2 |   1.498371   .2208264     6.79   0.000     1.065559    1.931183
          l3 |   1.312379   .2312379     5.68   0.000     .8591613    1.765597
          l4 |   1.162531   .1944341     5.98   0.000     .7814474    1.543615
          l5 |   1.193413   .1942618     6.14   0.000     .8126667    1.574159
          l6 |   1.297056   .2167269     5.98   0.000     .8722795    1.721833
          l7 |   1.279366   .2119609     6.04   0.000     .8639299    1.694801
          l8 |   1.428739   .2584156     5.53   0.000     .9222535    1.935224
          p2 |   .9842998   .0313717    31.38   0.000     .9228123    1.045787
          p3 |    1.07534   .0433522    24.80   0.000     .9903711    1.160309
          p4 |    1.08418    .050535    21.45   0.000     .9851328    1.183226
          p5 |   1.148706   .0694794    16.53   0.000     1.012529    1.284883
          p6 |   1.168077   .0789378    14.80   0.000     1.013362    1.322793
          p7 |   1.205651   .0887034    13.59   0.000     1.031796    1.379507
          p8 |   1.219382   .0922591    13.22   0.000     1.038558    1.400207
          q2 |    .989772    .111584     8.87   0.000     .7710713    1.208473
          q3 |   .7322496   .1311763     5.58   0.000     .4751487    .9893505
          q4 |   .4866869   .1032265     4.71   0.000     .2843667     .689007
          s2 |   .6293472   .0628975    10.01   0.000     .5060703    .7526241
          s3 |   .8336288   .0684399    12.18   0.000     .6994891    .9677684
          s4 |   1.214287   .0917179    13.24   0.000     1.034523    1.394051
     sigbeta |   .0003872   .0001758     2.20   0.028     .0000426    .0007319
covalphabeta |   -.012158    .005614    -2.17   0.030    -.0231613   -.0011547
------------------------------------------------------------------------------




{title:Saved results}

{pstd}
{cmd:gmmcovearn} saves the following in {cmd:e()}:

{synoptset 20 tabbed}{...}
{p2col 5 20 24 2: Scalars}{p_end}
{synopt:{cmd:e(nummomnent)}}number of moment conditions used in estimation{p_end}

{p2col 5 20 24 2: Vectors and Matrices}{p_end}
{synopt:{cmd:e(b)}}coefficient vector{p_end}
{synopt:{cmd:e(V)}}variance-covariance matrix of the estimators{p_end}
{synopt:{cmd:e(momentc)}}sample moments for earnings variable for cohort c, c=1 to {it: cohortn}{p_end}
{synopt:{cmd:e(permc)}}predicted permanent component of earnings variance for cohort c, c=1..{it: cohortn}{p_end}
{synopt:{cmd:e(tempc)}}predicted transitory component of earnings variance for cohortc, c=1..{it: cohortn}{p_end}

{title:References}

{p 5 5} Doris, A, D. O’Neill and O.Sweetman (2010a) “GMMCOVEARN: A Stata Module for GMM Estimation of the Covariance Structure of Earnings.,” NUIM Economics Working paper No.212-10.


{p 5 5} Doris, A, D. O’Neill and O.Sweetman (2010b) “Identification of the Covariance Structure of Earnings using the GMM Estimator,” IZA Working paper No. 4952.

{p 5 5} Haider, S. (2001), “Earnings Instability and Earnings Inequality of Males in the United States: 1967-1991’, Journal of Labor Economics, Vol. 19(4), pp. 799-836.

{p 5 5} Wooldridge, J. (2002), Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambdridge, Massachusetts. 


{title:Authors}

{p 5 5}Aedin Doris, Donal O’Neill and Olive Sweetman, Economics, NUI Maynooth, Ireland.

{* Version 2.0 2010-08}
{* Version 1.0 2010-06-30}