Number of observations (_N) was 0, now 250.
Quantile regressions with multiple fixed effects
A Method of Moments Approach
Introduction
What do Quantile Regressions do?
Quantile Regressions are an alternative to standard linear regressions that help us to better understand the relationship between the distribution of
and .In constrast with Linear regression models, where one focuses on explaining
as a function of , quantile regressions aim to assess the relationship between with respect to .- This means that to interpret Q-regressions properly, one needs to “condition” on
but also on .
- This means that to interpret Q-regressions properly, one needs to “condition” on
Quantile Regressions are nonlinear-models, that “look” and can be interpreted as linear ones.
Estimation
- A Generalization of Quantile regression data generating function can be written as follows:
Base on this specification, few characteristics should be considered:
- Slopes
will vary across quantiles only if the model is Heteroskedastic. - Qreg can be considered as a semi-parametric-varying coefficient model, with unobserved running variable (
). - Coefficients are percentile specific, but functional forms can be used for more efficient estimation.
- It is possible to separate location effects (mean) from scale effect (deviation from the mean)
- Slopes
Standard
Semiparamatric (Kaplan (2022))
Functional (Bottai and Orsini (2019))
Location-Scale
The Problem
How to account for fixed effects?
- As said before, quantile regresions are nonlinear models. Thus, one cannot just add Dummies to address group fixed effects
This creates an incidental parameter problem. Neither
- But then how to solve it?
Some Solutions
Koenker (2004): Assume Fixed effects only have an impact on Location, and Shrink invidual effects (LASSO)
Canay (2011): Similar to Koenker (2004), but “eliminate them” before running Qreg
Abrevaya and Dahl (2008): Correlated Random effects Model
Machado and Santos Silva (2019)
This method was developed (and implemented using
xtqreg
) to incorporate individual fixed effects (Useful when considering panel data)In principle its an extension of He (1997), who proposed an strategy to estimate Qreg coefficients using a restricted location-scale model, assuming the following structure:
Thus, Quantile regression model is given by:
Instead of requiring
coefficients to identify the whole conditional distribution, it only requires .Also, Quantile curves will not cross!
How does Simplified He (1997) and Machado and Santos Silva (2019) works?
Step 1: Location model
Step 2: Scale model
Step 3: Standardized Quantile
Step 4: Aggregate
Extension: Multiple Fixed effects
Machado and Santos Silva (2019) propose an extension: To add a single set of fixed effect (panel) apply Frisch–Waugh–Lovell theorem.
This can be extended to controlling for multiple fixed effects:
And use almost the same moments as before:
Extension: Standard Errors
Along with the modified algorithm that allows for Panel Fixed effects, Machado and Santos Silva (2019) proposed a GLS type variance estimator.
- The model assumes the existance of Heteroskedasticity.
- Based on Law of Iterated Expectations, and assuming Heteroskedasticity is correctly specified, its possible to construct a GLS type Variance Estimator.
How does this work?
Location Model
Consider the location model:
Which has known heteroskedasticity:
Machado and Santos Silva (2019) GLS Variance would be:
However, this can be sensitive to the modeling of the Scale function.
But one can almost always Bootstrap…
Extension: Robust and Clustered SE
An extension offered in my implementation makes full use of the moment conditions, and the implicit Influence functions:
So we just stack Influence functions to obtain Robust and Clustered Standard errors:
Robust SE
Cluster SE
Easily allows for simultaneous Qregressions and weights
Command syntax mmqreg
[if in] [pw] , [Options] <-- Standard Syntax
mmqreg depvar indepvar
** Variance Options
Default: MSS(2019) GLS SE
-robust-: Robust Standard errors, no DOF Correction (GMM)
-cluster(varname)-: Clustered Standard errors, no DOF Correction (GMM)
-dfadj-: Simple Adjustment (n-k-1)
** Output - q(numlist): List of Quantiles to be estimated (default q=50)
- ls : Request Providing Location Scale coefficient models
There will be some differences with xtqreg
because of couple of differences in the math and programming.
Example
webuse nlswork, clear
qui:mmqreg ln_w age ttl_exp tenure not_smsa south, ///
abs(idcode)
est sto m1
qui:mmqreg ln_w age ttl_exp tenure not_smsa south, ///
abs(idcode) robust
est sto m2
qui:mmqreg ln_w age ttl_exp tenure not_smsa south, ///
abs(idcode) cluster(idcode)
est sto m3
qui:mmqreg ln_w ttl_exp tenure not_smsa south, ///
abs(idcode age) cluster(idcode)
est sto m4
esttab m1 m2 m3 m4, se nomtitle nogap
(National Longitudinal Survey of Young Women, 14-24 years old in 1968)
----------------------------------------------------------------------------
(1) (2) (3) (4)
----------------------------------------------------------------------------
location
age -0.00268** -0.00268** -0.00268*
(0.000886) (0.000900) (0.00129)
ttl_exp 0.0288*** 0.0288*** 0.0288*** 0.0340***
(0.00148) (0.00154) (0.00227) (0.00230)
tenure 0.0114*** 0.0114*** 0.0114*** 0.0105***
(0.000898) (0.000996) (0.00147) (0.00145)
not_smsa -0.0922*** -0.0922*** -0.0922*** -0.0878***
(0.00992) (0.0105) (0.0141) (0.0139)
south -0.0633*** -0.0633*** -0.0633*** -0.0596***
(0.0117) (0.0121) (0.0168) (0.0167)
_cons 1.592*** 1.592*** 1.592*** 1.481***
(0.0193) (0.0196) (0.0279) (0.0146)
----------------------------------------------------------------------------
scale
age -0.000218 -0.000218 -0.000218
(0.000554) (0.000526) (0.000645)
ttl_exp 0.00369*** 0.00369*** 0.00369*** 0.00150
(0.000923) (0.000911) (0.00111) (0.00110)
tenure -0.00596*** -0.00596*** -0.00596*** -0.00562***
(0.000561) (0.000600) (0.000736) (0.000711)
not_smsa -0.00361 -0.00361 -0.00361 -0.00535
(0.00620) (0.00632) (0.00738) (0.00707)
south -0.00640 -0.00640 -0.00640 -0.00757
(0.00731) (0.00711) (0.00825) (0.00826)
_cons 0.188*** 0.188*** 0.188*** 0.193***
(0.0121) (0.0115) (0.0141) (0.00685)
----------------------------------------------------------------------------
qtile
age -0.00270** -0.00270** -0.00270*
(0.000874) (0.000890) (0.00127)
ttl_exp 0.0292*** 0.0292*** 0.0292*** 0.0341***
(0.00146) (0.00153) (0.00225) (0.00229)
tenure 0.0108*** 0.0108*** 0.0108*** 0.0102***
(0.000888) (0.000993) (0.00145) (0.00144)
not_smsa -0.0925*** -0.0925*** -0.0925*** -0.0880***
(0.00977) (0.0104) (0.0139) (0.0138)
south -0.0640*** -0.0640*** -0.0640*** -0.0599***
(0.0115) (0.0121) (0.0167) (0.0167)
_cons 1.611*** 1.611*** 1.611*** 1.490***
(0.0191) (0.0195) (0.0277) (0.0147)
----------------------------------------------------------------------------
N 28093 28093 28093 28093
----------------------------------------------------------------------------
Standard errors in parentheses
* p<0.05, ** p<0.01, *** p<0.001
sysuse auto, clear
qui:mmqreg price mpg trunk ,
est sto m1
qui:mmqreg price mpg trunk , robust
est sto m2
qui:bootstrap, reps(250):mmqreg price mpg trunk
est sto m3
(1978 automobile data)
------------------------------------------------------------
(1) (2) (3)
default Robust Bootstrap
------------------------------------------------------------
location
mpg -220.2 -220.2** -220.2**
(71642.8) (70.97) (75.58)
trunk 43.56 43.56 43.56
(119358.5) (69.99) (73.49)
_cons 10254.9 10254.9*** 10254.9***
(2845248.0) (2380.9) (2538.1)
------------------------------------------------------------
scale
mpg -103.7 -103.7 -103.7**
(91001.0) (70.15) (37.51)
trunk 21.61 21.61 21.61
(151609.7) (63.25) (60.22)
_cons 3929.4 3929.4 3929.4*
(3614045.1) (2285.9) (1583.8)
------------------------------------------------------------
qtile
mpg -172.7 -172.7*** -172.7*
(27175.8) (49.04) (71.63)
trunk 33.67 33.67 33.67
(49802.1) (49.12) (56.11)
_cons 8457.3 8457.3*** 8457.3***
(1129127.0) (1718.3) (2303.5)
------------------------------------------------------------
N 74 74 74
------------------------------------------------------------
Standard errors in parentheses
* p<0.05, ** p<0.01, *** p<0.001
Plotting: qreg vs mmqreg
Conclusions
In this presentation I present a quick review of quantile regressions, with emphasis on solutions for adding fixed effects.
The Location and Scale model help with the problem because it reduces the number of coefficients to needed to be estimated for consistent estimates.
- In MSS2019, inconsistencies are still observed with Small T. (get long data)
- In my context, this may be less biding, because is more likely to have many obs per Fixed characteristic.
While the model may be restrictive (multiplicative heteroskedasticity) it may still be useful for exploration, and analysis when other methodologies are not feasible.