Multivariate return decomposition: theory and implications

Citation:

Anatolyev, Stanislav and Cheuk Fai Ng (2026) "Many-covariate and cluster robust estimation and inference", Econometric Reviews, vol. ??, no. ?, pp. ???-???

Abstract:

Empirical economists often use regression models employing large sets of covariates and presuming clustered data dependence. We provide inference methods for linear regressions with covariates whose number may be comparable to sample size, and observations that are clustered into possibly heterogeneous clusters. We present a leave-cluster-out-crossfit (LCOC) method of constructing an OLS asymptotic variance estimator, which extends leave-one-out variance estimation for independent data to clustered data, and which is robust to many covariates and heteroskedasticity. We show consistency of the LCOC estimator and asymptotic normality of the standardized OLS estimator. We demonstrate finite-sample properties of LCOC in simulations in comparison with available alternatives. Finally, we provide two empirical illustrations, where LCOC is applied to existing studies of effects of high school achievement awards and an impact of legalized abortion on crime reduction.

Paper in accepted version:

LCOC.pdf