Asymptotics of large-dimensional projection matrices (with Stanislav Anatolyev)
[
draft]
We characterize the joint asymptotic behavior of diagonal and off-diagonal elements of projection matrices, say $P = Z(Z'Z)^{-1}Z'$, whose underlying dimension $\ell$ is asymptotically proportional to sample size $n$, that is, $\ell/n \xrightarrow[]{} \alpha$ as $n \xrightarrow[]{} \infty$, with the aspect ratio $\alpha \in (0,1)$, under the rotated i.i.d. assumption for elements of $Z$. The rate of
convergence is $\sqrt{n}$, and the limiting distribution is multivariate centered Gaussian. The formulas for the asymptotic variances and covariances are expressed as functions
of $\alpha$ and moments of elements of $Z$. The instrumental tools in deriving the asymptotic
results are the Woodbury matrix identity, CLT for quadratic and bilinear forms, and
elements of the random matrix theory such as the Marchenko-Pastur law.
Many instruments estimation and inference under clustered dependence (with Stanislav Anatolyev)
[draft]
The literature on many weak instruments in a heteroskedastic environment under data independence is largely developed. When data dependence, in particular clustering, is present, it poses difficulties in making correct and convenient inferences about structural parameters. We show that clustering either deems the conventional jackknife instrumental variables estimation inconsistent or makes its inferences distorted. We suggest an alternative approach to the estimation of and making inferences about structural parameters, which is computationally attractive and allows general structures of intra-cluster correlations, presence of many instruments and possibly weak identification. We use the natural extension of jackknifing, the leave-cluster-out methodology, applied to the instrument projection matrix, which allows one to dispose of the cross-cluster dependencies in the influence function of the structural parameter estimate. We further weigh the observations by inverse cluster sizes to flexibly adjust for cluster size heterogeneity, which relaxes the usual requirements on the maximal cluster size growth and facilitates derivation of asymptotic properties. We set out a formal asymptotic framework to analyze the proposed weighted leave-cluster-out instrumental variables (WLCOIV) estimator, with an increasing number of clusters, possibly increasing cluster sizes, and presence of many possibly weak instruments. We prove a central limit theorem for the influence function embedded in the WLCOIV estimator under both strong and weak identification, and show consistency of the associated WLCOIV variance estimator. Finally, we run a small simulation experiment and illustrate with the celebrated Angrist and Krueger (1991) dataset, comparing the WLCOIV estimator to other estimators.