In this study, we consider PCA for Gaussian observations X1, …, X$$ with covariance Σ = ∑$$λ$$P$$ in the ’effective rank’ setting with model complexity governed by r(Σ) ≔ tr(Σ)∕∥Σ∥. We prove a Berry-Essen type bound for a Wald Statistic of the spectral projector . This can be used to construct non-asymptotic goodness of fit tests and confidence ellipsoids for spectral projectors P$$. Using higher order pertubation theory we are able to show that our Theorem remains valid even when .
Accepté le :
DOI : 10.1051/ps/2019002
Keywords: PCA, spectral projectors, central limit theorem, confidence sets, goodness of fit tests
Löffler, Matthias 1
@article{PS_2019__23__662_0,
author = {L\"offler, Matthias},
title = {Wald {Statistics} in high-dimensional {PCA}},
journal = {ESAIM: Probability and Statistics},
pages = {662--671},
year = {2019},
publisher = {EDP Sciences},
volume = {23},
doi = {10.1051/ps/2019002},
mrnumber = {4011571},
zbl = {1507.62260},
language = {en},
url = {https://www.numdam.org/articles/10.1051/ps/2019002/}
}
Löffler, Matthias. Wald Statistics in high-dimensional PCA. ESAIM: Probability and Statistics, Tome 23 (2019), pp. 662-671. doi: 10.1051/ps/2019002
[1] and , Asymptotic Chi-square tests for a large class of factor analysis models. Ann. Statist. 18 (1990) 1453–1463 | MR | Zbl | DOI
[2] , Asymptotic theory for principal component analysis. Ann. Math. Statist. 34 (1963) 122–148 | MR | Zbl | DOI
[3] , A testing method for covariance structure analysis. In Vol. 24 of IMS Lecture Notes Monogr. Ser., Inst. Math. Statist., Hayward, CA (1994) 123–136 | MR | DOI
[4] and , Optimal detection of sparse principal components in high dimension. Ann. Statist. 41 (2013) 1780–1815 | MR | Zbl | DOI
[5] , and , Sparse PCA: optimal rates and adaptive estimation. Ann. Statist. 41 (2013) 3074–3110 | MR | Zbl
[6] , Multivariate Statistics: A Vector Space Approach. John Wiley & Sons, Inc., New York (1983) | MR | Zbl
[7] and , Rate-optimal posterior contraction for sparse PCA. Ann. Statist. 43 (2015) 785–818 | MR | Zbl
[8] , On the distribution of the largest eigenvalue in principal components analysis. Ann. Statist. 29 (2001) 295–327 | MR | Zbl | DOI
[9] and , On consistency and sparsity for principal components analysis in high dimensions. J. Am. Statist. Assoc. 104 (2009) 682–693 | MR | Zbl | DOI
[10] and , Asymptotics and concentration bounds for bilinear forms of spectral projectors of sample covariance. Ann. Inst. Henri Poincaré Probab. Stat. 52 (2016) 1976–2013 | MR | Zbl | DOI
[11] and , Concentration inequalities and moment bounds for sample covariance operators. Bernoulli 23 (2017) 110–133 | MR | Zbl | DOI
[12] and , Normal approximation and concentration of spectral projectors of sample covariance. Ann. Statist. 45 (2017) 121–157 | MR | Zbl | DOI
[13] and , New asymptotic results in principal component analysis. Sankhya A 79 (2017) 254–297 | MR | Zbl | DOI
[14] , and , Efficient estimation of linear functionals of principal components. Preprint at (2017) | arXiv | MR
[15] , Finite sample approximation results for principal component analysis: a matrix perturbation approach. Ann. Statist. 36 (2008) 2791–2817 | MR | Zbl | DOI
[16] , and , Confidence sets for Spectral projectors of covariance matrices. Dokl. Math. 98 (2018) 511–514 | MR | Zbl | DOI
[17] , and , On estimation of the noise variance in high dimensional probabilistic principal component analysis. J. R. Statist. Soc. B 79 (2017) 51–67 | MR | Zbl | DOI
[18] , Asymptotics of sample eigenstructure for a large dimensional spiked covariance model. Stat. Sin. 17 (2007) 1617–1642 | MR | Zbl
[19] and , Non-asymptotic upper bounds for the reconstruction error of PCA. Preprint at (2016) | arXiv | MR
[20] and , Bayesian inference for spectral projectors of covariance matrix. Electr. J. Stat. 12 (2018) 1948–1987 | MR | Zbl
[21] , Assessing goodness of fit in confirmatory factor analysis. Meas. Eval. Counsel. Dev. 37 (2005) 240–256 | DOI
[22] and , Minimax sparse principal subspace estimation in high dimensions. Ann. Statist. 41 (2013) 2905–2947 | MR | Zbl
[23] , Tests of statistical hypotheses concerning several parameters when the number of observations is large. Trans. Am. Math. Soc. 54 (1943) 426–482 | MR | Zbl | DOI
[24] , and , Statistical and computational trade-offs in estimation of sparse principal components. Ann. Statist. 44 (2016) 1896–1930 | MR | Zbl | DOI
[25] and , Asymptotics of empirical eigenstructure for high dimensional spiked covariance. Ann. Statist. 45 (2017) 1342–1374 | MR | Zbl | DOI
Cité par Sources :





