We investigate the optimality for model selection of the so-called slope heuristics, -fold cross-validation and -fold penalization in a heteroscedatic with random design regression context. We consider a new class of linear models that we call strongly localized bases and that generalize histograms, piecewise polynomials and compactly supported wavelets. We derive sharp oracle inequalities that prove the asymptotic optimality of the slope heuristics – when the optimal penalty shape is known – and -fold penalization. Furthermore, -fold cross-validation seems to be suboptimal for a fixed value of since it recovers asymptotically the oracle learned from a sample size equal to of the original amount of data. Our results are based on genuine concentration inequalities for the true and empirical excess risks that are of independent interest. We show in our experiments the good behavior of the slope heuristics for the selection of linear wavelet models. Furthermore, -fold cross-validation and -fold penalization have comparable efficiency.
Accepté le :
DOI : 10.1051/ps/2017005
Keywords: Nonparametric regression, heteroscedastic noise, random design, model selection, cross-validation, wavelets
Navarro, Fabien 1 ; Saumard, Adrien 1
@article{PS_2017__21__412_0,
author = {Navarro, Fabien and Saumard, Adrien},
title = {Slope heuristics and {V-Fold} model selection in heteroscedastic regression using strongly localized bases},
journal = {ESAIM: Probability and Statistics},
pages = {412--451},
year = {2017},
publisher = {EDP Sciences},
volume = {21},
doi = {10.1051/ps/2017005},
mrnumber = {3743921},
zbl = {1395.62093},
language = {en},
url = {https://www.numdam.org/articles/10.1051/ps/2017005/}
}
TY - JOUR AU - Navarro, Fabien AU - Saumard, Adrien TI - Slope heuristics and V-Fold model selection in heteroscedastic regression using strongly localized bases JO - ESAIM: Probability and Statistics PY - 2017 SP - 412 EP - 451 VL - 21 PB - EDP Sciences UR - https://www.numdam.org/articles/10.1051/ps/2017005/ DO - 10.1051/ps/2017005 LA - en ID - PS_2017__21__412_0 ER -
%0 Journal Article %A Navarro, Fabien %A Saumard, Adrien %T Slope heuristics and V-Fold model selection in heteroscedastic regression using strongly localized bases %J ESAIM: Probability and Statistics %D 2017 %P 412-451 %V 21 %I EDP Sciences %U https://www.numdam.org/articles/10.1051/ps/2017005/ %R 10.1051/ps/2017005 %G en %F PS_2017__21__412_0
Navarro, Fabien; Saumard, Adrien. Slope heuristics and V-Fold model selection in heteroscedastic regression using strongly localized bases. ESAIM: Probability and Statistics, Tome 21 (2017), pp. 412-451. doi: 10.1051/ps/2017005
, and , Wavelet estimators in nonparametric regression: a comparative simulation study. J. Stat. Softw. 6 (2001) 1–83. | DOI
, and , Wavelet methods for curve estimation. J. Amer. Statist. Assoc. 89 (1994) 1340–1353. | MR | Zbl | DOI
S. Arlot, -fold cross-validation improved: -fold penalization. Preprint (2008). | arXiv
S. Arlot, Choosing a penalty for model selection in heteroscedastic regression (2010). | arXiv
and Data-driven calibration of linear estimators with minimal penalties. Adv. Neural Infor. Process. Syst. 22 (2009) 46–54.
and , A survey of cross-validation procedures for model selection. Stat. Surv. 4 (2010) 40–79. | MR | Zbl | DOI
and , Segmentation of the mean of heteroscedastic data via cross-validation. Stat. Comput. 21 (2011) 613–632. | MR | Zbl | DOI
and , Choice of for -fold cross-validation in least-squares density estimation. J. Mach. Learn. Res. 17 (2016) 1–50. | MR | Zbl
and , Data-driven calibration of penalties for least-squares regression. J. Mach. Learn. Res. 10 (2009) 245–279.
, and , Slope heuristics: overview and implementation. Stat. Comput. 22 (2012) 455–470. | MR | Zbl | DOI
and , Minimum contrast estimators on sieves: exponential bounds and rates of convergence. Bernoulli 4 (1998) 329–375. | MR | Zbl | DOI
and , Minimal penalties for Gaussian model selection. Probab. Theory Related Fields 138 (2007) 33–73. | MR | Zbl | DOI
, A comparative study of ordinary cross-validation, -fold cross-validation and the repeated learning-testing methods. Biometrika 76 (1989) 503–514. | MR | Zbl | DOI
, Adaptive wavelet estimation: a block thresholding and oracle inequality approach. Ann. Statist. 27 (1999) 898–924. | MR | Zbl
and , Wavelet shrinkage for nonequispaced samples. Ann. Statist. 26 (1998) 1783–1799. | MR | Zbl
and , Wavelet estimation for samples with random uniform design. Statist. Probab. Lett. 42 (1999) 313–321. | MR | Zbl | DOI
G. Castellan, Modified Akaike’s criterion for histogram density estimation. Technical report 99.61, Université Paris-Sud (1999).
, A new perspective on least squares under convex constraint. Ann. Statist. 42 (2014) 2340–2381, 12. | MR | Zbl | DOI
, and , Wavelets on the interval and fast wavelet transforms. Appl. Comput. Harmon. Anal. 1 (1993) 54–81. | MR | Zbl | DOI
A. Donoho, D. Maleki and M. Shahram, Wavelab 850 (2006).
and , Ideal spatial adaptation by wavelet shrinkage. Biometrika 81 (1994) 425–455. | MR | Zbl | DOI
, The predictive sample reuse method with applications. J. Amer. Statist. Assoc. 70 (1975) 320–328. | Zbl | DOI
L. Györfi, M. Kohler, A. Krzyżak and H. Walk, A distribution-free theory of nonparametric regression. Springer Series in Statistics. Springer Verlag, New York (2002). | MR | Zbl
and , Interpolation methods for nonlinear wavelet regression with irregularly spaced design. Ann. Statist. 25 (1997) 1912–1925. | MR | Zbl | DOI
W. Härdle, G. Kerkyacharian, D. Picard and A. Tsybakov, Wavelets, approximation, and statistical applications. Vol. 129 of Lect. Notes Statist. Springer Verlag, New York (1998). | MR | Zbl
and , Wavelet regression in random design with heteroscedastic dependent errors. Ann. Statist. 37 (2009) 3396–3430. | MR | Zbl | DOI
and , Oracle inequalities for cross-validation type procedures. Electron. J. Stat. 6 (2012) 1803–1837. | MR | Zbl | DOI
, Optimal model selection for density estimation of stationary data under various mixing conditions. Ann. Statist. 39 (2011) 1852–1877. | MR | Zbl | DOI
, Optimal model selection in density estimation. Ann. Inst. Henri Poincaré Probab. Stat. 48 (2012) 884–908. | MR | Zbl | Numdam | DOI
S. Mallat, A wavelet tour of signal processing: the sparse way. Academic press (2008). | MR | Zbl
, , , and , Exact risk analysis of wavelet regression. J. Comput. Graph. Statist. 7 (1998) 278–309.
P. Massart, Concentration inequalities and model selection, Vol. 1896 of Lect. Notes Math. Springer, Berlin (2007). Lectures from the 33rd Summer School on Probability Theory held in Saint-Flour, With a foreword by Jean Picard (2003) 6–23. | MR | Zbl
A. Muro and S. van de Geer, Concentration behavior of the penalized least squares estimator. Preprint (2015). | arXiv | MR
G. Nason, Wavelet shrinkage using cross-validation. J.R. Stat. Soc. Ser. B (1996) 463–479. | MR | Zbl
A. Saumard, Nonasymptotic quasi-optimality of AIC and the slope heuristics in maximum likelihood estimation of density using histogram models (2010). hal-00512310.
, Optimal upper and lower bounds for the true and empirical excess risks in heteroscedastic least-squares regression. Electron. J. Statist. 6 (2012) 579–655. | MR | Zbl | DOI
, Optimal model selection in heteroscedastic regression using piecewise polynomial functions. Electron. J. Statist. 7 (2013) 1184–1223. | MR | Zbl | DOI
, Optimal global rates of convergence for nonparametric regression. Ann. Statist. 10 (1982) 1040–1053. | MR | Zbl | DOI
S. van de Geer and M. Wainwright, On concentration for (regularized) empirical risk minimization. Preprint (2016). | arXiv | MR
, Model selection in nonparametric regression. Ann. Statist. 31 (2003) 252–273. | MR | Zbl | DOI
Cité par Sources :






