Cross validation in LASSO and its acceleration

We investigate leave-one-out cross validation (CV) as a determinator of the weight of the penalty term in the least absolute shrinkage and selection operator (LASSO). First, on the basis of the message passing algorithm and a perturbative discussion assuming that the number of observations is suffic...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Journal of statistical mechanics Ročník 2016; číslo 5
Hlavní autori: Obuchi, Tomoyuki, Kabashima, Yoshiyuki
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: IOP Publishing and SISSA 31.05.2016
ISSN:1742-5468
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:We investigate leave-one-out cross validation (CV) as a determinator of the weight of the penalty term in the least absolute shrinkage and selection operator (LASSO). First, on the basis of the message passing algorithm and a perturbative discussion assuming that the number of observations is sufficiently large, we provide simple formulas for approximately assessing two types of CV errors, which enable us to significantly reduce the necessary cost of computation. These formulas also provide a simple connection of the CV errors to the residual sums of squares between the reconstructed and the given measurements. Second, on the basis of this finding, we analytically evaluate the CV errors when the design matrix is given as a simple random matrix in the large size limit by using the replica method. Finally, these results are compared with those of numerical simulations on finite-size systems and are confirmed to be correct. We also apply the simple formulas of the first type of CV error to an actual dataset of the supernovae.
Bibliografia:JSTAT_043P_1215
ISSN:1742-5468
DOI:10.1088/1742-5468/2016/05/053304