A Newton-Type Method for ℓ0-Regularized Accelerated Failure Time Model Under the Case–Cohort Design

The case–cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In this paper, we study the high-dimensional accelerated failure time (AFT) model under the case–cohort design. Based on ℓ 0 -regularization and a newly defined weight function, we propo...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Acta mathematica Sinica. English series Ročník 41; číslo 9; s. 2275 - 2300
Hlavní autoři:	Liu, Yanyan, Tian, Ke, Wang, Danlu, Zhang, Jing
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Berlin/Heidelberg Springer Berlin Heidelberg 01.09.2025 Springer Nature B.V
Témata:	Algorithms Failure times Gene expression Least squares method Mathematics Mathematics and Statistics Newton methods Parameter estimation Regularization Weighting functions 62N01 62P10 62N02 newton-type method regularization case–cohort design 62D99 Accelerated failure time model support detection and root finding algorithm
ISSN:	1439-8516, 1439-7617
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	The case–cohort design has been widely used to reduce the cost of covariate measurements in large cohort studies. In this paper, we study the high-dimensional accelerated failure time (AFT) model under the case–cohort design. Based on ℓ 0 -regularization and a newly defined weight function, we propose a weighted least squares procedure for variable selection and parameter estimation. Computationally, we develop a support detection and root finding (SDAR) algorithm, where the support is first determined based on the primal and dual information, then the estimator is obtained by solving the weighted least squares problem restricted to the estimated support. We show the proposed algorithm is essentially one Newton-type algorithm, thus it is more efficient and stable compared with other regularized methods. Theoretically, we establish a sharp error bound for the solution sequences generated from the proposed method. Furthermore, we propose an adaptive version of the proposed SDAR algorithm, which determines the support size of the estimated coefficient in a data-driven manner. Extensive simulation studies demonstrate the superior performance of the proposed procedures, especially for the computational efficiency. As an illustration, we apply the proposed method to a malignant breast tumor gene expression data.
Bibliografie:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1439-8516 1439-7617
DOI:	10.1007/s10114-025-3226-2