Robust feature learning using contractive autoencoders for multi-omics clustering in cancer subtyping

•We propose a novel multi-omics framework using contractive autoencoder for robust feature extraction.•Cox regression integrates survival analysis to identify key survival-related features for clustering.•Experimental results demonstrate the superior performance of our method in multi-omics data ana...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Methods (San Diego, Calif.) Ročník 233; s. 52 - 60
Hlavní autori: Guo, Mengke, Ye, Xiucai, Huang, Dong, Sakurai, Tetsuya
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: United States Elsevier Inc 01.01.2025
Predmet:
ISSN:1046-2023, 1095-9130, 1095-9130
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:•We propose a novel multi-omics framework using contractive autoencoder for robust feature extraction.•Cox regression integrates survival analysis to identify key survival-related features for clustering.•Experimental results demonstrate the superior performance of our method in multi-omics data analysis. Cancer can manifest in virtually any tissue or organ, necessitating precise subtyping of cancer patients to enhance diagnosis, treatment, and prognosis. With the accumulation of vast amounts of omics data, numerous studies have focused on integrating multi-omics data for cancer subtyping using clustering techniques. However, due to the heterogeneity of different omics data, extracting important features to effectively integrate these data for accurate clustering analysis remains a significant challenge. This study proposes a new multi-omics clustering framework for cancer subtyping, which utilizes contractive autoencoder to extract robust features. By encouraging the learned representation to be less sensitive to small changes, the contractive autoencoder learns robust feature representations from different omics. To incorporate survival information into the clustering analysis, Cox proportional hazards regression is used to further select the key features significantly associated with survival for integration. Finally, we utilize K-means clustering on the integrated feature to obtain the clustering result. The proposed framework is evaluated on ten different cancer datasets across four levels of omics data and compared to other existing methods. The experimental results indicate that the proposed framework effectively integrates the four omics datasets and outperforms other methods, achieving higher C-index scores and showing more significant differences between survival curves. Additionally, differential gene analysis and pathway enrichment analysis are performed to further demonstrate the effectiveness of the proposed method framework.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1046-2023
1095-9130
1095-9130
DOI:10.1016/j.ymeth.2024.11.013