A novel computational framework for integrating multidimensional data to enhance accuracy in predicting the prognosis of colorectal cancer

Accurate prognosis prediction is the key to achieving precision treatment and guiding the selection of adjuvant chemotherapy in high‐risk stage II/III colorectal cancer (CRC) patients. Here we developed a novel machine learning method, the random non‐negative matrix factorization (RNMF) algorithm, w...

Full description

Saved in:
Bibliographic Details
Published in:MedComm - Future medicine Vol. 1; no. 2
Main Authors: Zhang, Qinran, Xu, Yuhong, Kang, Shiyang, Chen, Junquan, Yao, Zhihao, Wang, Haitao, Wu, Qinian, Zhao, Qi, Zhang, Qihua, Xu, Rui‐hua, Zou, Xiufen, Luo, Huiyan
Format: Journal Article
Language:English
Published: London John Wiley & Sons, Inc 01.09.2022
Wiley
Subjects:
ISSN:2769-6456, 2769-6456
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Accurate prognosis prediction is the key to achieving precision treatment and guiding the selection of adjuvant chemotherapy in high‐risk stage II/III colorectal cancer (CRC) patients. Here we developed a novel machine learning method, the random non‐negative matrix factorization (RNMF) algorithm, which outperformed traditional non‐negative matrix factorization in terms of computational speed, accuracy, and robustness in simulated data sets. Moreover, based on multidimensional data from CRC patients from The Cancer Genome Atlas database and DNA methylation data from those from Sun Yat‐sen University cancer center, we further demonstrated the excellent performance of a novel prognostic computational framework based on the RNMF (PCF_RNMF), which is capable of integrating multidimensional training while allowing survival prediction when single dimensional data for validation is provided. This novel algorithm has great potential to mitigate the challenge of integrating various types of data in public databases with clinically available single‐dimensional data to allow cost‐effective survival prediction for CRC patients in clinical practice. The computation framework prognostic computational framework is based on random non‐negative matrix factorization (RNMF). (a) The multidimensional data in TCGA were used to select features. (b) The RNMF method was used to obtain the survival score matrix and the prediction coefficient matrix. (c) The prognosis model was built by using Cox stepwise regression. (d) The corresponding prognostic score can be obtained based on the prediction coefficient matrix of the methylation dimension by RNMF in (b).
Bibliography:Qinran Zhang, Yuhong Xu, ShiyangKang, and Junquan Chen contributed equally to this study.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2769-6456
2769-6456
DOI:10.1002/mef2.27