A novel computational framework for integrating multidimensional data to enhance accuracy in predicting the prognosis of colorectal cancer
Accurate prognosis prediction is the key to achieving precision treatment and guiding the selection of adjuvant chemotherapy in high‐risk stage II/III colorectal cancer (CRC) patients. Here we developed a novel machine learning method, the random non‐negative matrix factorization (RNMF) algorithm, w...
Saved in:
| Published in: | MedComm - Future medicine Vol. 1; no. 2 |
|---|---|
| Main Authors: | , , , , , , , , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
London
John Wiley & Sons, Inc
01.09.2022
Wiley |
| Subjects: | |
| ISSN: | 2769-6456, 2769-6456 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Accurate prognosis prediction is the key to achieving precision treatment and guiding the selection of adjuvant chemotherapy in high‐risk stage II/III colorectal cancer (CRC) patients. Here we developed a novel machine learning method, the random non‐negative matrix factorization (RNMF) algorithm, which outperformed traditional non‐negative matrix factorization in terms of computational speed, accuracy, and robustness in simulated data sets. Moreover, based on multidimensional data from CRC patients from The Cancer Genome Atlas database and DNA methylation data from those from Sun Yat‐sen University cancer center, we further demonstrated the excellent performance of a novel prognostic computational framework based on the RNMF (PCF_RNMF), which is capable of integrating multidimensional training while allowing survival prediction when single dimensional data for validation is provided. This novel algorithm has great potential to mitigate the challenge of integrating various types of data in public databases with clinically available single‐dimensional data to allow cost‐effective survival prediction for CRC patients in clinical practice.
The computation framework prognostic computational framework is based on random non‐negative matrix factorization (RNMF). (a) The multidimensional data in TCGA were used to select features. (b) The RNMF method was used to obtain the survival score matrix and the prediction coefficient matrix. (c) The prognosis model was built by using Cox stepwise regression. (d) The corresponding prognostic score can be obtained based on the prediction coefficient matrix of the methylation dimension by RNMF in (b). |
|---|---|
| Bibliography: | Qinran Zhang, Yuhong Xu, ShiyangKang, and Junquan Chen contributed equally to this study. ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 2769-6456 2769-6456 |
| DOI: | 10.1002/mef2.27 |