Predictive modeling of breast cancer-related lymphedema using machine learning algorithms
Breast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and upper limb dysfunction, which has a serious impact on the physical and mental health and quality of life of patients. Previous studies have mos...
Gespeichert in:
| Veröffentlicht in: | Gland surgery Jg. 13; H. 12; S. 2243 |
|---|---|
| Hauptverfasser: | , , |
| Format: | Journal Article |
| Sprache: | Englisch |
| Veröffentlicht: |
China (Republic : 1949- )
31.12.2024
|
| Schlagworte: | |
| ISSN: | 2227-684X |
| Online-Zugang: | Weitere Angaben |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Abstract | Breast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and upper limb dysfunction, which has a serious impact on the physical and mental health and quality of life of patients. Previous studies have mostly used statistical methods such as linear regression and logistic regression to analyze the influencing factors, but all of them have certain limitations. Machine learning (ML) is an important branch of artificial intelligence, which can effectively overcome the problems of multivariate interaction and collinearity. This study aimed to explore the influencing factors for the occurrence of BCRL in breast cancer patients, and construct a predictive model with ML algorithms and validate its predictive value on this basis.
Clinical data of breast cancer patients admitted to Hainan Cancer Hospital from September 2018 to May 2024 were retrospectively collected. BCRL was considered as the outcome measurement, and the data were divided into training and validation sets in a ratio of 7:3. In the training set, random forest (RF), support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms were used to construct predictive models. The discrimination accuracy of the models was evaluated with receiver operating characteristic (ROC) curve analysis, sensitivity, specificity, and F1 score. The calibration of the models was assessed using calibration curves and the Hosmer-Lemeshow (H-L) Chi-squared test.
Two hundred and forty patients who met the inclusion criteria were screened, and they were randomly divided into a training set (168 patients) and a validation set (72 patients) in a 7:3 ratio. In the training set, 44 cases developed BCRL, while 124 did not. There were statistically significant differences (P<0.05) in hypertension history, number of dissected lymph nodes, postoperative complications, postoperative functional exercises, chemotherapy, radiotherapy, tumor node metastasis (TNM) stage, and level of axillary lymph node dissection between the BCRL and non-BCRL groups. Among the four models, the XGBoost model showed the best predictive performance, with an area under the curve (AUC) of 0.99 in the training set and 0.89 in the validation set. The XGBoost model demonstrated good calibration in both the training and validation sets, showing good consistency with the ideal model.
The ML-based XGBoost model for predicting BCRL exhibits excellent performance and assists healthcare professionals in rapidly and accurately assessing the risk of BCRL occurrence. |
|---|---|
| AbstractList | Breast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and upper limb dysfunction, which has a serious impact on the physical and mental health and quality of life of patients. Previous studies have mostly used statistical methods such as linear regression and logistic regression to analyze the influencing factors, but all of them have certain limitations. Machine learning (ML) is an important branch of artificial intelligence, which can effectively overcome the problems of multivariate interaction and collinearity. This study aimed to explore the influencing factors for the occurrence of BCRL in breast cancer patients, and construct a predictive model with ML algorithms and validate its predictive value on this basis.BackgroundBreast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and upper limb dysfunction, which has a serious impact on the physical and mental health and quality of life of patients. Previous studies have mostly used statistical methods such as linear regression and logistic regression to analyze the influencing factors, but all of them have certain limitations. Machine learning (ML) is an important branch of artificial intelligence, which can effectively overcome the problems of multivariate interaction and collinearity. This study aimed to explore the influencing factors for the occurrence of BCRL in breast cancer patients, and construct a predictive model with ML algorithms and validate its predictive value on this basis.Clinical data of breast cancer patients admitted to Hainan Cancer Hospital from September 2018 to May 2024 were retrospectively collected. BCRL was considered as the outcome measurement, and the data were divided into training and validation sets in a ratio of 7:3. In the training set, random forest (RF), support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms were used to construct predictive models. The discrimination accuracy of the models was evaluated with receiver operating characteristic (ROC) curve analysis, sensitivity, specificity, and F1 score. The calibration of the models was assessed using calibration curves and the Hosmer-Lemeshow (H-L) Chi-squared test.MethodsClinical data of breast cancer patients admitted to Hainan Cancer Hospital from September 2018 to May 2024 were retrospectively collected. BCRL was considered as the outcome measurement, and the data were divided into training and validation sets in a ratio of 7:3. In the training set, random forest (RF), support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms were used to construct predictive models. The discrimination accuracy of the models was evaluated with receiver operating characteristic (ROC) curve analysis, sensitivity, specificity, and F1 score. The calibration of the models was assessed using calibration curves and the Hosmer-Lemeshow (H-L) Chi-squared test.Two hundred and forty patients who met the inclusion criteria were screened, and they were randomly divided into a training set (168 patients) and a validation set (72 patients) in a 7:3 ratio. In the training set, 44 cases developed BCRL, while 124 did not. There were statistically significant differences (P<0.05) in hypertension history, number of dissected lymph nodes, postoperative complications, postoperative functional exercises, chemotherapy, radiotherapy, tumor node metastasis (TNM) stage, and level of axillary lymph node dissection between the BCRL and non-BCRL groups. Among the four models, the XGBoost model showed the best predictive performance, with an area under the curve (AUC) of 0.99 in the training set and 0.89 in the validation set. The XGBoost model demonstrated good calibration in both the training and validation sets, showing good consistency with the ideal model.ResultsTwo hundred and forty patients who met the inclusion criteria were screened, and they were randomly divided into a training set (168 patients) and a validation set (72 patients) in a 7:3 ratio. In the training set, 44 cases developed BCRL, while 124 did not. There were statistically significant differences (P<0.05) in hypertension history, number of dissected lymph nodes, postoperative complications, postoperative functional exercises, chemotherapy, radiotherapy, tumor node metastasis (TNM) stage, and level of axillary lymph node dissection between the BCRL and non-BCRL groups. Among the four models, the XGBoost model showed the best predictive performance, with an area under the curve (AUC) of 0.99 in the training set and 0.89 in the validation set. The XGBoost model demonstrated good calibration in both the training and validation sets, showing good consistency with the ideal model.The ML-based XGBoost model for predicting BCRL exhibits excellent performance and assists healthcare professionals in rapidly and accurately assessing the risk of BCRL occurrence.ConclusionsThe ML-based XGBoost model for predicting BCRL exhibits excellent performance and assists healthcare professionals in rapidly and accurately assessing the risk of BCRL occurrence. Breast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and upper limb dysfunction, which has a serious impact on the physical and mental health and quality of life of patients. Previous studies have mostly used statistical methods such as linear regression and logistic regression to analyze the influencing factors, but all of them have certain limitations. Machine learning (ML) is an important branch of artificial intelligence, which can effectively overcome the problems of multivariate interaction and collinearity. This study aimed to explore the influencing factors for the occurrence of BCRL in breast cancer patients, and construct a predictive model with ML algorithms and validate its predictive value on this basis. Clinical data of breast cancer patients admitted to Hainan Cancer Hospital from September 2018 to May 2024 were retrospectively collected. BCRL was considered as the outcome measurement, and the data were divided into training and validation sets in a ratio of 7:3. In the training set, random forest (RF), support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms were used to construct predictive models. The discrimination accuracy of the models was evaluated with receiver operating characteristic (ROC) curve analysis, sensitivity, specificity, and F1 score. The calibration of the models was assessed using calibration curves and the Hosmer-Lemeshow (H-L) Chi-squared test. Two hundred and forty patients who met the inclusion criteria were screened, and they were randomly divided into a training set (168 patients) and a validation set (72 patients) in a 7:3 ratio. In the training set, 44 cases developed BCRL, while 124 did not. There were statistically significant differences (P<0.05) in hypertension history, number of dissected lymph nodes, postoperative complications, postoperative functional exercises, chemotherapy, radiotherapy, tumor node metastasis (TNM) stage, and level of axillary lymph node dissection between the BCRL and non-BCRL groups. Among the four models, the XGBoost model showed the best predictive performance, with an area under the curve (AUC) of 0.99 in the training set and 0.89 in the validation set. The XGBoost model demonstrated good calibration in both the training and validation sets, showing good consistency with the ideal model. The ML-based XGBoost model for predicting BCRL exhibits excellent performance and assists healthcare professionals in rapidly and accurately assessing the risk of BCRL occurrence. |
| Author | Sun, Yang Xia, Xiaomin Liu, Xia |
| Author_xml | – sequence: 1 givenname: Yang surname: Sun fullname: Sun, Yang organization: Department of Breast Oncology, Hainan Cancer Hospital, Haikou, China – sequence: 2 givenname: Xiaomin surname: Xia fullname: Xia, Xiaomin organization: Department of Breast Oncology, Hainan Cancer Hospital, Haikou, China – sequence: 3 givenname: Xia surname: Liu fullname: Liu, Xia organization: Department of Breast Oncology, Hainan Cancer Hospital, Haikou, China |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/39822356$$D View this record in MEDLINE/PubMed |
| BookMark | eNo1kMtKAzEYRrOo2Fq78QFklm5Gk0wuzVKKNyjoQkFXQy7_tJEkU5MZoW9vxbr64HA4i-8MTVKfAKELgq8pwY282ZSasppyOkEzSqmsxZK9T9GilE-MMWkoE4KeommjlpQ2XMzQx0sG5-3gv6GKvYPg06bqu8pk0GWorE4Wcp0h6AFcFfZxtwUHUVdj-TWjtlufoAqgc_oFOmz67IdtLOfopNOhwOK4c_R2f_e6eqzXzw9Pq9t1bRuphpo7bJiWDJxTmBtm1VIqYZjrlFWCW2wc4wqU0LIjYIAr4iQzRGmmNDGGztHVX3eX-68RytBGXyyEoBP0Y2kbwoU8RKU4qJdHdTQRXLvLPuq8b__voD9E9WMv |
| CitedBy_id | crossref_primary_10_1038_s41598_025_95604_8 crossref_primary_10_1155_humu_9755727 |
| ContentType | Journal Article |
| Copyright | 2024 AME Publishing Company. All rights reserved. |
| Copyright_xml | – notice: 2024 AME Publishing Company. All rights reserved. |
| DBID | NPM 7X8 |
| DOI | 10.21037/gs-24-252 |
| DatabaseName | PubMed MEDLINE - Academic |
| DatabaseTitle | PubMed MEDLINE - Academic |
| DatabaseTitleList | MEDLINE - Academic PubMed |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: 7X8 name: MEDLINE - Academic url: https://search.proquest.com/medline sourceTypes: Aggregation Database |
| DeliveryMethod | no_fulltext_linktorsrc |
| ExternalDocumentID | 39822356 |
| Genre | Journal Article |
| GroupedDBID | ADBBV ALMA_UNASSIGNED_HOLDINGS BAWUL DIK GX1 HYE NPM OK1 RPM 7X8 |
| ID | FETCH-LOGICAL-c379t-5d0b4a74edd905b4c98796b4df9c965c0bd459e96a7f1ebe591d74b19a49a1bb2 |
| IEDL.DBID | 7X8 |
| ISICitedReferencesCount | 2 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001406233500005&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 2227-684X |
| IngestDate | Thu Sep 04 16:53:14 EDT 2025 Mon Jan 20 15:11:55 EST 2025 |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 12 |
| Keywords | Breast cancer machine learning (ML) predictive model eXtreme Gradient Boosting algorithms (XGBoost algorithms) breast cancer-related lymphedema (BCRL) |
| Language | English |
| License | 2024 AME Publishing Company. All rights reserved. |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c379t-5d0b4a74edd905b4c98796b4df9c965c0bd459e96a7f1ebe591d74b19a49a1bb2 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| OpenAccessLink | https://doi.org/10.21037/gs-24-252 |
| PMID | 39822356 |
| PQID | 3156798776 |
| PQPubID | 23479 |
| ParticipantIDs | proquest_miscellaneous_3156798776 pubmed_primary_39822356 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-Dec-31 |
| PublicationDateYYYYMMDD | 2024-12-31 |
| PublicationDate_xml | – month: 12 year: 2024 text: 2024-Dec-31 day: 31 |
| PublicationDecade | 2020 |
| PublicationPlace | China (Republic : 1949- ) |
| PublicationPlace_xml | – name: China (Republic : 1949- ) |
| PublicationTitle | Gland surgery |
| PublicationTitleAlternate | Gland Surg |
| PublicationYear | 2024 |
| SSID | ssj0001324662 |
| Score | 2.2922566 |
| Snippet | Breast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and... |
| SourceID | proquest pubmed |
| SourceType | Aggregation Database Index Database |
| StartPage | 2243 |
| Title | Predictive modeling of breast cancer-related lymphedema using machine learning algorithms |
| URI | https://www.ncbi.nlm.nih.gov/pubmed/39822356 https://www.proquest.com/docview/3156798776 |
| Volume | 13 |
| WOSCitedRecordID | wos001406233500005&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8NAEF7UevDiA1_1xQpeF5PsK3sSEYsXSw8K8RT2GYW2qU0U_PfublI8CYKX3EKGmcm8Zz4Argx2WapFhhhRGBFuDZJhYzkjWinj_QUlOoJN8PE4Lwox6QtuTT9WubKJ0VCbWoca-TX2iQb3CTJnN4t3FFCjQne1h9BYBwPsQ5mg1bzIf2osPlpgEVM0bHwilpOiu1Caxbs-VeNJQlnYOfotuoxeZrTzX_p2wXYfX8LbTiH2wJqd74OXyTL0Y4JlgxH6xvsrWDuowkR6C3WQ_BLFtRZr4PTLS9gaO5MwTMVXcBYHLi3sESYqKKeV_3T7OmsOwPPo_unuAfWYCkhjLlpETaKI5MQaIxKqiPYkC6aIcUILRnWiDKHCCia5S72AqUgNJyoVkgiZKpUdgo15PbfHAOZEO58NOWtkToijgihJTR4uhCXaYjMElytOlV5nQyNCzm390ZQ_vBqCo47d5aI7rlHicFAQU3byh7dPwVbmY4zu7uIZGDj_x9pzsKk_27dmeRGVwT_Hk8dvnALBQg |
| linkProvider | ProQuest |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Predictive+modeling+of+breast+cancer-related+lymphedema+using+machine+learning+algorithms&rft.jtitle=Gland+surgery&rft.au=Sun%2C+Yang&rft.au=Xia%2C+Xiaomin&rft.au=Liu%2C+Xia&rft.date=2024-12-31&rft.issn=2227-684X&rft.volume=13&rft.issue=12&rft.spage=2243&rft_id=info:doi/10.21037%2Fgs-24-252&rft_id=info%3Apmid%2F39822356&rft_id=info%3Apmid%2F39822356&rft.externalDocID=39822356 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2227-684X&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2227-684X&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2227-684X&client=summon |