Predictive modeling of breast cancer-related lymphedema using machine learning algorithms

Bibliographic details
Published in: Gland surgery, Volume 13, Issue 12, p. 2243
Main authors: Sun, Yang; Xia, Xiaomin; Liu, Xia
Format: Journal Article
Language: English
Published: China (Republic : 1949- ), 31 December 2024
ISSN: 2227-684X
Abstract
Background: Breast cancer-related lymphedema (BCRL) is a common complication after breast cancer surgery. It can lead to limb swelling, deformation, and upper limb dysfunction, and it seriously affects patients' physical and mental health and quality of life. Previous studies have mostly used statistical methods such as linear regression and logistic regression to analyze the influencing factors, but these methods have limitations. Machine learning (ML), an important branch of artificial intelligence, can effectively handle multivariate interaction and collinearity. This study aimed to explore the factors influencing the occurrence of BCRL in breast cancer patients, to construct a predictive model with ML algorithms, and to validate its predictive value.
Methods: Clinical data of breast cancer patients admitted to Hainan Cancer Hospital from September 2018 to May 2024 were retrospectively collected. BCRL was the outcome measure, and the data were divided into training and validation sets in a 7:3 ratio. In the training set, random forest (RF), support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms were used to construct predictive models. Model discrimination was evaluated with receiver operating characteristic (ROC) curve analysis, sensitivity, specificity, and F1 score. Model calibration was assessed using calibration curves and the Hosmer-Lemeshow (H-L) chi-squared test.
Results: Two hundred and forty patients met the inclusion criteria and were randomly divided into a training set (168 patients) and a validation set (72 patients). In the training set, 44 patients developed BCRL and 124 did not. The BCRL and non-BCRL groups differed significantly (P<0.05) in hypertension history, number of dissected lymph nodes, postoperative complications, postoperative functional exercises, chemotherapy, radiotherapy, tumor node metastasis (TNM) stage, and level of axillary lymph node dissection. Among the four models, the XGBoost model showed the best predictive performance, with an area under the curve (AUC) of 0.99 in the training set and 0.89 in the validation set. The XGBoost model was well calibrated in both the training and validation sets, showing good consistency with the ideal model.
Conclusions: The ML-based XGBoost model for predicting BCRL shows excellent performance and can assist healthcare professionals in rapidly and accurately assessing the risk of BCRL occurrence.
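The evaluation measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: it uses only the Python standard library, the function names are hypothetical, and the labels and predicted risks are invented toy values rather than study data. It computes sensitivity, specificity, and F1 at an assumed 0.5 threshold, the AUC as the probability that a randomly chosen positive case is ranked above a randomly chosen negative one, and the Hosmer-Lemeshow statistic over groups of near-equal size ordered by predicted risk.

```python
def discrimination_metrics(y_true, y_prob, threshold=0.5):
    """Sensitivity, specificity and F1 after thresholding predicted risks."""
    tp = fp = tn = fn = 0
    for y, p in zip(y_true, y_prob):
        pred = 1 if p >= threshold else 0
        if pred == 1 and y == 1:
            tp += 1
        elif pred == 1 and y == 0:
            fp += 1
        elif pred == 0 and y == 0:
            tn += 1
        else:
            fn += 1
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    f1 = (2 * precision * sensitivity / (precision + sensitivity)
          if precision + sensitivity else 0.0)
    return {"sensitivity": sensitivity, "specificity": specificity, "f1": f1}


def roc_auc(y_true, y_prob):
    """AUC via pairwise comparison of positives and negatives (ties count 0.5)."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if pp > pn else 0.5 if pp == pn else 0.0
               for pp in pos for pn in neg)
    return wins / (len(pos) * len(neg))


def hosmer_lemeshow(y_true, y_prob, groups=10):
    """Return (statistic, degrees of freedom) for the H-L goodness-of-fit test."""
    pairs = sorted(zip(y_prob, y_true))  # order cases by predicted risk
    n = len(pairs)
    if n < groups:
        raise ValueError("need at least one case per group")
    stat = 0.0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        n_g = len(chunk)
        expected = sum(p for p, _ in chunk)   # expected number of events
        observed = sum(y for _, y in chunk)   # observed number of events
        mean_p = expected / n_g
        denom = n_g * mean_p * (1.0 - mean_p)
        if denom > 0:
            stat += (observed - expected) ** 2 / denom
    return stat, groups - 2


# Invented toy data for illustration only (1 = event, 0 = no event).
y_true = [0, 0, 1, 0, 1, 1, 0, 1]
y_prob = [0.1, 0.3, 0.35, 0.4, 0.6, 0.7, 0.8, 0.9]

print(discrimination_metrics(y_true, y_prob))
print(roc_auc(y_true, y_prob))
print(hosmer_lemeshow(y_true, y_prob, groups=4))
```

A small H-L statistic relative to a chi-squared distribution with the returned degrees of freedom indicates no evidence of poor calibration. In practice, library implementations would normally replace these hand-written functions.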
Author affiliations:
Sun, Yang: Department of Breast Oncology, Hainan Cancer Hospital, Haikou, China
Xia, Xiaomin: Department of Breast Oncology, Hainan Cancer Hospital, Haikou, China
Liu, Xia: Department of Breast Oncology, Hainan Cancer Hospital, Haikou, China
PubMed record: https://www.ncbi.nlm.nih.gov/pubmed/39822356
Copyright 2024 AME Publishing Company. All rights reserved.
DOI 10.21037/gs-24-252
Open access: yes
Peer reviewed: yes
Keywords: breast cancer; machine learning (ML); predictive model; eXtreme Gradient Boosting algorithms (XGBoost algorithms); breast cancer-related lymphedema (BCRL)
Open access link: https://doi.org/10.21037/gs-24-252
PMID: 39822356