Genomic Selection for Cashmere Traits in Inner Mongolian Cashmere Goats Using Random Forest, Gradient Boosting Decision Tree, Extreme Gradient Boosting and Light Gradient Boosting Machine Methods.

Saved in:
Bibliographic Details
Title: Genomic Selection for Cashmere Traits in Inner Mongolian Cashmere Goats Using Random Forest, Gradient Boosting Decision Tree, Extreme Gradient Boosting and Light Gradient Boosting Machine Methods.
Authors: Liu, Jiaqi, Yan, Xiaochun, Li, Wenze, Xue, Shan-Hui, Wang, Zhiying, Su, Rui
Source: Animals (2076-2615); Oct2025, Vol. 15 Issue 20, p2940, 14p
Subject Terms: CASHMERE, ANIMAL breeding, GENETICS, BOOSTING algorithms, ENSEMBLE learning, RANDOM forest algorithms, MACHINE learning
Abstract: Simple Summary: This study aims to perform genome selection of cashmere traits in Inner Mongolian cashmere goats using machine learning algorithms. By comparing the prediction accuracy of various machine learning algorithms, it explores the feasibility of applying different machine learning algorithms to genome selection of cashmere traits in Inner Mongolian cashmere goats, with the goal of improving the accuracy of genomic selection and enhancing breeding efficiency. Fiber length and cashmere production can enhance the economic value of cashmere goats. We analyzed cashmere trait data from 2299 cashmere goats, including fiber length, cashmere diameter, and cashmere production. We used RF, XGBoost, GBDT, and LightGBM for genome selection in Inner Mongolian cashmere goats. For fiber length, cashmere production, and cashmere diameter, LightGBM, RF, and GBDT achieved the highest selection accuracy after hyperparameter optimization. However, in the case of cashmere traits, the prediction accuracy of XGBoost was the lowest among all the models, at 0.541, 0.309, and 0.387 for fiber length, cashmere production, and cashmere diameter, respectively. For machine learning methods, hyperparameter tuning is essential, as it can improve prediction accuracy. In recent years, Machine Learning (ML) has garnered increasing attention for its applications in genomic prediction. ML effectively processes high-dimensional genomic data and establishes nonlinear models. Compared to traditional Genomic Selection (GS) methods, ML algorithms enhance computational efficiency and offer higher prediction accuracy. Therefore, this study strives to achieve the optimal machine learning algorithm for genome-wide selection of cashmere traits in Inner Mongolian cashmere goats. This study compared the genomic prediction accuracy of cashmere traits using four machine learning algorithms—Random Forest (RF), Extreme Gradient Boosting Tree (XGBoost), Gradient Boosting Decision Tree (GBDT), and LightGBM—based on genotype data and cashmere trait phenotypic data from 2299 Inner Mongolian cashmere goats. The results showed that after parameter optimization, LightGBM achieved the highest selection accuracy for fiber length (56.4%), RF achieved the highest selection accuracy for cashmere production (35.2%), and GBDT achieved the highest selection accuracy for cashmere diameter (40.4%), compared with GBLUP, the accuracy improved by 0.8–2.7%. Among the three traits, XGBoost exhibited the lowest prediction accuracy, at 0.541, 0.309, and 0.387. Additionally, following parameter optimization, the prediction accuracy of the four machine learning methods for cashmere fineness, cashmere yield, and fiber length improved by an average of 2.9%, 2.7%, and 3.8%, respectively. The mean squared error (MSE) and mean absolute error (MAE) for all machine learning methods also decreased, indicating that hyperparameter tuning can enhance prediction accuracy in ML algorithms. [ABSTRACT FROM AUTHOR]
Copyright of Animals (2076-2615) is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Biomedical Index
FullText Text:
  Availability: 0
CustomLinks:
  – Url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=search&db=pmc&term=2076-2615[TA]+AND+2940[PG]+AND+2025[PDAT]
    Name: FREE - PubMed Central (ISSN based link)
    Category: fullText
    Text: Full Text
    Icon: https://imageserver.ebscohost.com/NetImages/iconPdf.gif
    MouseOverText: Check this PubMed for the article full text.
  – Url: https://resolver.ebscohost.com/openurl?sid=EBSCO:edm&genre=article&issn=20762615&ISBN=&volume=15&issue=20&date=20251015&spage=2940&pages=2940-2953&title=Animals (2076-2615)&atitle=Genomic%20Selection%20for%20Cashmere%20Traits%20in%20Inner%20Mongolian%20Cashmere%20Goats%20Using%20Random%20Forest%2C%20Gradient%20Boosting%20Decision%20Tree%2C%20Extreme%20Gradient%20Boosting%20and%20Light%20Gradient%20Boosting%20Machine%20Methods.&aulast=Liu%2C%20Jiaqi&id=DOI:10.3390/ani15202940
    Name: Full Text Finder
    Category: fullText
    Text: Full Text Finder
    Icon: https://imageserver.ebscohost.com/branding/images/FTF.gif
    MouseOverText: Full Text Finder
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Liu%20J
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edm
DbLabel: Biomedical Index
An: 188957504
RelevancyScore: 1041
AccessLevel: 6
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 1040.80737304688
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Genomic Selection for Cashmere Traits in Inner Mongolian Cashmere Goats Using Random Forest, Gradient Boosting Decision Tree, Extreme Gradient Boosting and Light Gradient Boosting Machine Methods.
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Liu%2C+Jiaqi%22">Liu, Jiaqi</searchLink><br /><searchLink fieldCode="AR" term="%22Yan%2C+Xiaochun%22">Yan, Xiaochun</searchLink><br /><searchLink fieldCode="AR" term="%22Li%2C+Wenze%22">Li, Wenze</searchLink><br /><searchLink fieldCode="AR" term="%22Xue%2C+Shan-Hui%22">Xue, Shan-Hui</searchLink><br /><searchLink fieldCode="AR" term="%22Wang%2C+Zhiying%22">Wang, Zhiying</searchLink><br /><searchLink fieldCode="AR" term="%22Su%2C+Rui%22">Su, Rui</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Animals (2076-2615); Oct2025, Vol. 15 Issue 20, p2940, 14p
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22CASHMERE%22">CASHMERE</searchLink><br /><searchLink fieldCode="DE" term="%22ANIMAL+breeding%22">ANIMAL breeding</searchLink><br /><searchLink fieldCode="DE" term="%22GENETICS%22">GENETICS</searchLink><br /><searchLink fieldCode="DE" term="%22BOOSTING+algorithms%22">BOOSTING algorithms</searchLink><br /><searchLink fieldCode="DE" term="%22ENSEMBLE+learning%22">ENSEMBLE learning</searchLink><br /><searchLink fieldCode="DE" term="%22RANDOM+forest+algorithms%22">RANDOM forest algorithms</searchLink><br /><searchLink fieldCode="DE" term="%22MACHINE+learning%22">MACHINE learning</searchLink>
– Name: Abstract
  Label: Abstract
  Group: Ab
  Data: Simple Summary: This study aims to perform genome selection of cashmere traits in Inner Mongolian cashmere goats using machine learning algorithms. By comparing the prediction accuracy of various machine learning algorithms, it explores the feasibility of applying different machine learning algorithms to genome selection of cashmere traits in Inner Mongolian cashmere goats, with the goal of improving the accuracy of genomic selection and enhancing breeding efficiency. Fiber length and cashmere production can enhance the economic value of cashmere goats. We analyzed cashmere trait data from 2299 cashmere goats, including fiber length, cashmere diameter, and cashmere production. We used RF, XGBoost, GBDT, and LightGBM for genome selection in Inner Mongolian cashmere goats. For fiber length, cashmere production, and cashmere diameter, LightGBM, RF, and GBDT achieved the highest selection accuracy after hyperparameter optimization. However, in the case of cashmere traits, the prediction accuracy of XGBoost was the lowest among all the models, at 0.541, 0.309, and 0.387 for fiber length, cashmere production, and cashmere diameter, respectively. For machine learning methods, hyperparameter tuning is essential, as it can improve prediction accuracy. In recent years, Machine Learning (ML) has garnered increasing attention for its applications in genomic prediction. ML effectively processes high-dimensional genomic data and establishes nonlinear models. Compared to traditional Genomic Selection (GS) methods, ML algorithms enhance computational efficiency and offer higher prediction accuracy. Therefore, this study strives to achieve the optimal machine learning algorithm for genome-wide selection of cashmere traits in Inner Mongolian cashmere goats. This study compared the genomic prediction accuracy of cashmere traits using four machine learning algorithms—Random Forest (RF), Extreme Gradient Boosting Tree (XGBoost), Gradient Boosting Decision Tree (GBDT), and LightGBM—based on genotype data and cashmere trait phenotypic data from 2299 Inner Mongolian cashmere goats. The results showed that after parameter optimization, LightGBM achieved the highest selection accuracy for fiber length (56.4%), RF achieved the highest selection accuracy for cashmere production (35.2%), and GBDT achieved the highest selection accuracy for cashmere diameter (40.4%), compared with GBLUP, the accuracy improved by 0.8–2.7%. Among the three traits, XGBoost exhibited the lowest prediction accuracy, at 0.541, 0.309, and 0.387. Additionally, following parameter optimization, the prediction accuracy of the four machine learning methods for cashmere fineness, cashmere yield, and fiber length improved by an average of 2.9%, 2.7%, and 3.8%, respectively. The mean squared error (MSE) and mean absolute error (MAE) for all machine learning methods also decreased, indicating that hyperparameter tuning can enhance prediction accuracy in ML algorithms. [ABSTRACT FROM AUTHOR]
– Name: Abstract
  Label:
  Group: Ab
  Data: <i>Copyright of Animals (2076-2615) is the property of MDPI and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract.</i> (Copyright applies to all Abstracts.)
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edm&AN=188957504
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.3390/ani15202940
    Languages:
      – Code: eng
        Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 14
        StartPage: 2940
    Subjects:
      – SubjectFull: CASHMERE
        Type: general
      – SubjectFull: ANIMAL breeding
        Type: general
      – SubjectFull: GENETICS
        Type: general
      – SubjectFull: BOOSTING algorithms
        Type: general
      – SubjectFull: ENSEMBLE learning
        Type: general
      – SubjectFull: RANDOM forest algorithms
        Type: general
      – SubjectFull: MACHINE learning
        Type: general
    Titles:
      – TitleFull: Genomic Selection for Cashmere Traits in Inner Mongolian Cashmere Goats Using Random Forest, Gradient Boosting Decision Tree, Extreme Gradient Boosting and Light Gradient Boosting Machine Methods.
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Liu, Jiaqi
      – PersonEntity:
          Name:
            NameFull: Yan, Xiaochun
      – PersonEntity:
          Name:
            NameFull: Li, Wenze
      – PersonEntity:
          Name:
            NameFull: Xue, Shan-Hui
      – PersonEntity:
          Name:
            NameFull: Wang, Zhiying
      – PersonEntity:
          Name:
            NameFull: Su, Rui
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 15
              M: 10
              Text: Oct2025
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-print
              Value: 20762615
          Numbering:
            – Type: volume
              Value: 15
            – Type: issue
              Value: 20
          Titles:
            – TitleFull: Animals (2076-2615)
              Type: main
ResultId 1