Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble

Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences featu...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:BMC bioinformatics Ročník 22; číslo Suppl 3; s. 340 - 21
Hlavní autoři: Wang, Shunfang, Deng, Lin, Xia, Xinnan, Cao, Zicheng, Fei, Yu
Médium: Journal Article
Jazyk:angličtina
Vydáno: London BioMed Central 23.06.2021
BioMed Central Ltd
Springer Nature B.V
BMC
Témata:
ISSN:1471-2105, 1471-2105
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.
AbstractList Abstract Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.
Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.
Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.
Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance.BACKGROUNDAntifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance.In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC.RESULTSIn this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC.The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.CONCLUSIONThe experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.
Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. Keywords: Antifreeze proteins prediction, Weighted general dipeptide composition, Lasso regression, Ridge regression, Ensemble feature selection, Two-stage multiple regressions
Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.
Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.
ArticleNumber 340
Audience Academic
Author Wang, Shunfang
Xia, Xinnan
Deng, Lin
Fei, Yu
Cao, Zicheng
Author_xml – sequence: 1
  givenname: Shunfang
  orcidid: 0000-0002-1927-8753
  surname: Wang
  fullname: Wang, Shunfang
  email: sfwang_66@ynu.edu.cn
  organization: Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University
– sequence: 2
  givenname: Lin
  surname: Deng
  fullname: Deng, Lin
  organization: Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University
– sequence: 3
  givenname: Xinnan
  surname: Xia
  fullname: Xia, Xinnan
  email: xiaxinnan1@163.com
  organization: Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University
– sequence: 4
  givenname: Zicheng
  surname: Cao
  fullname: Cao, Zicheng
  organization: School of Public Health (Shenzhen), Sun Yat-Sen University
– sequence: 5
  givenname: Yu
  surname: Fei
  fullname: Fei, Yu
  email: feiyu@ynufe.edu.cn
  organization: School of Statistics and Mathematics, Yunnan University of Finance and Economics
BackLink https://www.ncbi.nlm.nih.gov/pubmed/34162327$$D View this record in MEDLINE/PubMed
BookMark eNp9kktv1DAUhSNURB_wB1igSGxgkeJn4tkgVRWPkSqBeKwtx77JeJTYwXYozILfjqdT2k6FqixsXX_nODk5x8WB8w6K4jlGpxiL-k3ERPBFhQiuECMcV5tHxRFmDa4IRvzgzv6wOI5xjRBuBOJPikPKcE0oaY6KP58DGKuTdX2pXLJdANhAOQWfwLpYXtq0Ki_B9qsEpuzBQVCD3eS9sRNMyRootR8nH22y3mUPU47zkGwVoA8Q43bYgUpzgDLCAPoKAxdhbAd4Wjzu1BDh2fV6Unx__-7b-cfq4tOH5fnZRaV5g1OlGGEEMY1Zx7BqGMOM19QYoAqRFgMC6LhhugGmCaNYtAZMJ0jLFRZaU3pSLHe-xqu1nIIdVfgtvbLyauBDL1VIVg8gabMATBmlpDasbers0QhOa6QAqO5I9nq785rmdgSjwaWcyZ7p_omzK9n7n1IQgupFnQ1eXRsE_2OGmORoo4ZhUA78HCXhjIlGMCIy-vIeuvZzcDmqTHGS_yaq-S3Vq_wB1nU-36u3pvKsbgipUcO21Ol_qPwYGK3O1epsnu8JXu8JMpPgV-rVHKNcfv2yz764G8pNGv-algGyA3TwMQbobhCM5LbOcldnmessr-osN1kk7om0TWrboPzqdnhYSnfSmO9xPYTb5B5Q_QWYJAsJ
CitedBy_id crossref_primary_10_1093_bib_bbaf026
crossref_primary_10_1242_jeb_243409
crossref_primary_10_1186_s12986_025_00917_0
crossref_primary_10_1016_j_compbiomed_2024_108534
crossref_primary_10_1016_j_isci_2025_112077
crossref_primary_10_1109_TCBB_2024_3467261
crossref_primary_10_1016_j_biotechadv_2025_108545
crossref_primary_10_1016_j_compbiolchem_2022_107680
Cites_doi 10.1016/j.compbiolchem.2019.107094
10.1093/bioinformatics/btl158
10.1093/bfgp/elz036
10.1016/j.bbrc.2007.01.011
10.1080/00401706.1970.10488634
10.1109/TNNLS.2012.2199516
10.1007/s13369-018-03713-6
10.1016/j.omtn.2020.05.006
10.1002/med.21658
10.1093/nar/gkv1266
10.1016/j.jtbi.2008.02.031
10.3390/ijms18122718
10.1186/s12859-019-3276-5
10.1007/s00232-015-9811-z
10.1016/S0893-6080(05)80023-1
10.1504/IJDMB.2013.056078
10.1016/j.ygeno.2013.05.006
10.3390/molecules23061448
10.1098/rstb.2002.1081
10.1109/TCBB.2019.2930993
10.1016/j.jtbi.2010.10.037
10.1016/j.jtbi.2008.08.028
10.1016/j.future.2018.01.006
10.1093/bioinformatics/btz609
10.1109/TCBB.2016.2617337
10.1002/jcp.1030490103
10.1016/j.jtbi.2010.12.024
10.4238/gmr.15039013
10.1007/3-540-57868-4_57
10.1109/IJCNN.2001.1016716
10.1002/(SICI)1097-0134(199707)28:3<405::AID-PROT10>3.0.CO;2-L
10.1093/bioinformatics/btx302
10.1371/journal.pone.0195636
10.1007/s00232-016-9935-9
10.1109/CCIP.2015.7100687
10.1145/1015330.1015332
10.1093/nar/25.17.3389
10.3390/ijms13022196
10.1016/j.jtbi.2008.10.026
10.1016/j.jpdc.2017.08.009
10.1155/2017/9861752
10.1007/s13369-017-2738-1
10.1016/j.patcog.2007.08.016
10.3390/ijms160921191
10.3390/ijms161226237
10.1109/TCBB.2014.2351821
10.1016/j.neucom.2017.07.004
10.1142/S021972001950029X
10.1016/j.cplett.2012.02.030
10.1093/nar/gky113
10.1126/science.1205438
10.1016/j.bbrc.2007.06.027
10.1111/j.1399-3054.1997.tb04790.x
10.1016/j.jtbi.2014.04.006
10.1073/pnas.94.8.3485
ContentType Journal Article
Copyright The Author(s) 2021
COPYRIGHT 2021 BioMed Central Ltd.
2021. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: The Author(s) 2021
– notice: COPYRIGHT 2021 BioMed Central Ltd.
– notice: 2021. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID C6C
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISR
3V.
7QO
7SC
7X7
7XB
88E
8AL
8AO
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABUWG
AEUYN
AFKRA
ARAPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
JQ2
K7-
K9.
L7M
LK8
L~C
L~D
M0N
M0S
M1P
M7P
P5Z
P62
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
Q9U
7X8
5PM
DOA
DOI 10.1186/s12859-021-04251-z
DatabaseName Springer Nature OA Free Journals
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Science
ProQuest Central (Corporate)
Biotechnology Research Abstracts
Computer and Information Systems Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
Computing Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest One Sustainability
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
ProQuest Technology Collection
Natural Science Collection
ProQuest One
ProQuest Central Korea
Engineering Research Database
ProQuest Health & Medical Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
ProQuest Health & Medical Complete (Alumni)
Advanced Technologies Database with Aerospace
Biological Sciences
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Computing Database
ProQuest Health & Medical Collection
Medical Database
Biological Science Database (ProQuest)
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic (New)
ProQuest Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic (retired)
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest Central Basic
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
Computer Science Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
SciTech Premium Collection
ProQuest Central China
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
Health Research Premium Collection
Natural Science Collection
Health & Medical Research Collection
Biological Science Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Advanced Technologies & Aerospace Collection
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
ProQuest One Academic (New)
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest One Academic Middle East (New)
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central
ProQuest Health & Medical Research Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Advanced Technologies Database with Aerospace
ProQuest Computing
ProQuest Central Basic
ProQuest Computing (Alumni Edition)
ProQuest SciTech Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest Medical Library
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList
Publicly Available Content Database
MEDLINE
MEDLINE - Academic




Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: PIMPY
  name: Publicly Available Content Database
  url: http://search.proquest.com/publiccontent
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1471-2105
EndPage 21
ExternalDocumentID oai_doaj_org_article_379e1343326d4b76b5a785360aee3cf2
PMC8220696
A672260745
34162327
10_1186_s12859_021_04251_z
Genre Journal Article
GeographicLocations China
GeographicLocations_xml – name: China
GrantInformation_xml – fundername: Training Plan for Young and Middle-aged Academic Leaders of Yunnan Province
  grantid: 2018HB031
– fundername: National Natural Science Foundation of China
  grantid: 62062067; 11661081; 11971421
  funderid: http://dx.doi.org/10.13039/501100001809
– fundername: Natural Science Foundation of Yunnan Province
  grantid: 2017FA032
  funderid: http://dx.doi.org/10.13039/501100005273
– fundername: National Natural Science Foundation of China
  grantid: 11971421
– fundername: National Natural Science Foundation of China
  grantid: 62062067
– fundername: Natural Science Foundation of Yunnan Province
  grantid: 2017FA032
– fundername: National Natural Science Foundation of China
  grantid: 11661081
– fundername: ;
  grantid: 62062067; 11661081; 11971421
– fundername: ;
  grantid: 2018HB031
– fundername: ;
  grantid: 2017FA032
GroupedDBID ---
0R~
23N
2WC
53G
5VS
6J9
7X7
88E
8AO
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AAKPC
AASML
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADMLS
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
ARAPS
AZQEC
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CS3
DIK
DU5
DWQXO
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
ICD
IHR
INH
INR
ISR
ITC
K6V
K7-
KQ8
LK8
M1P
M48
M7P
MK~
ML0
M~E
O5R
O5S
OK1
OVT
P2P
P62
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PUEGO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XH6
XSB
AAYXX
AFFHD
CITATION
-A0
3V.
ACRMQ
ADINQ
ALIPV
C24
CGR
CUY
CVF
ECM
EIF
M0N
NPM
7QO
7SC
7XB
8AL
8FD
8FK
FR3
JQ2
K9.
L7M
L~C
L~D
P64
PKEHL
PQEST
PQUKI
PRINS
Q9U
7X8
5PM
ID FETCH-LOGICAL-c571t-a424204c14f41a74414563dde3a02b1e0eef5d4c7e4c24318bdedf82b5a18cc33
IEDL.DBID DOA
ISICitedReferencesCount 8
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000665002000001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1471-2105
IngestDate Fri Oct 03 12:45:24 EDT 2025
Tue Nov 04 01:46:36 EST 2025
Sun Nov 09 09:39:51 EST 2025
Mon Oct 06 18:39:01 EDT 2025
Tue Nov 11 10:28:23 EST 2025
Tue Nov 04 17:34:36 EST 2025
Thu Nov 13 14:37:55 EST 2025
Wed Feb 19 02:06:26 EST 2025
Sat Nov 29 05:40:10 EST 2025
Tue Nov 18 21:48:01 EST 2025
Sat Sep 06 07:27:38 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue Suppl 3
Keywords Ridge regression
Antifreeze proteins prediction
Lasso regression
Ensemble feature selection
Two-stage multiple regressions
Weighted general dipeptide composition
Language English
License Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c571t-a424204c14f41a74414563dde3a02b1e0eef5d4c7e4c24318bdedf82b5a18cc33
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0002-1927-8753
OpenAccessLink https://doaj.org/article/379e1343326d4b76b5a785360aee3cf2
PMID 34162327
PQID 2552805065
PQPubID 44065
PageCount 21
ParticipantIDs doaj_primary_oai_doaj_org_article_379e1343326d4b76b5a785360aee3cf2
pubmedcentral_primary_oai_pubmedcentral_nih_gov_8220696
proquest_miscellaneous_2544878428
proquest_journals_2552805065
gale_infotracmisc_A672260745
gale_infotracacademiconefile_A672260745
gale_incontextgauss_ISR_A672260745
pubmed_primary_34162327
crossref_primary_10_1186_s12859_021_04251_z
crossref_citationtrail_10_1186_s12859_021_04251_z
springer_journals_10_1186_s12859_021_04251_z
PublicationCentury 2000
PublicationDate 2021-06-23
PublicationDateYYYYMMDD 2021-06-23
PublicationDate_xml – month: 06
  year: 2021
  text: 2021-06-23
  day: 23
PublicationDecade 2020
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle BMC bioinformatics
PublicationTitleAbbrev BMC Bioinformatics
PublicationTitleAlternate BMC Bioinformatics
PublicationYear 2021
Publisher BioMed Central
BioMed Central Ltd
Springer Nature B.V
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: Springer Nature B.V
– name: BMC
References A Anand (4251_CR16) 2008; 253
LY Wei (4251_CR12) 2018; 117
X He (4251_CR30) 2015; 248
RB Huang (4251_CR14) 2009; 256
J Yan (4251_CR43) 2020; 20
SF Wang (4251_CR15) 2019; 81
4251_CR35
EL Sonnhammer (4251_CR45) 1997; 28
4251_CR39
P Agrawal (4251_CR48) 2016; 44
Y Runtao (4251_CR29) 2015; 16
A Nath (4251_CR34) 2018; 272
W Li (4251_CR47) 2006; 22
X Zhao (4251_CR27) 2012; 13
S Wang (4251_CR9) 2019; 20
Z Jian (4251_CR60) 2018; 23
K Kira (4251_CR51) 1992; 2
Y Yan (4251_CR41) 2018; 46
DH Wolpert (4251_CR53) 1992; 5
DN Reshef (4251_CR56) 2011; 334
JM Logsdon (4251_CR4) 1997; 94
MM Tab (4251_CR1) 2018; 43
F Yuan (4251_CR6) 2019; 17
HJ Yu (4251_CR10) 2012; 531
SF Altschul (4251_CR46) 1997; 25
AE Hoerl (4251_CR55) 1970; 12
RJ Tibshirani (4251_CR54) 1996; 73
S Wang (4251_CR11) 2020; 17
KC Chou (4251_CR58) 2011; 273
Q Jiang (4251_CR17) 2013; 8
S Basith (4251_CR42) 2020; 40
PL Davies (4251_CR5) 2002; 357
4251_CR13
4251_CR57
S Khan (4251_CR33) 2016; 15
DS Huang (4251_CR40) 2008; 41
S Mondal (4251_CR28) 2014; 356
KK Kandaswamy (4251_CR26) 2011; 270
S Lalwani (4251_CR38) 2019; 44
KC Chou (4251_CR44) 2007; 360
X Xiao (4251_CR31) 2016; 249
M Griffith (4251_CR2) 1997; 100
W Peng (4251_CR52) 2018; 82
JD Qiu (4251_CR18) 2009; 256
Z Wen (4251_CR19) 2020; 36
J Wang (4251_CR8) 2017; 33
S Wang (4251_CR22) 2017; 18
G Yu (4251_CR23) 2015; 12
SF Wang (4251_CR37) 2018; 13
PF Scholander (4251_CR3) 2010; 49
J Zahiri (4251_CR49) 2013; 102
JG Moreno-Torres (4251_CR59) 2012; 23
S Wang (4251_CR20) 2015; 16
4251_CR25
4251_CR24
X Wang (4251_CR50) 2014; 2014
H Lin (4251_CR21) 2007; 354
Q Zou (4251_CR36) 2020; 21
R Pratiwi (4251_CR32) 2017; 2017
SW Sun (4251_CR7) 2020; 19
References_xml – volume: 81
  start-page: 9
  year: 2019
  ident: 4251_CR15
  publication-title: Comput Biol Chem
  doi: 10.1016/j.compbiolchem.2019.107094
– volume: 22
  start-page: 1658
  issue: 13
  year: 2006
  ident: 4251_CR47
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btl158
– volume: 19
  start-page: 40
  issue: 1
  year: 2020
  ident: 4251_CR7
  publication-title: Brief Funct Genomics
  doi: 10.1093/bfgp/elz036
– volume: 354
  start-page: 1
  issue: 2
  year: 2007
  ident: 4251_CR21
  publication-title: Biochem Biophys Res Commun
  doi: 10.1016/j.bbrc.2007.01.011
– volume: 12
  start-page: 55
  issue: 1
  year: 1970
  ident: 4251_CR55
  publication-title: Technometrics
  doi: 10.1080/00401706.1970.10488634
– volume: 23
  start-page: 1304
  issue: 8
  year: 2012
  ident: 4251_CR59
  publication-title: IEEE Trans Neural Netw Learn Syst
  doi: 10.1109/TNNLS.2012.2199516
– volume: 44
  start-page: 2899
  issue: 4
  year: 2019
  ident: 4251_CR38
  publication-title: Arab J Sci Eng.
  doi: 10.1007/s13369-018-03713-6
– volume: 20
  start-page: 882
  year: 2020
  ident: 4251_CR43
  publication-title: Mol Ther-Nucleic Acids.
  doi: 10.1016/j.omtn.2020.05.006
– volume: 40
  start-page: 1276
  issue: 4
  year: 2020
  ident: 4251_CR42
  publication-title: Med Res Rev.
  doi: 10.1002/med.21658
– volume: 44
  start-page: D1098
  issue: D1
  year: 2016
  ident: 4251_CR48
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkv1266
– volume: 253
  start-page: 375
  issue: 2
  year: 2008
  ident: 4251_CR16
  publication-title: J Theor Biol
  doi: 10.1016/j.jtbi.2008.02.031
– volume: 18
  start-page: 2718
  issue: 12
  year: 2017
  ident: 4251_CR22
  publication-title: Int J Mol Sci
  doi: 10.3390/ijms18122718
– volume: 20
  start-page: 701
  issue: 25
  year: 2019
  ident: 4251_CR9
  publication-title: BMC Bioinform
  doi: 10.1186/s12859-019-3276-5
– volume: 248
  start-page: 1005
  issue: 6
  year: 2015
  ident: 4251_CR30
  publication-title: J Membr Biol
  doi: 10.1007/s00232-015-9811-z
– volume: 5
  start-page: 241
  issue: 2
  year: 1992
  ident: 4251_CR53
  publication-title: Neural Netw
  doi: 10.1016/S0893-6080(05)80023-1
– volume: 8
  start-page: 282
  issue: 3
  year: 2013
  ident: 4251_CR17
  publication-title: Int J Data Min Bioinform
  doi: 10.1504/IJDMB.2013.056078
– volume: 102
  start-page: 237
  issue: 4
  year: 2013
  ident: 4251_CR49
  publication-title: Genomics
  doi: 10.1016/j.ygeno.2013.05.006
– volume: 23
  start-page: 1448
  issue: 6
  year: 2018
  ident: 4251_CR60
  publication-title: Molecules
  doi: 10.3390/molecules23061448
– volume: 357
  start-page: 927
  issue: 1423
  year: 2002
  ident: 4251_CR5
  publication-title: Philos Trans R Soc Lond
  doi: 10.1098/rstb.2002.1081
– volume: 17
  start-page: 739
  issue: 3
  year: 2020
  ident: 4251_CR11
  publication-title: IEEE/ACM Trans Comput Biol Bioinf
  doi: 10.1109/TCBB.2019.2930993
– volume: 270
  start-page: 56
  issue: 1
  year: 2011
  ident: 4251_CR26
  publication-title: J Theor Biol
  doi: 10.1016/j.jtbi.2010.10.037
– ident: 4251_CR39
– volume: 256
  start-page: 428
  issue: 3
  year: 2009
  ident: 4251_CR14
  publication-title: J Theor Biol
  doi: 10.1016/j.jtbi.2008.08.028
– volume: 82
  start-page: 119
  year: 2018
  ident: 4251_CR52
  publication-title: Future Gen Comput Syst
  doi: 10.1016/j.future.2018.01.006
– volume: 36
  start-page: 478
  issue: 2
  year: 2020
  ident: 4251_CR19
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btz609
– volume: 15
  start-page: 244
  issue: 1
  year: 2016
  ident: 4251_CR33
  publication-title: IEEE/ACM Trans Comput Biol Bioinform
  doi: 10.1109/TCBB.2016.2617337
– volume: 2014
  start-page: 86
  year: 2014
  ident: 4251_CR50
  publication-title: IEEE
– volume: 49
  start-page: 5
  issue: 1
  year: 2010
  ident: 4251_CR3
  publication-title: J Cell Physiol
  doi: 10.1002/jcp.1030490103
– volume: 273
  start-page: 236
  issue: 1
  year: 2011
  ident: 4251_CR58
  publication-title: J Theor Biol
  doi: 10.1016/j.jtbi.2010.12.024
– ident: 4251_CR35
  doi: 10.4238/gmr.15039013
– ident: 4251_CR57
  doi: 10.1007/3-540-57868-4_57
– ident: 4251_CR13
  doi: 10.1109/IJCNN.2001.1016716
– volume: 28
  start-page: 405
  issue: 3
  year: 1997
  ident: 4251_CR45
  publication-title: Proteins
  doi: 10.1002/(SICI)1097-0134(199707)28:3<405::AID-PROT10>3.0.CO;2-L
– volume: 33
  start-page: 2756
  issue: 17
  year: 2017
  ident: 4251_CR8
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btx302
– volume: 13
  start-page: 0195636
  issue: 4
  year: 2018
  ident: 4251_CR37
  publication-title: PLoS ONE
  doi: 10.1371/journal.pone.0195636
– volume: 249
  start-page: 1
  issue: 6
  year: 2016
  ident: 4251_CR31
  publication-title: J Membr Biol
  doi: 10.1007/s00232-016-9935-9
– ident: 4251_CR25
  doi: 10.1109/CCIP.2015.7100687
– ident: 4251_CR24
  doi: 10.1145/1015330.1015332
– volume: 73
  start-page: 273
  issue: 1
  year: 1996
  ident: 4251_CR54
  publication-title: J R Stat Soc Ser B Methodol
– volume: 25
  start-page: 3389
  issue: 17
  year: 1997
  ident: 4251_CR46
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/25.17.3389
– volume: 21
  start-page: 1
  issue: 1
  year: 2020
  ident: 4251_CR36
  publication-title: Brief Bioinform
– volume: 13
  start-page: 2196
  issue: 12
  year: 2012
  ident: 4251_CR27
  publication-title: Int J Mol Sci
  doi: 10.3390/ijms13022196
– volume: 256
  start-page: 625
  issue: 4
  year: 2009
  ident: 4251_CR18
  publication-title: J Theor Biol
  doi: 10.1016/j.jtbi.2008.10.026
– volume: 117
  start-page: 212
  year: 2018
  ident: 4251_CR12
  publication-title: J Parallel Distrib Comput
  doi: 10.1016/j.jpdc.2017.08.009
– volume: 2
  start-page: 129
  year: 1992
  ident: 4251_CR51
  publication-title: Aaai
– volume: 2017
  start-page: 1
  year: 2017
  ident: 4251_CR32
  publication-title: J Chem.
  doi: 10.1155/2017/9861752
– volume: 43
  start-page: 133
  issue: 1
  year: 2018
  ident: 4251_CR1
  publication-title: Arab J Sci Eng.
  doi: 10.1007/s13369-017-2738-1
– volume: 41
  start-page: 1316
  issue: 4
  year: 2008
  ident: 4251_CR40
  publication-title: Pattern Recognit
  doi: 10.1016/j.patcog.2007.08.016
– volume: 16
  start-page: 21191
  issue: 9
  year: 2015
  ident: 4251_CR29
  publication-title: Int J Mol Sci
  doi: 10.3390/ijms160921191
– volume: 16
  start-page: 30343
  issue: 12
  year: 2015
  ident: 4251_CR20
  publication-title: Int J Mol Sci
  doi: 10.3390/ijms161226237
– volume: 12
  start-page: 219
  issue: 1
  year: 2015
  ident: 4251_CR23
  publication-title: IEEE/ACM Trans Comput Biol Bioinform
  doi: 10.1109/TCBB.2014.2351821
– volume: 272
  start-page: 294
  issue: 10
  year: 2018
  ident: 4251_CR34
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2017.07.004
– volume: 17
  start-page: 1950029
  issue: 4
  year: 2019
  ident: 4251_CR6
  publication-title: J Bioinform Comput Biol
  doi: 10.1142/S021972001950029X
– volume: 531
  start-page: 261
  year: 2012
  ident: 4251_CR10
  publication-title: Chem Phys Lett
  doi: 10.1016/j.cplett.2012.02.030
– volume: 46
  start-page: e56
  issue: 9
  year: 2018
  ident: 4251_CR41
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gky113
– volume: 334
  start-page: 1518
  issue: 6062
  year: 2011
  ident: 4251_CR56
  publication-title: Science
  doi: 10.1126/science.1205438
– volume: 360
  start-page: 1
  issue: 2
  year: 2007
  ident: 4251_CR44
  publication-title: Biochem Biophys Res Commun
  doi: 10.1016/j.bbrc.2007.06.027
– volume: 100
  start-page: 327
  issue: 2
  year: 1997
  ident: 4251_CR2
  publication-title: Physiol Plant
  doi: 10.1111/j.1399-3054.1997.tb04790.x
– volume: 356
  start-page: 30
  year: 2014
  ident: 4251_CR28
  publication-title: J Theor Biol
  doi: 10.1016/j.jtbi.2014.04.006
– volume: 94
  start-page: 3485
  issue: 8
  year: 1997
  ident: 4251_CR4
  publication-title: Proc Natl Acad Sci
  doi: 10.1073/pnas.94.8.3485
SSID ssj0017805
Score 2.400457
Snippet Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze...
Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is...
Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze...
Abstract Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 340
SubjectTerms Accuracy
Algorithms
Amino acids
Analysis
Antifreeze proteins
Antifreeze Proteins - genetics
Antifreeze proteins prediction
Bioinformatics
Biomedical and Life Sciences
Body fluids
Classification
Composition
Computational Biology/Bioinformatics
Computer Appl. in Life Sciences
Crystal growth
Crystals
Datasets
Decision trees
Dipeptides
Ensemble feature selection
Experiments
Feature extraction
Feature selection
Ice
Ice crystals
Lasso regression
Life Sciences
Machine learning
Membrane proteins
Membranes
Methods
Microarrays
Optimization techniques
Performance evaluation
Predictions
Principal components analysis
Proteins
Regression
Reproducibility of Results
Ridge regression
Structure-function relationships
Support Vector Machine
Support vector machines
Two-stage multiple regressions
Weighted general dipeptide composition
SummonAdditionalLinks – databaseName: Health & Medical Collection
  dbid: 7X7
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lj9MwELZgAYkL70dgQQYhcYBo4zxs94QWxAouqxUPaW-W44xLpSUtSQuiB347M47bJYvYC5eqip2qtsfjb-zP3zD2TDd2IhxGqpXCj1KSH3RSpJXN6XKaratqSDahDg_18fHkKG649ZFWufGJwVE3c0d75HsIfXOdVbhivlp8SylrFJ2uxhQaF9klSptNdq6OtwGXIL3-zUUZLfd6QWptKZESyFRFuh4tRkGz_2_P_MfSdJY2eebsNCxJB9f_tzE32LUIRvn-YD032QVob7ErQ3rKn7fZr6OOjnGIGM0tkYo6gDXwIO0wa3tOe7j8R9hbhYZPBwHr2Rq_N7MFsWUa4ERZj7ww_I2GBwJj2sF04N-23EOQFuV9SMhDTzCwhq_1Cdxhnw_efnrzLo35GlJXKbFMbYnrfVY6UfpSWIVAC9FZgf6zsFleC8gAfNWUTkHpcgQuum6g8TqvKyu0c0Vxl-208xbuMw7aelEIrzIPpUQMpLMibybel1A5rVTCxGbgjIti5pRT48SEoEZLMwy2wcE2YbDNOmEvtu8sBimPc2u_JnvY1iQZ7vBg3k1NnNWmUBMQBUnAyaaslcSWKMQ_MrMAhfN5wp6SNRkS2miJyTO1q7437z9-MPtSIfJFAFcl7Hms5OfYBmfjxQjsCdLmGtXcHdVET-DGxRtrM9ET9ebU1BL2ZFtMbxK7roX5iupgkK40RqIJuzfY-LbdiHIQIefY42pk_aOOGZe0sy9BpxyxZyYnMmEvN_Pk9G_9u-MfnN-Kh-xqHmawTPNil-0suxU8Ypfd9-Ws7x6H-f8bjtBjiQ
  priority: 102
  providerName: ProQuest
– databaseName: SpringerLink Contemporary Journals
  dbid: RSV
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELaggNQLb2igIIOQOEBEnIftPRZEBZeqooB6sxxnvI1UslWyC2IP_HZmnGQh5SHBZbWKJ6t4djz-Jv78mbEnurIz4bBSLRR-5JLyoJMiLmxKm9NsWRT9YRPq4EAfH88Oh01h3ch2H5ckQ6YOw1rLF50grbWYKAUUaCJeX2SXClKboRr96ONm7YBU-sftMb-9bzIFBaX-X_PxTxPSebLkuRXTMBHtX_u_LlxnVwfgyff6SLnBLkBzk13pj6L8eot9O2xpyYZI0NwSgagFWAMPMg5103F6X8u_hPeoUPF5L1Zdr_F7VZ8RM6YCTvT0gQOGv1HxQFaMW5j3XNuGewgyorwLh-_QFSyi4VN5CrfZh_3X71-9iYezGWJXKLGMbY5ze5I7kftcWIWgCpFYhrkys0laCkgAfFHlTkHuUgQpuqyg8jotCyu0c1l2h201iwZ2GAdtvciEV4mHXCLe0UmWVjPvcyicVipiYvy7jBuEy-n8jFMTChgtTe9Xg341wa9mHbFnm3vOetmOv1q_pCjYWJLkdriwaOdmGMEmUzMQGcm9ySovlcSeKMQ6MrEAmfNpxB5TDBkS1WiItTO3q64zb4_emT2pEOUiWCsi9nQw8gvsg7PDJgj0BOlwTSx3J5Y46t20eQxVM2SdzmB5mGLwI6qM2KNNM91JTLoGFiuywYJcaaw6I3a3j-xNvxHRIBpO0eNqEvMTx0xbmvokaJIjzkzkTEbs-Rj5Px7rz46_92_m99l2GgaPjNNsl20t2xU8YJfd52XdtQ9DFvgOfE9ajw
  priority: 102
  providerName: Springer Nature
Title Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble
URI https://link.springer.com/article/10.1186/s12859-021-04251-z
https://www.ncbi.nlm.nih.gov/pubmed/34162327
https://www.proquest.com/docview/2552805065
https://www.proquest.com/docview/2544878428
https://pubmed.ncbi.nlm.nih.gov/PMC8220696
https://doaj.org/article/379e1343326d4b76b5a785360aee3cf2
Volume 22
WOSCitedRecordID wos000665002000001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVADU
  databaseName: BioMedCentral
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: RBZ
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.biomedcentral.com/search/
  providerName: BioMedCentral
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: DOA
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: M~E
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVPQU
  databaseName: Advanced Technologies & Aerospace Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: P5Z
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/hightechjournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Biological Science Database (ProQuest)
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: M7P
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/biologicalscijournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Computer Science Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: K7-
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/compscijour
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: 7X7
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: BENPR
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Publicly Available Content Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: PIMPY
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/publiccontent
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: SpringerLINK Contemporary 1997-Present
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: RSV
  dateStart: 20001201
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELaggMQF8SZQVgEhcYCocR62c2xRKyrEKtoCWrhYjjNZIpVstdkFsQd-OzNOdmmKgAsXK4knUTIe2zPx528Ye6ZKk3GLkWoqsUgEjYNW8CA1EW1OM0Wadskm5HisptMsP5fqizBhHT1wp7i9WGbAY2LZEmVSSFGkRuIUI0IDENvKjb6hzDbBVL9-QEz9my0ySuy1nHjaAoIjkJHyYD2Yhhxb_-9j8rlJ6SJg8sKqqZuMjm6yG70X6e93b3-LXYLmNrvW5ZX8fof9yBe0_kKIZt8QGmgBsAbfcTLUTevTz1f_m_spCqU_65in6zUel_UZwVxK8Alr3gO68Bml75CHwQJmHXC28StwnKB-6zLp0BWMiOFLcQp32fujw3evXgd9ooXAppIvA5PgRB0mlidVwo1EDwndqhgHvtiEUcEhBKjSMrESEhuhx6GKEspKRdgWXFkbx_fYTjNv4AHzQZmKx7ySYQWJQOdFhXFUZlWVQGqVlB7jG71r27OQUzKMU-2iESV011Ya20q7ttJrj73Y3nPWcXD8VfqAmnMrSfzZ7gJale6tSv_Lqjz2lIxBE0NGQxCcmVm1rT4-meh9IdFlRc8r9djzXqia4zdY0-9oQE0QqdZAcncgiV3YDqs3Nqf7IaTVGOtFaMXoInrsybaa7iRYXAPzFclgdC0VhpAeu9-Z6Pa70T1B1zZCjcuB8Q4UM6xp6s-OYBydxlBkwmMvN2b-67X-rPiH_0Pxj9j1yHVTEUTxLttZLlbwmF21X5d1uxixy3IqXalG7MrB4TifjFzHx_KNDEaE3M2xzNNPWJ8fv80_4tnk5MNPp-9d7g
linkProvider Directory of Open Access Journals
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1Lb9NAEF5VLQguvB-GAgaBOIBV7_qxmwNC5VE1ahtFUKRyWuz1OEQqTrATqubAT-I3MrO2U1xEbz1wiSJ7HHk389z99hvGnqos6XGDlWok8SOMyQ-amHtRIuhwWpJGUd1sQg4G6uCgN1xhv9qzMASrbH2iddTZxNAa-QamvkL5EUbM19PvHnWNot3VtoVGrRY7cHyEJVv1qv8O_99nQmy933-77TVdBTwTST7zkhCjkh8aHuYhTySmA5hDBGjlQeKLlIMPkEdZaCSERmB4VWkGWa5EGiVcGUMLoOjy10JUdn-VrQ37e8PPy30L6hDQHs1R8UbFiR_OIxgEGQf3Fp3wZ7sE_B0L_giGp4Gap3ZrbRDcuvq_Td81dqVJt93N2j6usxUobrCLdQPO45vs57CkjSqCfrsJwaZKgAW4lrxiXFQurVK7R3b1GDJ3VFN0jxf4PRtPCQ-UgUug_Ab5hr-RuRai6ZUwqhHGhZuDJU91K9tyiK5AUcG39BBusU_nMvjbbLWYFHCXuaCSnAc8l34OYYxZnvIDkfXyPITIKCkdxltF0aaha6euIYfalm0q1rVyaVQubZVLLxz2YvnMtCYrOVP6DenfUpKIxu2FSTnSjd_SgewBD4jkLs7CVMY4EokZXuwnAIHJhcOekPZqohIpCKs0SuZVpfsfP-jNWGJujylq5LDnjVA-wTGYpDn6gTNB7GMdyfWOJPo6073dardufG2lT1TbYY-Xt-lJwg8WMJmTTIiVucJa22F3aptajhvzOKwBBM647FhbZ2K6d4rxV8vEjtm1H_dih71s7fLktf498ffOHsUjdml7f29X7_YHO_fZZWG9R-yJYJ2tzso5PGAXzI_ZuCofNt7HZV_O22J_AymEwmc
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lj9MwELZgeYgLb5bAAgEhcYBo48Sx0-PyqFiBqmoX0N4sxxmXSrtplbQgeuC3M-OkZbM8JMSlquJJFU_H42_iz58Ze5qXZsAtVqqZwg8hKQ9ayaPMJLQ5zRRZ1h42oUaj_OhoMD61i9-z3ddLku2eBlJpqha789K1QzyXuw0n3bWI6AUUdDxanWcXBFYyROo6OPy0WUcgxf71Vpnf3tebjrxq_6-5-dTkdJY4eWb11E9Kw2v_353r7GoHSMO9NoJusHNQ3WSX2iMqv91i38c1LeUQOTo0RCyqAVYQenmHadWE9B43_Orfr0IZTloR6-kKv5fTOTFmSgiJtt5xw_A3ytCTGKMaJi0HtwodeHnRsPGH8tAVLK7hpDiG2-zj8M2HV2-j7syGyGaKLyIjcM6PheXCCW4Ugi1EaCnm0NTEScEhBnBZKawCYRMEL3lRQunypMgMz61N0ztsq5pVcJeFkBvHU-5U7EBIxEF5nCblwDkBmc2VChhf_3XadoLmdK7GsfaFTS5161eNftXer3oVsOebe-atnMdfrV9SRGwsSYrbX5jVE92NbJ2qAfCUZOBkKQolsScKMZCMDUBqXRKwJxRPmsQ2KmLzTMyyafT-4YHekwrRL4K4LGDPOiM3wz5Y022OQE-QPlfPcqdnidnA9pvXYau7bNRoLBsTHAiINgP2eNNMdxLDroLZkmywUFc5VqMB226jfNNvRDqIkhP0uOrFf88x_ZZq-tlrlSP-jOVABuzFehT8fKw_O_7ev5k_YpfHr4f6_f7o3X12JfHjSEZJusO2FvUSHrCL9sti2tQPfXL4Aa09Zlc
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Predicting+antifreeze+proteins+with+weighted+generalized+dipeptide+composition+and+multi-regression+feature+selection+ensemble&rft.jtitle=BMC+bioinformatics&rft.au=Wang%2C+Shunfang&rft.au=Deng%2C+Lin&rft.au=Xia%2C+Xinnan&rft.au=Cao%2C+Zicheng&rft.date=2021-06-23&rft.pub=BioMed+Central&rft.eissn=1471-2105&rft.volume=22&rft.issue=Suppl+3&rft_id=info:doi/10.1186%2Fs12859-021-04251-z&rft_id=info%3Apmid%2F34162327&rft.externalDocID=PMC8220696
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon