Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble
Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences featu...
Uloženo v:
| Vydáno v: | BMC bioinformatics Ročník 22; číslo Suppl 3; s. 340 - 21 |
|---|---|
| Hlavní autoři: | , , , , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
London
BioMed Central
23.06.2021
BioMed Central Ltd Springer Nature B.V BMC |
| Témata: | |
| ISSN: | 1471-2105, 1471-2105 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Background
Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance.
Results
In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC.
Conclusion
The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. |
|---|---|
| AbstractList | Abstract Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance.BACKGROUNDAntifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance.In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC.RESULTSIn this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC.The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent.CONCLUSIONThe experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. Keywords: Antifreeze proteins prediction, Weighted general dipeptide composition, Lasso regression, Ridge regression, Ensemble feature selection, Two-stage multiple regressions Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is vital to the survival of living organisms in extremely cold environments. However, little research is performed on sequences feature extraction and selection for antifreeze proteins classification in the structure and function prediction, which is of great significance. Results In this paper, to predict the antifreeze proteins, a feature representation of weighted generalized dipeptide composition (W-GDipC) and an ensemble feature selection based on two-stage and multi-regression method (LRMR-Ri) are proposed. Specifically, four feature selection algorithms: Lasso regression, Ridge regression, Maximal information coefficient and Relief are used to select the feature sets, respectively, which is the first stage of LRMR-Ri method. If there exists a common feature subset among the above four sets, it is the optimal subset; otherwise we use Ridge regression to select the optimal subset from the public set pooled by the four sets, which is the second stage of LRMR-Ri. The LRMR-Ri method combined with W-GDipC was performed both on the antifreeze proteins dataset (binary classification), and on the membrane protein dataset (multiple classification). Experimental results show that this method has good performance in support vector machine (SVM), decision tree (DT) and stochastic gradient descent (SGD). The values of ACC, RE and MCC of LRMR-Ri and W-GDipC with antifreeze proteins dataset and SVM classifier have reached as high as 95.56%, 97.06% and 0.9105, respectively, much higher than those of each single method: Lasso, Ridge, Mic and Relief, nearly 13% higher than single Lasso for ACC. Conclusion The experimental results show that the proposed LRMR-Ri and W-GDipC method can significantly improve the accuracy of antifreeze proteins prediction compared with other similar single feature methods. In addition, our method has also achieved good results in the classification and prediction of membrane proteins, which verifies its widely reliability to a certain extent. |
| ArticleNumber | 340 |
| Audience | Academic |
| Author | Wang, Shunfang Xia, Xinnan Deng, Lin Fei, Yu Cao, Zicheng |
| Author_xml | – sequence: 1 givenname: Shunfang orcidid: 0000-0002-1927-8753 surname: Wang fullname: Wang, Shunfang email: sfwang_66@ynu.edu.cn organization: Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University – sequence: 2 givenname: Lin surname: Deng fullname: Deng, Lin organization: Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University – sequence: 3 givenname: Xinnan surname: Xia fullname: Xia, Xinnan email: xiaxinnan1@163.com organization: Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University – sequence: 4 givenname: Zicheng surname: Cao fullname: Cao, Zicheng organization: School of Public Health (Shenzhen), Sun Yat-Sen University – sequence: 5 givenname: Yu surname: Fei fullname: Fei, Yu email: feiyu@ynufe.edu.cn organization: School of Statistics and Mathematics, Yunnan University of Finance and Economics |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/34162327$$D View this record in MEDLINE/PubMed |
| BookMark | eNp9kktv1DAUhSNURB_wB1igSGxgkeJn4tkgVRWPkSqBeKwtx77JeJTYwXYozILfjqdT2k6FqixsXX_nODk5x8WB8w6K4jlGpxiL-k3ERPBFhQiuECMcV5tHxRFmDa4IRvzgzv6wOI5xjRBuBOJPikPKcE0oaY6KP58DGKuTdX2pXLJdANhAOQWfwLpYXtq0Ki_B9qsEpuzBQVCD3eS9sRNMyRootR8nH22y3mUPU47zkGwVoA8Q43bYgUpzgDLCAPoKAxdhbAd4Wjzu1BDh2fV6Unx__-7b-cfq4tOH5fnZRaV5g1OlGGEEMY1Zx7BqGMOM19QYoAqRFgMC6LhhugGmCaNYtAZMJ0jLFRZaU3pSLHe-xqu1nIIdVfgtvbLyauBDL1VIVg8gabMATBmlpDasbers0QhOa6QAqO5I9nq785rmdgSjwaWcyZ7p_omzK9n7n1IQgupFnQ1eXRsE_2OGmORoo4ZhUA78HCXhjIlGMCIy-vIeuvZzcDmqTHGS_yaq-S3Vq_wB1nU-36u3pvKsbgipUcO21Ol_qPwYGK3O1epsnu8JXu8JMpPgV-rVHKNcfv2yz764G8pNGv-algGyA3TwMQbobhCM5LbOcldnmessr-osN1kk7om0TWrboPzqdnhYSnfSmO9xPYTb5B5Q_QWYJAsJ |
| CitedBy_id | crossref_primary_10_1093_bib_bbaf026 crossref_primary_10_1242_jeb_243409 crossref_primary_10_1186_s12986_025_00917_0 crossref_primary_10_1016_j_compbiomed_2024_108534 crossref_primary_10_1016_j_isci_2025_112077 crossref_primary_10_1109_TCBB_2024_3467261 crossref_primary_10_1016_j_biotechadv_2025_108545 crossref_primary_10_1016_j_compbiolchem_2022_107680 |
| Cites_doi | 10.1016/j.compbiolchem.2019.107094 10.1093/bioinformatics/btl158 10.1093/bfgp/elz036 10.1016/j.bbrc.2007.01.011 10.1080/00401706.1970.10488634 10.1109/TNNLS.2012.2199516 10.1007/s13369-018-03713-6 10.1016/j.omtn.2020.05.006 10.1002/med.21658 10.1093/nar/gkv1266 10.1016/j.jtbi.2008.02.031 10.3390/ijms18122718 10.1186/s12859-019-3276-5 10.1007/s00232-015-9811-z 10.1016/S0893-6080(05)80023-1 10.1504/IJDMB.2013.056078 10.1016/j.ygeno.2013.05.006 10.3390/molecules23061448 10.1098/rstb.2002.1081 10.1109/TCBB.2019.2930993 10.1016/j.jtbi.2010.10.037 10.1016/j.jtbi.2008.08.028 10.1016/j.future.2018.01.006 10.1093/bioinformatics/btz609 10.1109/TCBB.2016.2617337 10.1002/jcp.1030490103 10.1016/j.jtbi.2010.12.024 10.4238/gmr.15039013 10.1007/3-540-57868-4_57 10.1109/IJCNN.2001.1016716 10.1002/(SICI)1097-0134(199707)28:3<405::AID-PROT10>3.0.CO;2-L 10.1093/bioinformatics/btx302 10.1371/journal.pone.0195636 10.1007/s00232-016-9935-9 10.1109/CCIP.2015.7100687 10.1145/1015330.1015332 10.1093/nar/25.17.3389 10.3390/ijms13022196 10.1016/j.jtbi.2008.10.026 10.1016/j.jpdc.2017.08.009 10.1155/2017/9861752 10.1007/s13369-017-2738-1 10.1016/j.patcog.2007.08.016 10.3390/ijms160921191 10.3390/ijms161226237 10.1109/TCBB.2014.2351821 10.1016/j.neucom.2017.07.004 10.1142/S021972001950029X 10.1016/j.cplett.2012.02.030 10.1093/nar/gky113 10.1126/science.1205438 10.1016/j.bbrc.2007.06.027 10.1111/j.1399-3054.1997.tb04790.x 10.1016/j.jtbi.2014.04.006 10.1073/pnas.94.8.3485 |
| ContentType | Journal Article |
| Copyright | The Author(s) 2021 COPYRIGHT 2021 BioMed Central Ltd. 2021. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
| Copyright_xml | – notice: The Author(s) 2021 – notice: COPYRIGHT 2021 BioMed Central Ltd. – notice: 2021. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
| DBID | C6C AAYXX CITATION CGR CUY CVF ECM EIF NPM ISR 3V. 7QO 7SC 7X7 7XB 88E 8AL 8AO 8FD 8FE 8FG 8FH 8FI 8FJ 8FK ABUWG AEUYN AFKRA ARAPS AZQEC BBNVY BENPR BGLVJ BHPHI CCPQU DWQXO FR3 FYUFA GHDGH GNUQQ HCIFZ JQ2 K7- K9. L7M LK8 L~C L~D M0N M0S M1P M7P P5Z P62 P64 PHGZM PHGZT PIMPY PJZUB PKEHL PPXIY PQEST PQGLB PQQKQ PQUKI PRINS Q9U 7X8 5PM DOA |
| DOI | 10.1186/s12859-021-04251-z |
| DatabaseName | Springer Nature OA Free Journals CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed Gale In Context: Science ProQuest Central (Corporate) Biotechnology Research Abstracts Computer and Information Systems Abstracts Health & Medical Collection ProQuest Central (purchase pre-March 2016) Medical Database (Alumni Edition) Computing Database (Alumni Edition) ProQuest Pharma Collection Technology Research Database ProQuest SciTech Collection ProQuest Technology Collection ProQuest Natural Science Collection Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest One Sustainability ProQuest Central UK/Ireland Advanced Technologies & Computer Science Collection ProQuest Central Essentials Biological Science Collection ProQuest Central ProQuest Technology Collection Natural Science Collection ProQuest One ProQuest Central Korea Engineering Research Database ProQuest Health & Medical Collection Health Research Premium Collection (Alumni) ProQuest Central Student SciTech Premium Collection ProQuest Computer Science Collection Computer Science Database ProQuest Health & Medical Complete (Alumni) Advanced Technologies Database with Aerospace Biological Sciences Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional Computing Database ProQuest Health & Medical Collection Medical Database Biological Science Database (ProQuest) Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection Biotechnology and BioEngineering Abstracts ProQuest Central Premium ProQuest One Academic (New) ProQuest Publicly Available Content Database ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic (retired) ProQuest One Academic UKI Edition ProQuest Central China ProQuest Central Basic MEDLINE - Academic PubMed Central (Full Participant titles) DOAJ Directory of Open Access Journals |
| DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Publicly Available Content Database Computer Science Database ProQuest Central Student ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection Computer and Information Systems Abstracts SciTech Premium Collection ProQuest Central China ProQuest One Applied & Life Sciences ProQuest One Sustainability Health Research Premium Collection Natural Science Collection Health & Medical Research Collection Biological Science Collection ProQuest Central (New) ProQuest Medical Library (Alumni) Advanced Technologies & Aerospace Collection ProQuest Biological Science Collection ProQuest One Academic Eastern Edition ProQuest Hospital Collection ProQuest Technology Collection Health Research Premium Collection (Alumni) Biological Science Database ProQuest Hospital Collection (Alumni) Biotechnology and BioEngineering Abstracts ProQuest Health & Medical Complete ProQuest One Academic UKI Edition Engineering Research Database ProQuest One Academic ProQuest One Academic (New) Technology Collection Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest One Academic Middle East (New) ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Natural Science Collection ProQuest Pharma Collection ProQuest Central ProQuest Health & Medical Research Collection Biotechnology Research Abstracts Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Advanced Technologies Database with Aerospace ProQuest Computing ProQuest Central Basic ProQuest Computing (Alumni Edition) ProQuest SciTech Collection Computer and Information Systems Abstracts Professional Advanced Technologies & Aerospace Database ProQuest Medical Library ProQuest Central (Alumni) MEDLINE - Academic |
| DatabaseTitleList | Publicly Available Content Database MEDLINE MEDLINE - Academic |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: PIMPY name: Publicly Available Content Database url: http://search.proquest.com/publiccontent sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Biology |
| EISSN | 1471-2105 |
| EndPage | 21 |
| ExternalDocumentID | oai_doaj_org_article_379e1343326d4b76b5a785360aee3cf2 PMC8220696 A672260745 34162327 10_1186_s12859_021_04251_z |
| Genre | Journal Article |
| GeographicLocations | China |
| GeographicLocations_xml | – name: China |
| GrantInformation_xml | – fundername: Training Plan for Young and Middle-aged Academic Leaders of Yunnan Province grantid: 2018HB031 – fundername: National Natural Science Foundation of China grantid: 62062067; 11661081; 11971421 funderid: http://dx.doi.org/10.13039/501100001809 – fundername: Natural Science Foundation of Yunnan Province grantid: 2017FA032 funderid: http://dx.doi.org/10.13039/501100005273 – fundername: National Natural Science Foundation of China grantid: 11971421 – fundername: National Natural Science Foundation of China grantid: 62062067 – fundername: Natural Science Foundation of Yunnan Province grantid: 2017FA032 – fundername: National Natural Science Foundation of China grantid: 11661081 – fundername: ; grantid: 62062067; 11661081; 11971421 – fundername: ; grantid: 2018HB031 – fundername: ; grantid: 2017FA032 |
| GroupedDBID | --- 0R~ 23N 2WC 53G 5VS 6J9 7X7 88E 8AO 8FE 8FG 8FH 8FI 8FJ AAFWJ AAJSJ AAKPC AASML ABDBF ABUWG ACGFO ACGFS ACIHN ACIWK ACPRK ACUHS ADBBV ADMLS ADUKV AEAQA AENEX AEUYN AFKRA AFPKN AFRAH AHBYD AHMBA AHYZX ALMA_UNASSIGNED_HOLDINGS AMKLP AMTXH AOIJS ARAPS AZQEC BAPOH BAWUL BBNVY BCNDV BENPR BFQNJ BGLVJ BHPHI BMC BPHCQ BVXVI C6C CCPQU CS3 DIK DU5 DWQXO E3Z EAD EAP EAS EBD EBLON EBS EMB EMK EMOBN ESX F5P FYUFA GNUQQ GROUPED_DOAJ GX1 HCIFZ HMCUK HYE IAO ICD IHR INH INR ISR ITC K6V K7- KQ8 LK8 M1P M48 M7P MK~ ML0 M~E O5R O5S OK1 OVT P2P P62 PGMZT PHGZM PHGZT PIMPY PJZUB PPXIY PQGLB PQQKQ PROAC PSQYO PUEGO RBZ RNS ROL RPM RSV SBL SOJ SV3 TR2 TUS UKHRP W2D WOQ WOW XH6 XSB AAYXX AFFHD CITATION -A0 3V. ACRMQ ADINQ ALIPV C24 CGR CUY CVF ECM EIF M0N NPM 7QO 7SC 7XB 8AL 8FD 8FK FR3 JQ2 K9. L7M L~C L~D P64 PKEHL PQEST PQUKI PRINS Q9U 7X8 5PM |
| ID | FETCH-LOGICAL-c571t-a424204c14f41a74414563dde3a02b1e0eef5d4c7e4c24318bdedf82b5a18cc33 |
| IEDL.DBID | DOA |
| ISICitedReferencesCount | 8 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000665002000001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1471-2105 |
| IngestDate | Fri Oct 03 12:45:24 EDT 2025 Tue Nov 04 01:46:36 EST 2025 Sun Nov 09 09:39:51 EST 2025 Mon Oct 06 18:39:01 EDT 2025 Tue Nov 11 10:28:23 EST 2025 Tue Nov 04 17:34:36 EST 2025 Thu Nov 13 14:37:55 EST 2025 Wed Feb 19 02:06:26 EST 2025 Sat Nov 29 05:40:10 EST 2025 Tue Nov 18 21:48:01 EST 2025 Sat Sep 06 07:27:38 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | Suppl 3 |
| Keywords | Ridge regression Antifreeze proteins prediction Lasso regression Ensemble feature selection Two-stage multiple regressions Weighted general dipeptide composition |
| Language | English |
| License | Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c571t-a424204c14f41a74414563dde3a02b1e0eef5d4c7e4c24318bdedf82b5a18cc33 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
| ORCID | 0000-0002-1927-8753 |
| OpenAccessLink | https://doaj.org/article/379e1343326d4b76b5a785360aee3cf2 |
| PMID | 34162327 |
| PQID | 2552805065 |
| PQPubID | 44065 |
| PageCount | 21 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_379e1343326d4b76b5a785360aee3cf2 pubmedcentral_primary_oai_pubmedcentral_nih_gov_8220696 proquest_miscellaneous_2544878428 proquest_journals_2552805065 gale_infotracmisc_A672260745 gale_infotracacademiconefile_A672260745 gale_incontextgauss_ISR_A672260745 pubmed_primary_34162327 crossref_primary_10_1186_s12859_021_04251_z crossref_citationtrail_10_1186_s12859_021_04251_z springer_journals_10_1186_s12859_021_04251_z |
| PublicationCentury | 2000 |
| PublicationDate | 2021-06-23 |
| PublicationDateYYYYMMDD | 2021-06-23 |
| PublicationDate_xml | – month: 06 year: 2021 text: 2021-06-23 day: 23 |
| PublicationDecade | 2020 |
| PublicationPlace | London |
| PublicationPlace_xml | – name: London – name: England |
| PublicationTitle | BMC bioinformatics |
| PublicationTitleAbbrev | BMC Bioinformatics |
| PublicationTitleAlternate | BMC Bioinformatics |
| PublicationYear | 2021 |
| Publisher | BioMed Central BioMed Central Ltd Springer Nature B.V BMC |
| Publisher_xml | – name: BioMed Central – name: BioMed Central Ltd – name: Springer Nature B.V – name: BMC |
| References | A Anand (4251_CR16) 2008; 253 LY Wei (4251_CR12) 2018; 117 X He (4251_CR30) 2015; 248 RB Huang (4251_CR14) 2009; 256 J Yan (4251_CR43) 2020; 20 SF Wang (4251_CR15) 2019; 81 4251_CR35 EL Sonnhammer (4251_CR45) 1997; 28 4251_CR39 P Agrawal (4251_CR48) 2016; 44 Y Runtao (4251_CR29) 2015; 16 A Nath (4251_CR34) 2018; 272 W Li (4251_CR47) 2006; 22 X Zhao (4251_CR27) 2012; 13 S Wang (4251_CR9) 2019; 20 Z Jian (4251_CR60) 2018; 23 K Kira (4251_CR51) 1992; 2 Y Yan (4251_CR41) 2018; 46 DH Wolpert (4251_CR53) 1992; 5 DN Reshef (4251_CR56) 2011; 334 JM Logsdon (4251_CR4) 1997; 94 MM Tab (4251_CR1) 2018; 43 F Yuan (4251_CR6) 2019; 17 HJ Yu (4251_CR10) 2012; 531 SF Altschul (4251_CR46) 1997; 25 AE Hoerl (4251_CR55) 1970; 12 RJ Tibshirani (4251_CR54) 1996; 73 S Wang (4251_CR11) 2020; 17 KC Chou (4251_CR58) 2011; 273 Q Jiang (4251_CR17) 2013; 8 S Basith (4251_CR42) 2020; 40 PL Davies (4251_CR5) 2002; 357 4251_CR13 4251_CR57 S Khan (4251_CR33) 2016; 15 DS Huang (4251_CR40) 2008; 41 S Mondal (4251_CR28) 2014; 356 KK Kandaswamy (4251_CR26) 2011; 270 S Lalwani (4251_CR38) 2019; 44 KC Chou (4251_CR44) 2007; 360 X Xiao (4251_CR31) 2016; 249 M Griffith (4251_CR2) 1997; 100 W Peng (4251_CR52) 2018; 82 JD Qiu (4251_CR18) 2009; 256 Z Wen (4251_CR19) 2020; 36 J Wang (4251_CR8) 2017; 33 S Wang (4251_CR22) 2017; 18 G Yu (4251_CR23) 2015; 12 SF Wang (4251_CR37) 2018; 13 PF Scholander (4251_CR3) 2010; 49 J Zahiri (4251_CR49) 2013; 102 JG Moreno-Torres (4251_CR59) 2012; 23 S Wang (4251_CR20) 2015; 16 4251_CR25 4251_CR24 X Wang (4251_CR50) 2014; 2014 H Lin (4251_CR21) 2007; 354 Q Zou (4251_CR36) 2020; 21 R Pratiwi (4251_CR32) 2017; 2017 SW Sun (4251_CR7) 2020; 19 |
| References_xml | – volume: 81 start-page: 9 year: 2019 ident: 4251_CR15 publication-title: Comput Biol Chem doi: 10.1016/j.compbiolchem.2019.107094 – volume: 22 start-page: 1658 issue: 13 year: 2006 ident: 4251_CR47 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btl158 – volume: 19 start-page: 40 issue: 1 year: 2020 ident: 4251_CR7 publication-title: Brief Funct Genomics doi: 10.1093/bfgp/elz036 – volume: 354 start-page: 1 issue: 2 year: 2007 ident: 4251_CR21 publication-title: Biochem Biophys Res Commun doi: 10.1016/j.bbrc.2007.01.011 – volume: 12 start-page: 55 issue: 1 year: 1970 ident: 4251_CR55 publication-title: Technometrics doi: 10.1080/00401706.1970.10488634 – volume: 23 start-page: 1304 issue: 8 year: 2012 ident: 4251_CR59 publication-title: IEEE Trans Neural Netw Learn Syst doi: 10.1109/TNNLS.2012.2199516 – volume: 44 start-page: 2899 issue: 4 year: 2019 ident: 4251_CR38 publication-title: Arab J Sci Eng. doi: 10.1007/s13369-018-03713-6 – volume: 20 start-page: 882 year: 2020 ident: 4251_CR43 publication-title: Mol Ther-Nucleic Acids. doi: 10.1016/j.omtn.2020.05.006 – volume: 40 start-page: 1276 issue: 4 year: 2020 ident: 4251_CR42 publication-title: Med Res Rev. doi: 10.1002/med.21658 – volume: 44 start-page: D1098 issue: D1 year: 2016 ident: 4251_CR48 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkv1266 – volume: 253 start-page: 375 issue: 2 year: 2008 ident: 4251_CR16 publication-title: J Theor Biol doi: 10.1016/j.jtbi.2008.02.031 – volume: 18 start-page: 2718 issue: 12 year: 2017 ident: 4251_CR22 publication-title: Int J Mol Sci doi: 10.3390/ijms18122718 – volume: 20 start-page: 701 issue: 25 year: 2019 ident: 4251_CR9 publication-title: BMC Bioinform doi: 10.1186/s12859-019-3276-5 – volume: 248 start-page: 1005 issue: 6 year: 2015 ident: 4251_CR30 publication-title: J Membr Biol doi: 10.1007/s00232-015-9811-z – volume: 5 start-page: 241 issue: 2 year: 1992 ident: 4251_CR53 publication-title: Neural Netw doi: 10.1016/S0893-6080(05)80023-1 – volume: 8 start-page: 282 issue: 3 year: 2013 ident: 4251_CR17 publication-title: Int J Data Min Bioinform doi: 10.1504/IJDMB.2013.056078 – volume: 102 start-page: 237 issue: 4 year: 2013 ident: 4251_CR49 publication-title: Genomics doi: 10.1016/j.ygeno.2013.05.006 – volume: 23 start-page: 1448 issue: 6 year: 2018 ident: 4251_CR60 publication-title: Molecules doi: 10.3390/molecules23061448 – volume: 357 start-page: 927 issue: 1423 year: 2002 ident: 4251_CR5 publication-title: Philos Trans R Soc Lond doi: 10.1098/rstb.2002.1081 – volume: 17 start-page: 739 issue: 3 year: 2020 ident: 4251_CR11 publication-title: IEEE/ACM Trans Comput Biol Bioinf doi: 10.1109/TCBB.2019.2930993 – volume: 270 start-page: 56 issue: 1 year: 2011 ident: 4251_CR26 publication-title: J Theor Biol doi: 10.1016/j.jtbi.2010.10.037 – ident: 4251_CR39 – volume: 256 start-page: 428 issue: 3 year: 2009 ident: 4251_CR14 publication-title: J Theor Biol doi: 10.1016/j.jtbi.2008.08.028 – volume: 82 start-page: 119 year: 2018 ident: 4251_CR52 publication-title: Future Gen Comput Syst doi: 10.1016/j.future.2018.01.006 – volume: 36 start-page: 478 issue: 2 year: 2020 ident: 4251_CR19 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btz609 – volume: 15 start-page: 244 issue: 1 year: 2016 ident: 4251_CR33 publication-title: IEEE/ACM Trans Comput Biol Bioinform doi: 10.1109/TCBB.2016.2617337 – volume: 2014 start-page: 86 year: 2014 ident: 4251_CR50 publication-title: IEEE – volume: 49 start-page: 5 issue: 1 year: 2010 ident: 4251_CR3 publication-title: J Cell Physiol doi: 10.1002/jcp.1030490103 – volume: 273 start-page: 236 issue: 1 year: 2011 ident: 4251_CR58 publication-title: J Theor Biol doi: 10.1016/j.jtbi.2010.12.024 – ident: 4251_CR35 doi: 10.4238/gmr.15039013 – ident: 4251_CR57 doi: 10.1007/3-540-57868-4_57 – ident: 4251_CR13 doi: 10.1109/IJCNN.2001.1016716 – volume: 28 start-page: 405 issue: 3 year: 1997 ident: 4251_CR45 publication-title: Proteins doi: 10.1002/(SICI)1097-0134(199707)28:3<405::AID-PROT10>3.0.CO;2-L – volume: 33 start-page: 2756 issue: 17 year: 2017 ident: 4251_CR8 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btx302 – volume: 13 start-page: 0195636 issue: 4 year: 2018 ident: 4251_CR37 publication-title: PLoS ONE doi: 10.1371/journal.pone.0195636 – volume: 249 start-page: 1 issue: 6 year: 2016 ident: 4251_CR31 publication-title: J Membr Biol doi: 10.1007/s00232-016-9935-9 – ident: 4251_CR25 doi: 10.1109/CCIP.2015.7100687 – ident: 4251_CR24 doi: 10.1145/1015330.1015332 – volume: 73 start-page: 273 issue: 1 year: 1996 ident: 4251_CR54 publication-title: J R Stat Soc Ser B Methodol – volume: 25 start-page: 3389 issue: 17 year: 1997 ident: 4251_CR46 publication-title: Nucleic Acids Res doi: 10.1093/nar/25.17.3389 – volume: 21 start-page: 1 issue: 1 year: 2020 ident: 4251_CR36 publication-title: Brief Bioinform – volume: 13 start-page: 2196 issue: 12 year: 2012 ident: 4251_CR27 publication-title: Int J Mol Sci doi: 10.3390/ijms13022196 – volume: 256 start-page: 625 issue: 4 year: 2009 ident: 4251_CR18 publication-title: J Theor Biol doi: 10.1016/j.jtbi.2008.10.026 – volume: 117 start-page: 212 year: 2018 ident: 4251_CR12 publication-title: J Parallel Distrib Comput doi: 10.1016/j.jpdc.2017.08.009 – volume: 2 start-page: 129 year: 1992 ident: 4251_CR51 publication-title: Aaai – volume: 2017 start-page: 1 year: 2017 ident: 4251_CR32 publication-title: J Chem. doi: 10.1155/2017/9861752 – volume: 43 start-page: 133 issue: 1 year: 2018 ident: 4251_CR1 publication-title: Arab J Sci Eng. doi: 10.1007/s13369-017-2738-1 – volume: 41 start-page: 1316 issue: 4 year: 2008 ident: 4251_CR40 publication-title: Pattern Recognit doi: 10.1016/j.patcog.2007.08.016 – volume: 16 start-page: 21191 issue: 9 year: 2015 ident: 4251_CR29 publication-title: Int J Mol Sci doi: 10.3390/ijms160921191 – volume: 16 start-page: 30343 issue: 12 year: 2015 ident: 4251_CR20 publication-title: Int J Mol Sci doi: 10.3390/ijms161226237 – volume: 12 start-page: 219 issue: 1 year: 2015 ident: 4251_CR23 publication-title: IEEE/ACM Trans Comput Biol Bioinform doi: 10.1109/TCBB.2014.2351821 – volume: 272 start-page: 294 issue: 10 year: 2018 ident: 4251_CR34 publication-title: Neurocomputing doi: 10.1016/j.neucom.2017.07.004 – volume: 17 start-page: 1950029 issue: 4 year: 2019 ident: 4251_CR6 publication-title: J Bioinform Comput Biol doi: 10.1142/S021972001950029X – volume: 531 start-page: 261 year: 2012 ident: 4251_CR10 publication-title: Chem Phys Lett doi: 10.1016/j.cplett.2012.02.030 – volume: 46 start-page: e56 issue: 9 year: 2018 ident: 4251_CR41 publication-title: Nucleic Acids Res doi: 10.1093/nar/gky113 – volume: 334 start-page: 1518 issue: 6062 year: 2011 ident: 4251_CR56 publication-title: Science doi: 10.1126/science.1205438 – volume: 360 start-page: 1 issue: 2 year: 2007 ident: 4251_CR44 publication-title: Biochem Biophys Res Commun doi: 10.1016/j.bbrc.2007.06.027 – volume: 100 start-page: 327 issue: 2 year: 1997 ident: 4251_CR2 publication-title: Physiol Plant doi: 10.1111/j.1399-3054.1997.tb04790.x – volume: 356 start-page: 30 year: 2014 ident: 4251_CR28 publication-title: J Theor Biol doi: 10.1016/j.jtbi.2014.04.006 – volume: 94 start-page: 3485 issue: 8 year: 1997 ident: 4251_CR4 publication-title: Proc Natl Acad Sci doi: 10.1073/pnas.94.8.3485 |
| SSID | ssj0017805 |
| Score | 2.400457 |
| Snippet | Background
Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze... Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze ability. It is... Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological antifreeze... Abstract Background Antifreeze proteins (AFPs) are a group of proteins that inhibit body fluids from growing to ice crystals and thus improve biological... |
| SourceID | doaj pubmedcentral proquest gale pubmed crossref springer |
| SourceType | Open Website Open Access Repository Aggregation Database Index Database Enrichment Source Publisher |
| StartPage | 340 |
| SubjectTerms | Accuracy Algorithms Amino acids Analysis Antifreeze proteins Antifreeze Proteins - genetics Antifreeze proteins prediction Bioinformatics Biomedical and Life Sciences Body fluids Classification Composition Computational Biology/Bioinformatics Computer Appl. in Life Sciences Crystal growth Crystals Datasets Decision trees Dipeptides Ensemble feature selection Experiments Feature extraction Feature selection Ice Ice crystals Lasso regression Life Sciences Machine learning Membrane proteins Membranes Methods Microarrays Optimization techniques Performance evaluation Predictions Principal components analysis Proteins Regression Reproducibility of Results Ridge regression Structure-function relationships Support Vector Machine Support vector machines Two-stage multiple regressions Weighted general dipeptide composition |
| SummonAdditionalLinks | – databaseName: Health & Medical Collection dbid: 7X7 link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lj9MwELZgAYkL70dgQQYhcYBo4zxs94QWxAouqxUPaW-W44xLpSUtSQuiB347M47bJYvYC5eqip2qtsfjb-zP3zD2TDd2IhxGqpXCj1KSH3RSpJXN6XKaratqSDahDg_18fHkKG649ZFWufGJwVE3c0d75HsIfXOdVbhivlp8SylrFJ2uxhQaF9klSptNdq6OtwGXIL3-zUUZLfd6QWptKZESyFRFuh4tRkGz_2_P_MfSdJY2eebsNCxJB9f_tzE32LUIRvn-YD032QVob7ErQ3rKn7fZr6OOjnGIGM0tkYo6gDXwIO0wa3tOe7j8R9hbhYZPBwHr2Rq_N7MFsWUa4ERZj7ww_I2GBwJj2sF04N-23EOQFuV9SMhDTzCwhq_1Cdxhnw_efnrzLo35GlJXKbFMbYnrfVY6UfpSWIVAC9FZgf6zsFleC8gAfNWUTkHpcgQuum6g8TqvKyu0c0Vxl-208xbuMw7aelEIrzIPpUQMpLMibybel1A5rVTCxGbgjIti5pRT48SEoEZLMwy2wcE2YbDNOmEvtu8sBimPc2u_JnvY1iQZ7vBg3k1NnNWmUBMQBUnAyaaslcSWKMQ_MrMAhfN5wp6SNRkS2miJyTO1q7437z9-MPtSIfJFAFcl7Hms5OfYBmfjxQjsCdLmGtXcHdVET-DGxRtrM9ET9ebU1BL2ZFtMbxK7roX5iupgkK40RqIJuzfY-LbdiHIQIefY42pk_aOOGZe0sy9BpxyxZyYnMmEvN_Pk9G_9u-MfnN-Kh-xqHmawTPNil-0suxU8Ypfd9-Ws7x6H-f8bjtBjiQ priority: 102 providerName: ProQuest – databaseName: SpringerLink Contemporary Journals dbid: RSV link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9QwELaggNQLb2igIIOQOEBEnIftPRZEBZeqooB6sxxnvI1UslWyC2IP_HZmnGQh5SHBZbWKJ6t4djz-Jv78mbEnurIz4bBSLRR-5JLyoJMiLmxKm9NsWRT9YRPq4EAfH88Oh01h3ch2H5ckQ6YOw1rLF50grbWYKAUUaCJeX2SXClKboRr96ONm7YBU-sftMb-9bzIFBaX-X_PxTxPSebLkuRXTMBHtX_u_LlxnVwfgyff6SLnBLkBzk13pj6L8eot9O2xpyYZI0NwSgagFWAMPMg5103F6X8u_hPeoUPF5L1Zdr_F7VZ8RM6YCTvT0gQOGv1HxQFaMW5j3XNuGewgyorwLh-_QFSyi4VN5CrfZh_3X71-9iYezGWJXKLGMbY5ze5I7kftcWIWgCpFYhrkys0laCkgAfFHlTkHuUgQpuqyg8jotCyu0c1l2h201iwZ2GAdtvciEV4mHXCLe0UmWVjPvcyicVipiYvy7jBuEy-n8jFMTChgtTe9Xg341wa9mHbFnm3vOetmOv1q_pCjYWJLkdriwaOdmGMEmUzMQGcm9ySovlcSeKMQ6MrEAmfNpxB5TDBkS1WiItTO3q64zb4_emT2pEOUiWCsi9nQw8gvsg7PDJgj0BOlwTSx3J5Y46t20eQxVM2SdzmB5mGLwI6qM2KNNM91JTLoGFiuywYJcaaw6I3a3j-xNvxHRIBpO0eNqEvMTx0xbmvokaJIjzkzkTEbs-Rj5Px7rz46_92_m99l2GgaPjNNsl20t2xU8YJfd52XdtQ9DFvgOfE9ajw priority: 102 providerName: Springer Nature |
| Title | Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble |
| URI | https://link.springer.com/article/10.1186/s12859-021-04251-z https://www.ncbi.nlm.nih.gov/pubmed/34162327 https://www.proquest.com/docview/2552805065 https://www.proquest.com/docview/2544878428 https://pubmed.ncbi.nlm.nih.gov/PMC8220696 https://doaj.org/article/379e1343326d4b76b5a785360aee3cf2 |
| Volume | 22 |
| WOSCitedRecordID | wos000665002000001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVADU databaseName: BioMedCentral customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: RBZ dateStart: 20000101 isFulltext: true titleUrlDefault: https://www.biomedcentral.com/search/ providerName: BioMedCentral – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: DOA dateStart: 20000101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: M~E dateStart: 20000101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre – providerCode: PRVPQU databaseName: Advanced Technologies & Aerospace Database customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: P5Z dateStart: 20090101 isFulltext: true titleUrlDefault: https://search.proquest.com/hightechjournals providerName: ProQuest – providerCode: PRVPQU databaseName: Biological Science Database (ProQuest) customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: M7P dateStart: 20090101 isFulltext: true titleUrlDefault: http://search.proquest.com/biologicalscijournals providerName: ProQuest – providerCode: PRVPQU databaseName: Computer Science Database customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: K7- dateStart: 20090101 isFulltext: true titleUrlDefault: http://search.proquest.com/compscijour providerName: ProQuest – providerCode: PRVPQU databaseName: Health & Medical Collection customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: 7X7 dateStart: 20090101 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: BENPR dateStart: 20090101 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: Publicly Available Content Database customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: PIMPY dateStart: 20090101 isFulltext: true titleUrlDefault: http://search.proquest.com/publiccontent providerName: ProQuest – providerCode: PRVAVX databaseName: SpringerLINK Contemporary 1997-Present customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: RSV dateStart: 20001201 isFulltext: true titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22 providerName: Springer Nature |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELaggMQF8SZQVgEhcYCocR62c2xRKyrEKtoCWrhYjjNZIpVstdkFsQd-OzNOdmmKgAsXK4knUTIe2zPx528Ye6ZKk3GLkWoqsUgEjYNW8CA1EW1OM0Wadskm5HisptMsP5fqizBhHT1wp7i9WGbAY2LZEmVSSFGkRuIUI0IDENvKjb6hzDbBVL9-QEz9my0ySuy1nHjaAoIjkJHyYD2Yhhxb_-9j8rlJ6SJg8sKqqZuMjm6yG70X6e93b3-LXYLmNrvW5ZX8fof9yBe0_kKIZt8QGmgBsAbfcTLUTevTz1f_m_spCqU_65in6zUel_UZwVxK8Alr3gO68Bml75CHwQJmHXC28StwnKB-6zLp0BWMiOFLcQp32fujw3evXgd9ooXAppIvA5PgRB0mlidVwo1EDwndqhgHvtiEUcEhBKjSMrESEhuhx6GKEspKRdgWXFkbx_fYTjNv4AHzQZmKx7ySYQWJQOdFhXFUZlWVQGqVlB7jG71r27OQUzKMU-2iESV011Ya20q7ttJrj73Y3nPWcXD8VfqAmnMrSfzZ7gJale6tSv_Lqjz2lIxBE0NGQxCcmVm1rT4-meh9IdFlRc8r9djzXqia4zdY0-9oQE0QqdZAcncgiV3YDqs3Nqf7IaTVGOtFaMXoInrsybaa7iRYXAPzFclgdC0VhpAeu9-Z6Pa70T1B1zZCjcuB8Q4UM6xp6s-OYBydxlBkwmMvN2b-67X-rPiH_0Pxj9j1yHVTEUTxLttZLlbwmF21X5d1uxixy3IqXalG7MrB4TifjFzHx_KNDEaE3M2xzNNPWJ8fv80_4tnk5MNPp-9d7g |
| linkProvider | Directory of Open Access Journals |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1Lb9NAEF5VLQguvB-GAgaBOIBV7_qxmwNC5VE1ahtFUKRyWuz1OEQqTrATqubAT-I3MrO2U1xEbz1wiSJ7HHk389z99hvGnqos6XGDlWok8SOMyQ-amHtRIuhwWpJGUd1sQg4G6uCgN1xhv9qzMASrbH2iddTZxNAa-QamvkL5EUbM19PvHnWNot3VtoVGrRY7cHyEJVv1qv8O_99nQmy933-77TVdBTwTST7zkhCjkh8aHuYhTySmA5hDBGjlQeKLlIMPkEdZaCSERmB4VWkGWa5EGiVcGUMLoOjy10JUdn-VrQ37e8PPy30L6hDQHs1R8UbFiR_OIxgEGQf3Fp3wZ7sE_B0L_giGp4Gap3ZrbRDcuvq_Td81dqVJt93N2j6usxUobrCLdQPO45vs57CkjSqCfrsJwaZKgAW4lrxiXFQurVK7R3b1GDJ3VFN0jxf4PRtPCQ-UgUug_Ab5hr-RuRai6ZUwqhHGhZuDJU91K9tyiK5AUcG39BBusU_nMvjbbLWYFHCXuaCSnAc8l34OYYxZnvIDkfXyPITIKCkdxltF0aaha6euIYfalm0q1rVyaVQubZVLLxz2YvnMtCYrOVP6DenfUpKIxu2FSTnSjd_SgewBD4jkLs7CVMY4EokZXuwnAIHJhcOekPZqohIpCKs0SuZVpfsfP-jNWGJujylq5LDnjVA-wTGYpDn6gTNB7GMdyfWOJPo6073dardufG2lT1TbYY-Xt-lJwg8WMJmTTIiVucJa22F3aptajhvzOKwBBM647FhbZ2K6d4rxV8vEjtm1H_dih71s7fLktf498ffOHsUjdml7f29X7_YHO_fZZWG9R-yJYJ2tzso5PGAXzI_ZuCofNt7HZV_O22J_AymEwmc |
| linkToPdf | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Lj9MwELZgeYgLb5bAAgEhcYBo48Sx0-PyqFiBqmoX0N4sxxmXSrtplbQgeuC3M-OkZbM8JMSlquJJFU_H42_iz58Ze5qXZsAtVqqZwg8hKQ9ayaPMJLQ5zRRZ1h42oUaj_OhoMD61i9-z3ddLku2eBlJpqha789K1QzyXuw0n3bWI6AUUdDxanWcXBFYyROo6OPy0WUcgxf71Vpnf3tebjrxq_6-5-dTkdJY4eWb11E9Kw2v_353r7GoHSMO9NoJusHNQ3WSX2iMqv91i38c1LeUQOTo0RCyqAVYQenmHadWE9B43_Orfr0IZTloR6-kKv5fTOTFmSgiJtt5xw_A3ytCTGKMaJi0HtwodeHnRsPGH8tAVLK7hpDiG2-zj8M2HV2-j7syGyGaKLyIjcM6PheXCCW4Ugi1EaCnm0NTEScEhBnBZKawCYRMEL3lRQunypMgMz61N0ztsq5pVcJeFkBvHU-5U7EBIxEF5nCblwDkBmc2VChhf_3XadoLmdK7GsfaFTS5161eNftXer3oVsOebe-atnMdfrV9SRGwsSYrbX5jVE92NbJ2qAfCUZOBkKQolsScKMZCMDUBqXRKwJxRPmsQ2KmLzTMyyafT-4YHekwrRL4K4LGDPOiM3wz5Y022OQE-QPlfPcqdnidnA9pvXYau7bNRoLBsTHAiINgP2eNNMdxLDroLZkmywUFc5VqMB226jfNNvRDqIkhP0uOrFf88x_ZZq-tlrlSP-jOVABuzFehT8fKw_O_7ev5k_YpfHr4f6_f7o3X12JfHjSEZJusO2FvUSHrCL9sti2tQPfXL4Aa09Zlc |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Predicting+antifreeze+proteins+with+weighted+generalized+dipeptide+composition+and+multi-regression+feature+selection+ensemble&rft.jtitle=BMC+bioinformatics&rft.au=Wang%2C+Shunfang&rft.au=Deng%2C+Lin&rft.au=Xia%2C+Xinnan&rft.au=Cao%2C+Zicheng&rft.date=2021-06-23&rft.pub=BioMed+Central&rft.eissn=1471-2105&rft.volume=22&rft.issue=Suppl+3&rft_id=info:doi/10.1186%2Fs12859-021-04251-z&rft_id=info%3Apmid%2F34162327&rft.externalDocID=PMC8220696 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon |