MiPepid: MicroPeptide identification tool using machine learning
Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micro...
Uloženo v:
| Vydáno v: | BMC bioinformatics Ročník 20; číslo 1; s. 559 - 11 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Journal Article |
| Jazyk: | angličtina |
| Vydáno: |
London
BioMed Central
08.11.2019
BioMed Central Ltd BMC |
| Témata: | |
| ISSN: | 1471-2105, 1471-2105 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Background
Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which “normal-sized” proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on “normal” proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins.
Results
In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at
https://github.com/MindAI/MiPepid
.
Conclusions
MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. |
|---|---|
| AbstractList | Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid. MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. Abstract Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which “normal-sized” proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on “normal” proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. Results In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid. Conclusions MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid. MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which “normal-sized” proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on “normal” proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. Results In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid . Conclusions MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins.BACKGROUNDMicropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins.In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid.RESULTSIn this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid.MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast.CONCLUSIONSMiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. Results In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at Conclusions MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. Keywords: Micropeptide, Small ORF, sORF, smORF, Coding, Noncoding, lncRNA, Machine learning |
| ArticleNumber | 559 |
| Audience | Academic |
| Author | Gribskov, Michael Zhu, Mengmeng |
| Author_xml | – sequence: 1 givenname: Mengmeng surname: Zhu fullname: Zhu, Mengmeng organization: Department of Statistics, Purdue University, Department of Biological Sciences, Purdue University – sequence: 2 givenname: Michael orcidid: 0000-0002-1718-0242 surname: Gribskov fullname: Gribskov, Michael email: mgribsko@purdue.edu organization: Department of Biological Sciences, Purdue University |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/31703551$$D View this record in MEDLINE/PubMed |
| BookMark | eNp9kktv1DAUhS1URNuBH8AGRWIDixQ7fiRmgagqHiO1AvFYW44fqauMPdgJgn_PnaZUHYSqyIp1_Z3je-VzjA5iig6hpwSfENKJV4U0HZc1JrKmmNJaPkBHhLWkbgjmB3f2h-i4lCuMSdth_ggdUtJiyjk5Qm8vwme3DfZ1dRFMTrCfgnUVrDgFH4yeQorVlNJYzSXEodpocxmiq0anc4TCY_TQ67G4Jzf_Ffr-_t23s4_1-acP67PT89qIlk-15NZ4w7yXVlLhbMOshx467iXuvaDSG9I7hh1rGyMFET2ThLmOOKxbzR1dofXia5O-UtscNjr_VkkHdV1IeVA6T8GMTvVGEGNt27e2B4tedx3tDWkEYdxZKcDrzeK1nfuNswZmzXrcM90_ieFSDemnEh1rCKNg8OLGIKcfsyuT2oRi3Djq6NJcVEMJpYILeJUVer6gg4bWQvQJHM0OV6cCt5QLKRhQJ_-h4LNuEwy8ug9Q3xO83BMAM7lf06DnUtT665d99tndcW_n_JsCAMgCQAJKyc7fIgSrXdLUkjQFSVO7pCkJmvYfjQnTdVqg8zDeq2wWZYFb4uCyukpzjhCee0R_ALBj5cc |
| CitedBy_id | crossref_primary_10_1016_j_ymeth_2022_12_003 crossref_primary_10_1186_s12935_024_03446_7 crossref_primary_10_1007_s12539_021_00464_1 crossref_primary_10_1016_j_gene_2024_148817 crossref_primary_10_1093_bfgp_elae018 crossref_primary_10_1093_nar_gkab816 crossref_primary_10_1093_bib_bbad352 crossref_primary_10_1093_jxb_eraf240 crossref_primary_10_3390_cells13191645 crossref_primary_10_48130_GR_2023_0026 crossref_primary_10_1093_bioinformatics_btaf250 crossref_primary_10_3390_cancers17172932 crossref_primary_10_1111_febs_15769 crossref_primary_10_1371_journal_pone_0320314 crossref_primary_10_1007_s12672_025_01945_1 crossref_primary_10_1094_MPMI_34_2 crossref_primary_10_32604_cmc_2022_023848 crossref_primary_10_1109_ACCESS_2024_3419766 crossref_primary_10_1111_imm_13850 crossref_primary_10_1007_s42994_023_00100_0 crossref_primary_10_3389_fmicb_2024_1304044 crossref_primary_10_1007_s12539_024_00661_8 crossref_primary_10_1016_j_chom_2020_11_002 crossref_primary_10_1016_j_ijbiomac_2023_124952 crossref_primary_10_1186_s13045_024_01591_0 crossref_primary_10_1016_j_tig_2024_09_004 crossref_primary_10_2174_1574893615999200811130522 crossref_primary_10_1111_mpp_70084 crossref_primary_10_3390_biology10050371 crossref_primary_10_1007_s12539_023_00552_4 crossref_primary_10_1016_j_xplc_2025_101437 crossref_primary_10_1080_07391102_2023_2235605 crossref_primary_10_1094_MPMI_06_20_0160_R crossref_primary_10_1016_j_chemolab_2022_104490 crossref_primary_10_1161_ATVBAHA_120_314222 crossref_primary_10_1093_bib_bbae147 crossref_primary_10_1093_bib_bbae345 crossref_primary_10_3390_proteomes12030026 crossref_primary_10_52586_4943 crossref_primary_10_3390_ijms241310562 crossref_primary_10_3389_fgene_2024_1439423 crossref_primary_10_3390_pharmaceutics16111486 crossref_primary_10_1038_s43586_023_00205_2 crossref_primary_10_3389_fpls_2022_975938 crossref_primary_10_1007_s13205_022_03147_w crossref_primary_10_1016_j_compbiomed_2023_106773 crossref_primary_10_1186_s12929_022_00802_5 crossref_primary_10_3390_ijms26083651 crossref_primary_10_1002_cbic_202100534 crossref_primary_10_1016_j_pharmr_2025_100065 crossref_primary_10_1016_j_virusres_2020_198104 crossref_primary_10_1093_bib_bbac392 crossref_primary_10_1016_j_canlet_2022_215723 crossref_primary_10_1016_j_isci_2023_106781 crossref_primary_10_3390_a15080274 crossref_primary_10_1016_j_canlet_2020_10_038 crossref_primary_10_3390_plants12233951 crossref_primary_10_1109_TCBB_2021_3104288 crossref_primary_10_3390_cancers16152660 crossref_primary_10_1007_s00018_020_03740_3 crossref_primary_10_1186_s12943_025_02278_x crossref_primary_10_1093_bib_bbac108 crossref_primary_10_1016_j_yexcr_2020_111923 crossref_primary_10_1093_hr_uhac043 crossref_primary_10_1016_j_compbiomed_2022_106440 |
| Cites_doi | 10.7554/eLife.03523 10.3389/fphys.2017.00230 10.1093/bioinformatics/btp688 10.3389/fmicb.2013.00269 10.1084/jem.183.3.1131 10.1093/nar/gkx428 10.1186/1471-2164-14-648 10.3389/fgene.2018.00144 10.1093/nar/gkp335 10.1146/annurev-cellbio-100616-060516 10.1186/s13059-015-0742-x 10.1126/science.1085650 10.1126/scisignal.aaj1460 10.1038/nrg3645 10.1126/science.aad4076 10.1016/j.cell.2016.02.066 10.1093/bib/bbx005 10.1126/science.1238802 10.1186/1471-2105-15-36 10.1002/embj.201488411 10.1093/nar/gkm391 10.1093/nar/gkv1175 10.1186/1471-2105-6-262 10.1002/embj.201488303 10.1021/acs.jproteome.7b00707 10.1093/nar/gkt1059 10.1016/j.cell.2015.01.009 10.1016/j.celrep.2018.05.058 10.1186/1471-2105-9-192 10.1093/nar/gkt006 10.1093/nar/gkt646 10.1016/j.tcb.2017.04.006 10.1016/j.cell.2013.06.009 10.1093/bioinformatics/btr209 10.1126/science.aam9361 10.1016/j.molcel.2017.09.015 10.1093/database/bas008 10.1101/gr.080531.108 10.7554/eLife.13328 10.1038/nrm.2017.58 10.1126/science.1168978 10.7554/eLife.08890 10.1093/nar/gkx1130 10.1038/nrg.2016.119 10.1016/j.cmet.2015.02.009 10.1093/nar/gkx1098 10.1093/nar/gku989 10.1038/nature21034 |
| ContentType | Journal Article |
| Copyright | The Author(s). 2019 COPYRIGHT 2019 BioMed Central Ltd. |
| Copyright_xml | – notice: The Author(s). 2019 – notice: COPYRIGHT 2019 BioMed Central Ltd. |
| DBID | C6C AAYXX CITATION CGR CUY CVF ECM EIF NPM ISR 7X8 5PM DOA |
| DOI | 10.1186/s12859-019-3033-9 |
| DatabaseName | Springer Nature OA Free Journals CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed Gale In Context: Science MEDLINE - Academic PubMed Central (Full Participant titles) DOAJ Directory of Open Access Journals |
| DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
| DatabaseTitleList | MEDLINE MEDLINE - Academic |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: 7X8 name: MEDLINE - Academic url: https://search.proquest.com/medline sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Biology |
| EISSN | 1471-2105 |
| EndPage | 11 |
| ExternalDocumentID | oai_doaj_org_article_bc61cdd7b7db4e8ba883bc126145ed96 PMC6842143 A607356964 31703551 10_1186_s12859_019_3033_9 |
| Genre | Journal Article |
| GroupedDBID | --- 0R~ 23N 2WC 53G 5VS 6J9 7X7 88E 8AO 8FE 8FG 8FH 8FI 8FJ AAFWJ AAJSJ AAKPC AASML ABDBF ABUWG ACGFO ACGFS ACIHN ACIWK ACPRK ACUHS ADBBV ADMLS ADUKV AEAQA AENEX AEUYN AFKRA AFPKN AFRAH AHBYD AHMBA AHYZX ALMA_UNASSIGNED_HOLDINGS AMKLP AMTXH AOIJS ARAPS AZQEC BAPOH BAWUL BBNVY BCNDV BENPR BFQNJ BGLVJ BHPHI BMC BPHCQ BVXVI C6C CCPQU CS3 DIK DU5 DWQXO E3Z EAD EAP EAS EBD EBLON EBS EMB EMK EMOBN ESX F5P FYUFA GNUQQ GROUPED_DOAJ GX1 HCIFZ HMCUK HYE IAO ICD IHR INH INR ISR ITC K6V K7- KQ8 LK8 M1P M48 M7P MK~ ML0 M~E O5R O5S OK1 OVT P2P P62 PGMZT PHGZM PHGZT PIMPY PJZUB PPXIY PQGLB PQQKQ PROAC PSQYO PUEGO RBZ RNS ROL RPM RSV SBL SOJ SV3 TR2 TUS UKHRP W2D WOQ WOW XH6 XSB AAYXX AFFHD CITATION ALIPV CGR CUY CVF ECM EIF NPM 7X8 5PM |
| ID | FETCH-LOGICAL-c675t-95dcfc4ff9d936ed24df70385f90bf639fc1be40e472c9616b4914e81e0a7a5e3 |
| IEDL.DBID | RSV |
| ISICitedReferencesCount | 70 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000496277900007&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1471-2105 |
| IngestDate | Fri Oct 03 12:52:57 EDT 2025 Tue Nov 04 01:44:56 EST 2025 Thu Sep 04 19:05:56 EDT 2025 Tue Nov 11 10:23:31 EST 2025 Tue Nov 04 17:55:52 EST 2025 Thu Nov 13 15:23:13 EST 2025 Thu Apr 03 07:02:33 EDT 2025 Sat Nov 29 05:40:04 EST 2025 Tue Nov 18 21:11:07 EST 2025 Sat Sep 06 07:27:26 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 1 |
| Keywords | Small ORF Noncoding lncRNA sORF Coding Machine learning smORF Micropeptide |
| Language | English |
| License | Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c675t-95dcfc4ff9d936ed24df70385f90bf639fc1be40e472c9616b4914e81e0a7a5e3 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| ORCID | 0000-0002-1718-0242 |
| OpenAccessLink | https://link.springer.com/10.1186/s12859-019-3033-9 |
| PMID | 31703551 |
| PQID | 2313365630 |
| PQPubID | 23479 |
| PageCount | 11 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_bc61cdd7b7db4e8ba883bc126145ed96 pubmedcentral_primary_oai_pubmedcentral_nih_gov_6842143 proquest_miscellaneous_2313365630 gale_infotracmisc_A607356964 gale_infotracacademiconefile_A607356964 gale_incontextgauss_ISR_A607356964 pubmed_primary_31703551 crossref_primary_10_1186_s12859_019_3033_9 crossref_citationtrail_10_1186_s12859_019_3033_9 springer_journals_10_1186_s12859_019_3033_9 |
| PublicationCentury | 2000 |
| PublicationDate | 2019-11-08 |
| PublicationDateYYYYMMDD | 2019-11-08 |
| PublicationDate_xml | – month: 11 year: 2019 text: 2019-11-08 day: 08 |
| PublicationDecade | 2010 |
| PublicationPlace | London |
| PublicationPlace_xml | – name: London – name: England |
| PublicationTitle | BMC bioinformatics |
| PublicationTitleAbbrev | BMC Bioinformatics |
| PublicationTitleAlternate | BMC Bioinformatics |
| PublicationYear | 2019 |
| Publisher | BioMed Central BioMed Central Ltd BMC |
| Publisher_xml | – name: BioMed Central – name: BioMed Central Ltd – name: BMC |
| References | 3033_CR18 L Sun (3033_CR27) 2013; 41 CA Makarewich (3033_CR1) 2017; 27 EG Magny (3033_CR8) 2013; 341 AA Bazzini (3033_CR23) 2014; 33 DR Zerbino (3033_CR34) 2018; 46 J-P Couso (3033_CR3) 2017; 18 C Lee (3033_CR9) 2015; 21 BR Nelson (3033_CR43) 2016; 351 MF Lin (3033_CR28) 2011; 27 DM Anderson (3033_CR6) 2015; 160 A Chugunova (3033_CR2) 2018; 17 Z Ji (3033_CR36) 2015; 4 TL Bailey (3033_CR40) 2009; 37 P Bi (3033_CR44) 2017; 356 H Zhang (3033_CR38) 2013; 4 Y-J Kang (3033_CR25) 2017; 45 J Ruiz-Orera (3033_CR35) 2014; 3 J Crappé (3033_CR22) 2013; 14 NT Ingolia (3033_CR15) 2009; 324 BY Chan (3033_CR41) 2005; 6 CA Makarewich (3033_CR42) 2018; 23 B Cai (3033_CR13) 2017; 8 L Wang (3033_CR26) 2013; 41 S Plaza (3033_CR47) 2017; 33 NT Ingolia (3033_CR17) 2016; 165 L Kong (3033_CR24) 2007; 35 RF Wang (3033_CR11) 1996; 183 K Hanada (3033_CR20) 2010; 26 SD Mackowiak (3033_CR21) 2015; 16 RA Harte (3033_CR32) 2012; 2012 UniProt Consortium (3033_CR30) 2015; 43 KD Pruitt (3033_CR33) 2009; 19 J-Z Huang (3033_CR46) 2017; 68 V Olexiouk (3033_CR4) 2018; 46 NT Ingolia (3033_CR14) 2014; 15 F Yeasmin (3033_CR12) 2018; 9 M Jiang (3033_CR39) 2008; 9 V Olexiouk (3033_CR5) 2016; 44 SM Cohen (3033_CR48) 2014; 33 Y Hao (3033_CR29) 2018; 19 M Guttman (3033_CR37) 2013; 154 JM Mudge (3033_CR16) 2016; 17 CM Farrell (3033_CR31) 2014; 42 A Matsumoto (3033_CR45) 2016; 541 SR Schwab (3033_CR10) 2003; 301 A Skarshewski (3033_CR19) 2014; 15 DM Anderson (3033_CR7) 2016; 9 |
| References_xml | – volume: 3 start-page: e03523 year: 2014 ident: 3033_CR35 publication-title: Elife doi: 10.7554/eLife.03523 – volume: 8 start-page: 230 year: 2017 ident: 3033_CR13 publication-title: Front Physiol doi: 10.3389/fphys.2017.00230 – volume: 26 start-page: 399 year: 2010 ident: 3033_CR20 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp688 – volume: 4 start-page: 269 year: 2013 ident: 3033_CR38 publication-title: Front Microbiol doi: 10.3389/fmicb.2013.00269 – volume: 183 start-page: 1131 LP year: 1996 ident: 3033_CR11 publication-title: J Exp Med doi: 10.1084/jem.183.3.1131 – volume: 45 start-page: W12 year: 2017 ident: 3033_CR25 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkx428 – volume: 14 start-page: 648 year: 2013 ident: 3033_CR22 publication-title: BMC Genomics doi: 10.1186/1471-2164-14-648 – volume: 9 start-page: 144 year: 2018 ident: 3033_CR12 publication-title: Front Genet doi: 10.3389/fgene.2018.00144 – volume: 37 start-page: W202 issue: suppl_2 year: 2009 ident: 3033_CR40 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkp335 – volume: 33 start-page: 391 year: 2017 ident: 3033_CR47 publication-title: Annu Rev Cell Dev Biol doi: 10.1146/annurev-cellbio-100616-060516 – volume: 16 start-page: 179 year: 2015 ident: 3033_CR21 publication-title: Genome Biol doi: 10.1186/s13059-015-0742-x – volume: 301 start-page: 1367 LP year: 2003 ident: 3033_CR10 publication-title: Science (80- ) doi: 10.1126/science.1085650 – volume: 9 start-page: ra119 LP year: 2016 ident: 3033_CR7 publication-title: Sci Signal doi: 10.1126/scisignal.aaj1460 – volume: 15 start-page: 205 year: 2014 ident: 3033_CR14 publication-title: Nat Rev Genet. doi: 10.1038/nrg3645 – volume: 351 start-page: 271 LP year: 2016 ident: 3033_CR43 publication-title: Science (80- ) doi: 10.1126/science.aad4076 – volume: 165 start-page: 22 year: 2016 ident: 3033_CR17 publication-title: Cell doi: 10.1016/j.cell.2016.02.066 – volume: 19 start-page: 636 year: 2018 ident: 3033_CR29 publication-title: Brief Bioinform doi: 10.1093/bib/bbx005 – volume: 341 start-page: 1116 LP year: 2013 ident: 3033_CR8 publication-title: Science (80- ) doi: 10.1126/science.1238802 – volume: 15 start-page: 36 year: 2014 ident: 3033_CR19 publication-title: BMC Bioinformatics doi: 10.1186/1471-2105-15-36 – volume: 33 start-page: 981 LP year: 2014 ident: 3033_CR23 publication-title: EMBO J doi: 10.1002/embj.201488411 – volume: 35 start-page: W345 issue: Web Server issu year: 2007 ident: 3033_CR24 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkm391 – volume: 44 start-page: D324 year: 2016 ident: 3033_CR5 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkv1175 – volume: 6 start-page: 262 year: 2005 ident: 3033_CR41 publication-title: BMC Bioinformatics doi: 10.1186/1471-2105-6-262 – volume: 33 start-page: 937 LP year: 2014 ident: 3033_CR48 publication-title: EMBO J doi: 10.1002/embj.201488303 – volume: 17 start-page: 1 year: 2018 ident: 3033_CR2 publication-title: J Proteome Res doi: 10.1021/acs.jproteome.7b00707 – volume: 42 start-page: D865 issue: Database issue year: 2014 ident: 3033_CR31 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkt1059 – volume: 160 start-page: 595 year: 2015 ident: 3033_CR6 publication-title: Cell doi: 10.1016/j.cell.2015.01.009 – volume: 23 start-page: 3701 year: 2018 ident: 3033_CR42 publication-title: Cell Rep doi: 10.1016/j.celrep.2018.05.058 – volume: 9 start-page: 192 year: 2008 ident: 3033_CR39 publication-title: BMC Bioinformatics doi: 10.1186/1471-2105-9-192 – volume: 41 start-page: e74 year: 2013 ident: 3033_CR26 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkt006 – volume: 41 start-page: e166 year: 2013 ident: 3033_CR27 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkt646 – volume: 27 start-page: 685 year: 2017 ident: 3033_CR1 publication-title: Trends Cell Biol doi: 10.1016/j.tcb.2017.04.006 – volume: 154 start-page: 240 year: 2013 ident: 3033_CR37 publication-title: Cell doi: 10.1016/j.cell.2013.06.009 – volume: 27 start-page: i275 year: 2011 ident: 3033_CR28 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btr209 – volume: 356 start-page: 323 LP year: 2017 ident: 3033_CR44 publication-title: Science (80- ) doi: 10.1126/science.aam9361 – volume: 68 start-page: 171 year: 2017 ident: 3033_CR46 publication-title: Mol Cell doi: 10.1016/j.molcel.2017.09.015 – volume: 2012 start-page: bas008 year: 2012 ident: 3033_CR32 publication-title: Database (Oxford) doi: 10.1093/database/bas008 – volume: 19 start-page: 1316 year: 2009 ident: 3033_CR33 publication-title: Genome Res doi: 10.1101/gr.080531.108 – ident: 3033_CR18 doi: 10.7554/eLife.13328 – volume: 18 start-page: 575 year: 2017 ident: 3033_CR3 publication-title: Nat Rev Mol Cell Biol doi: 10.1038/nrm.2017.58 – volume: 324 start-page: 218 LP year: 2009 ident: 3033_CR15 publication-title: Science (80- ) doi: 10.1126/science.1168978 – volume: 4 start-page: e08890 year: 2015 ident: 3033_CR36 publication-title: Elife doi: 10.7554/eLife.08890 – volume: 46 start-page: D497 year: 2018 ident: 3033_CR4 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkx1130 – volume: 17 start-page: 758 year: 2016 ident: 3033_CR16 publication-title: Nat Rev Genet doi: 10.1038/nrg.2016.119 – volume: 21 start-page: 443 year: 2015 ident: 3033_CR9 publication-title: Cell Metab doi: 10.1016/j.cmet.2015.02.009 – volume: 46 start-page: D754 year: 2018 ident: 3033_CR34 publication-title: Nucleic Acids Res doi: 10.1093/nar/gkx1098 – volume: 43 start-page: D204 issue: Database issue year: 2015 ident: 3033_CR30 publication-title: Nucleic Acids Res doi: 10.1093/nar/gku989 – volume: 541 start-page: 228 year: 2016 ident: 3033_CR45 publication-title: Nature doi: 10.1038/nature21034 |
| SSID | ssj0017805 |
| Score | 2.5618274 |
| Snippet | Background
Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally... Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to... Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to... Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally... Abstract Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were... |
| SourceID | doaj pubmedcentral proquest gale pubmed crossref springer |
| SourceType | Open Website Open Access Repository Aggregation Database Index Database Enrichment Source Publisher |
| StartPage | 559 |
| SubjectTerms | Algorithms Amino acids Bioinformatics Biomedical and Life Sciences Coding Computational biology Computational Biology - methods Computational Biology/Bioinformatics Computer Appl. in Life Sciences Databases, Protein DNA DNA sequencing Genomics Life Sciences Machine Learning Machine Learning and Artificial Intelligence in Bioinformatics Microarrays Micropeptide Noncoding Open Reading Frames - genetics Peptides Peptides - analysis Proteins Small ORF smORF Software sORF |
| SummonAdditionalLinks | – databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1bi9UwEA6yKPgi3q2uUkUQlLJNL0njk6u4KLjLwQvsW2iSydnC2i7bcwT_vTNpzmG7or742kyh-TJJvmSm3zD23OYKZIgZCumzqqWLJg9NJmRrC5DKSwhVSz7Jo6Pm-FgtLpT6opywSR54Am7PWMGtc9JIZypoTNs0pbEciX9Vg1NBbBtZz-YwFeMHpNQfY5i8EXsjJ502PDZTnL8sMzXbhYJY_-9L8oU96XK-5KWgadiLDm6yG5FEpvvTx99iV6C_za5NZSV_3mFvDrsFnHXudXpI6XYLSlxxkHYuZgaFwUhXw3CaUtr7Mv0eMiohjSUklnfZt4P3X999yGKlhMwi4V9lqnbW28p75VQpwBWV85Jifl7lxiMJ8ZYbqHKoZGGV4MJUiiOaHPJWtjWU99hOP_TwgKUGd81CNt545StrXCtsbfCcDAXHlrxKWL5BTtsoI07VLE51OE40Qk9gawRbE9haJezl9pWzSUPjb8ZvaTi2hiR_HR6gU-joFPpfTpGwZzSYmgQuesqgWbbrcdQfv3zW-wIXtVoogT15EY38gD2wbfwhAXEgTayZ5e7MEmegnTU_3fiMpiZKW-thWI8ayXNZCpJgS9j9yYe2HUPiliPZ4wmTM--a9Xze0ncnQQCcYqfIcxP2auOHOq4845-Bffg_gH3Erhc0i8K9-i7bWZ2v4TG7an-suvH8SZiDvwBHwTWH priority: 102 providerName: Directory of Open Access Journals |
| Title | MiPepid: MicroPeptide identification tool using machine learning |
| URI | https://link.springer.com/article/10.1186/s12859-019-3033-9 https://www.ncbi.nlm.nih.gov/pubmed/31703551 https://www.proquest.com/docview/2313365630 https://pubmed.ncbi.nlm.nih.gov/PMC6842143 https://doaj.org/article/bc61cdd7b7db4e8ba883bc126145ed96 |
| Volume | 20 |
| WOSCitedRecordID | wos000496277900007&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVADU databaseName: Open Access: BioMedCentral Open Access Titles customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: RBZ dateStart: 20000101 isFulltext: true titleUrlDefault: https://www.biomedcentral.com/search/ providerName: BioMedCentral – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: DOA dateStart: 20000101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: M~E dateStart: 20000101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre – providerCode: PRVPQU databaseName: Advanced Technologies & Aerospace Database customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: P5Z dateStart: 20090101 isFulltext: true titleUrlDefault: https://search.proquest.com/hightechjournals providerName: ProQuest – providerCode: PRVPQU databaseName: Biological Science Database customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: M7P dateStart: 20090101 isFulltext: true titleUrlDefault: http://search.proquest.com/biologicalscijournals providerName: ProQuest – providerCode: PRVPQU databaseName: Computer Science Database (ProQuest) customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: K7- dateStart: 20090101 isFulltext: true titleUrlDefault: http://search.proquest.com/compscijour providerName: ProQuest – providerCode: PRVPQU databaseName: Health & Medical Collection customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: 7X7 dateStart: 20090101 isFulltext: true titleUrlDefault: https://search.proquest.com/healthcomplete providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Central customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: BENPR dateStart: 20090101 isFulltext: true titleUrlDefault: https://www.proquest.com/central providerName: ProQuest – providerCode: PRVPQU databaseName: ProQuest Publicly Available Content Database customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: PIMPY dateStart: 20090101 isFulltext: true titleUrlDefault: http://search.proquest.com/publiccontent providerName: ProQuest – providerCode: PRVAVX databaseName: SpringerLINK Contemporary 1997-Present customDbUrl: eissn: 1471-2105 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0017805 issn: 1471-2105 databaseCode: RSV dateStart: 20001201 isFulltext: true titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22 providerName: Springer Nature |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9NAEF7RFqReeD8MJTIICQlk4eeulxMtakUFiawUUMpl5X2lkYpdxQkS_56ZzSbC5SHBZQ_e8WHGMzszO-NvCHmmYm6YqxlSZqO8xosma8qIslqlhnHLjJta8oGNRuVkwiv_H3e37nZflyTdSe3MuqSvugSx1iD1xVp9lkV8i-yAtyvRGscnnzelAwTp9-XL377Wc0AOp__X0_gnd3S5VfJSvdS5oaMb_8XATXLdR53h_kpNbpErprlNrq3mUH6_Q94MZ5W5mOnX4RD78yrsdNEmnGnfSuS-Xrho2_MQ--Sn4VfXgmlCP3Niepd8Ojr8-PZd5EcrRAoyhEXEC62syq3lmmfU6DTXlmGR0PJYWoharEqkyWOTs1RxmlCZ8yQ3ZWLimtWFye6R7aZtzAMSSnCzKSuttNzmSuqaqkJCYm3SBHbiPCDxWt5CedxxHH9xLlz-UVKxEowAwQgUjOABebF55WIFuvE34gP8iBtCxMt2D9r5VHjzE1LRRGnNJNMS2JB1WWZSJZA-5oXRnAbkKaqAQESMBltupvWy68TxyVjsUzgFC8opcPLcE9kWOFC1_4MB5IAgWj3KvR4lmKzqbT9Za5rALexza0y77ARE21lGEbMtIPdXmrdhDCK9GKLDJCCsp5M9zvs7zezMIYZjsRUC44C8XGum8EdV92fBPvwn6kdkN0XVdjfue2R7MV-ax-Sq-raYdfMB2WIT5tZyQHYODkfVeOCuP2B9z6IBttxWsFbFF9ivjofV6cCZ9Q8dOEDU |
| linkProvider | Springer Nature |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Zb9QwEB5BAcEL9xEoEBASEigih9eOeaIgqlbsrlZtqfpmxdeyUkmqzS4S_x6P46xIOSR4jScPMxl7ZjKfvwF4oVJumO8ZUmYTUuGPJmvKhLJK5YZxy4yfWjJm02l5csJn4R5326Pd-5akP6n9ti7pmzZDrjVX-mKvvigSfhEuERewEMd3cHi8aR0gSX9oX_72tUEA8jz9v57GP4Wj81DJc_1SH4Z2b_yXAjfhesg6453OTW7BBVPfhivdHMrvd-DdZDEzZwv9Np4gPm-GSBdt4oUOUCL_9eJV05zGiJOfx189BNPEYebE_C583v149GEvCaMVEuUqhFXCR1pZRazlmhfU6Jxoy7BJaHkqrctarMqkIakhLFecZlQSnhFTZiatWDUyxT3YqpvaPIBYujCbs9JKyy1RUldUjaQrrE2euZWURJD29hYq8I7j-ItT4euPkorOMMIZRqBhBI_g1eaVs45042_C7_EjbgSRL9s_aJZzEbafkIpmSmsmmZZODVmVZSFV5spHMjKa0wieowsIZMSoEXIzr9ZtK_YPD8QOdafgiHLqNHkZhGzjNFBVuMHg7IAkWgPJ7YGk27JqsPys9zSBS4hzq02zboXLtouCImdbBPc7z9so5jK91GWHWQRs4JMDzYcr9eKLZwzHZqtLjCN43XumCEdV-2fDPvwn6adwde9oMhbj_emnR3AtRzf3f9-3YWu1XJvHcFl9Wy3a5RO_XX8Aioo64w |
| linkToPdf | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Zb9QwEB5BgYoX7iNQICAkJFDUHF475olyrKhoVysKqG9WfG1XKslqk0Xi3-NxvCtSDgnxGo8fZjK2ZzyfvwF4qlJumK8ZUmYTUuFFkzVlQlmlcsO4ZcZ3LTlgk0l5fMynoc9pu0a7r0uS_ZsGZGmqu92Ftv0SL-lumyHvmkuDsW5fFAk_DxcI9gzCdP3oy6aMgIT9oZT522mDw8hz9v-6M_90NJ2FTZ6pnfojaXz1v5W5BldCNBrv9e5zHc6Z-gZc6vtTfr8Jrw7nU7OY65fxIeL2poiA0Sae6wAx8n817prmNEb8_Cz-6qGZJg69KGa34PP43ac375PQciFRLnPoEj7SyipiLde8oEbnRFuGxUPLU2ldNGNVJg1JDWG54jSjkvCMmDIzacWqkSluw1bd1OYuxNIdvzkrrbTcEiV1RdVIuoTb5JkbSUkE6dr2QgU-cmyLcSp8XlJS0RtGOMMINIzgETzfTFn0ZBx_E36NP3QjiDza_kOznImwLIVUNFNaM8m0dGrIqiwLqTKXVpKR0ZxG8ATdQSBTRo1QnFm1aluxf_RR7FG3O44op06TZ0HINk4DVYWXDc4OSK41kNwZSLqlrAbDj9deJ3AI8W-1aVatcFF4UVDkcovgTu-FG8VcBJi6qDGLgA38c6D5cKSen3gmcSzCuoA5ghdrLxVhC2v_bNh7_yT9CLanb8fiYH_y4T5cztHL_aX8Dmx1y5V5ABfVt27eLh_6lfsDFYRDxw |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=MiPepid%3A+MicroPeptide+identification+tool+using+machine+learning&rft.jtitle=BMC+bioinformatics&rft.au=Zhu%2C+Mengmeng&rft.au=Gribskov%2C+Michael&rft.date=2019-11-08&rft.pub=BioMed+Central+Ltd&rft.issn=1471-2105&rft.eissn=1471-2105&rft.volume=20&rft.issue=1&rft_id=info:doi/10.1186%2Fs12859-019-3033-9&rft.externalDocID=A607356964 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon |