MiPepid: MicroPeptide identification tool using machine learning

Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micro...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:BMC bioinformatics Ročník 20; číslo 1; s. 559 - 11
Hlavní autori: Zhu, Mengmeng, Gribskov, Michael
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: London BioMed Central 08.11.2019
BioMed Central Ltd
BMC
Predmet:
ISSN:1471-2105, 1471-2105
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which “normal-sized” proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on “normal” proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. Results In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid . Conclusions MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast.
AbstractList Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. Results In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at Conclusions MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast. Keywords: Micropeptide, Small ORF, sORF, smORF, Coding, Noncoding, lncRNA, Machine learning
Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid. MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast.
Abstract Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which “normal-sized” proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on “normal” proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. Results In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid. Conclusions MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast.
Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which “normal-sized” proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on “normal” proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. Results In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid . Conclusions MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast.
Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins. In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid. MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast.
Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins.BACKGROUNDMicropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to technical difficulties, as few small peptides had been experimentally confirmed. In the past decade, a growing number of micropeptides have been shown to play significant roles in vital biological activities. Despite the increased amount of data, we still lack bioinformatics tools for specifically identifying micropeptides from DNA sequences. Indeed, most existing tools for classifying coding and noncoding ORFs were built on datasets in which "normal-sized" proteins were considered to be positives and short ORFs were generally considered to be noncoding. Since the functional and biophysical constraints on small peptides are likely to be different from those on "normal" proteins, methods for predicting short translated ORFs must be trained independently from those for longer proteins.In this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid.RESULTSIn this study, we have developed MiPepid, a machine-learning tool specifically for the identification of micropeptides. We trained MiPepid using carefully cleaned data from existing databases and used logistic regression with 4-mer features. With only the sequence information of an ORF, MiPepid is able to predict whether it encodes a micropeptide with 96% accuracy on a blind dataset of high-confidence micropeptides, and to correctly classify newly discovered micropeptides not included in either the training or the blind test data. Compared with state-of-the-art coding potential prediction methods, MiPepid performs exceptionally well, as other methods incorrectly classify most bona fide micropeptides as noncoding. MiPepid is alignment-free and runs sufficiently fast for genome-scale analyses. It is easy to use and is available at https://github.com/MindAI/MiPepid.MiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast.CONCLUSIONSMiPepid was developed to specifically predict micropeptides, a category of proteins with increasing significance, from DNA sequences. It shows evident advantages over existing coding potential prediction methods on micropeptide identification. It is ready to use and runs fast.
ArticleNumber 559
Audience Academic
Author Gribskov, Michael
Zhu, Mengmeng
Author_xml – sequence: 1
  givenname: Mengmeng
  surname: Zhu
  fullname: Zhu, Mengmeng
  organization: Department of Statistics, Purdue University, Department of Biological Sciences, Purdue University
– sequence: 2
  givenname: Michael
  orcidid: 0000-0002-1718-0242
  surname: Gribskov
  fullname: Gribskov, Michael
  email: mgribsko@purdue.edu
  organization: Department of Biological Sciences, Purdue University
BackLink https://www.ncbi.nlm.nih.gov/pubmed/31703551$$D View this record in MEDLINE/PubMed
BookMark eNp9kktv1DAUhS1URNuBH8AGRWIDixQ7fiRmgagqHiO1AvFYW44fqauMPdgJgn_PnaZUHYSqyIp1_Z3je-VzjA5iig6hpwSfENKJV4U0HZc1JrKmmNJaPkBHhLWkbgjmB3f2h-i4lCuMSdth_ggdUtJiyjk5Qm8vwme3DfZ1dRFMTrCfgnUVrDgFH4yeQorVlNJYzSXEodpocxmiq0anc4TCY_TQ67G4Jzf_Ffr-_t23s4_1-acP67PT89qIlk-15NZ4w7yXVlLhbMOshx467iXuvaDSG9I7hh1rGyMFET2ThLmOOKxbzR1dofXia5O-UtscNjr_VkkHdV1IeVA6T8GMTvVGEGNt27e2B4tedx3tDWkEYdxZKcDrzeK1nfuNswZmzXrcM90_ieFSDemnEh1rCKNg8OLGIKcfsyuT2oRi3Djq6NJcVEMJpYILeJUVer6gg4bWQvQJHM0OV6cCt5QLKRhQJ_-h4LNuEwy8ug9Q3xO83BMAM7lf06DnUtT665d99tndcW_n_JsCAMgCQAJKyc7fIgSrXdLUkjQFSVO7pCkJmvYfjQnTdVqg8zDeq2wWZYFb4uCyukpzjhCee0R_ALBj5cc
CitedBy_id crossref_primary_10_1016_j_ymeth_2022_12_003
crossref_primary_10_1186_s12935_024_03446_7
crossref_primary_10_1007_s12539_021_00464_1
crossref_primary_10_1016_j_gene_2024_148817
crossref_primary_10_1093_bfgp_elae018
crossref_primary_10_1093_nar_gkab816
crossref_primary_10_1093_bib_bbad352
crossref_primary_10_1093_jxb_eraf240
crossref_primary_10_3390_cells13191645
crossref_primary_10_48130_GR_2023_0026
crossref_primary_10_1093_bioinformatics_btaf250
crossref_primary_10_3390_cancers17172932
crossref_primary_10_1111_febs_15769
crossref_primary_10_1371_journal_pone_0320314
crossref_primary_10_1007_s12672_025_01945_1
crossref_primary_10_1094_MPMI_34_2
crossref_primary_10_32604_cmc_2022_023848
crossref_primary_10_1109_ACCESS_2024_3419766
crossref_primary_10_1111_imm_13850
crossref_primary_10_1007_s42994_023_00100_0
crossref_primary_10_3389_fmicb_2024_1304044
crossref_primary_10_1007_s12539_024_00661_8
crossref_primary_10_1016_j_chom_2020_11_002
crossref_primary_10_1016_j_ijbiomac_2023_124952
crossref_primary_10_1186_s13045_024_01591_0
crossref_primary_10_1016_j_tig_2024_09_004
crossref_primary_10_2174_1574893615999200811130522
crossref_primary_10_1111_mpp_70084
crossref_primary_10_3390_biology10050371
crossref_primary_10_1007_s12539_023_00552_4
crossref_primary_10_1016_j_xplc_2025_101437
crossref_primary_10_1080_07391102_2023_2235605
crossref_primary_10_1094_MPMI_06_20_0160_R
crossref_primary_10_1016_j_chemolab_2022_104490
crossref_primary_10_1161_ATVBAHA_120_314222
crossref_primary_10_1093_bib_bbae147
crossref_primary_10_1093_bib_bbae345
crossref_primary_10_3390_proteomes12030026
crossref_primary_10_52586_4943
crossref_primary_10_3390_ijms241310562
crossref_primary_10_3389_fgene_2024_1439423
crossref_primary_10_3390_pharmaceutics16111486
crossref_primary_10_1038_s43586_023_00205_2
crossref_primary_10_3389_fpls_2022_975938
crossref_primary_10_1007_s13205_022_03147_w
crossref_primary_10_1016_j_compbiomed_2023_106773
crossref_primary_10_1186_s12929_022_00802_5
crossref_primary_10_3390_ijms26083651
crossref_primary_10_1002_cbic_202100534
crossref_primary_10_1016_j_pharmr_2025_100065
crossref_primary_10_1016_j_virusres_2020_198104
crossref_primary_10_1093_bib_bbac392
crossref_primary_10_1016_j_canlet_2022_215723
crossref_primary_10_1016_j_isci_2023_106781
crossref_primary_10_3390_a15080274
crossref_primary_10_1016_j_canlet_2020_10_038
crossref_primary_10_3390_plants12233951
crossref_primary_10_1109_TCBB_2021_3104288
crossref_primary_10_3390_cancers16152660
crossref_primary_10_1007_s00018_020_03740_3
crossref_primary_10_1186_s12943_025_02278_x
crossref_primary_10_1093_bib_bbac108
crossref_primary_10_1016_j_yexcr_2020_111923
crossref_primary_10_1093_hr_uhac043
crossref_primary_10_1016_j_compbiomed_2022_106440
Cites_doi 10.7554/eLife.03523
10.3389/fphys.2017.00230
10.1093/bioinformatics/btp688
10.3389/fmicb.2013.00269
10.1084/jem.183.3.1131
10.1093/nar/gkx428
10.1186/1471-2164-14-648
10.3389/fgene.2018.00144
10.1093/nar/gkp335
10.1146/annurev-cellbio-100616-060516
10.1186/s13059-015-0742-x
10.1126/science.1085650
10.1126/scisignal.aaj1460
10.1038/nrg3645
10.1126/science.aad4076
10.1016/j.cell.2016.02.066
10.1093/bib/bbx005
10.1126/science.1238802
10.1186/1471-2105-15-36
10.1002/embj.201488411
10.1093/nar/gkm391
10.1093/nar/gkv1175
10.1186/1471-2105-6-262
10.1002/embj.201488303
10.1021/acs.jproteome.7b00707
10.1093/nar/gkt1059
10.1016/j.cell.2015.01.009
10.1016/j.celrep.2018.05.058
10.1186/1471-2105-9-192
10.1093/nar/gkt006
10.1093/nar/gkt646
10.1016/j.tcb.2017.04.006
10.1016/j.cell.2013.06.009
10.1093/bioinformatics/btr209
10.1126/science.aam9361
10.1016/j.molcel.2017.09.015
10.1093/database/bas008
10.1101/gr.080531.108
10.7554/eLife.13328
10.1038/nrm.2017.58
10.1126/science.1168978
10.7554/eLife.08890
10.1093/nar/gkx1130
10.1038/nrg.2016.119
10.1016/j.cmet.2015.02.009
10.1093/nar/gkx1098
10.1093/nar/gku989
10.1038/nature21034
ContentType Journal Article
Copyright The Author(s). 2019
COPYRIGHT 2019 BioMed Central Ltd.
Copyright_xml – notice: The Author(s). 2019
– notice: COPYRIGHT 2019 BioMed Central Ltd.
DBID C6C
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISR
7X8
5PM
DOA
DOI 10.1186/s12859-019-3033-9
DatabaseName Springer Nature OA Free Journals
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Science
MEDLINE - Academic
PubMed Central (Full Participant titles)
Open Access: DOAJ - Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList




MEDLINE
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1471-2105
EndPage 11
ExternalDocumentID oai_doaj_org_article_bc61cdd7b7db4e8ba883bc126145ed96
PMC6842143
A607356964
31703551
10_1186_s12859_019_3033_9
Genre Journal Article
GroupedDBID ---
0R~
23N
2WC
53G
5VS
6J9
7X7
88E
8AO
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AAKPC
AASML
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADMLS
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
ARAPS
AZQEC
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CS3
DIK
DU5
DWQXO
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
ICD
IHR
INH
INR
ISR
ITC
K6V
K7-
KQ8
LK8
M1P
M48
M7P
MK~
ML0
M~E
O5R
O5S
OK1
OVT
P2P
P62
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PUEGO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XH6
XSB
AAYXX
AFFHD
CITATION
ALIPV
CGR
CUY
CVF
ECM
EIF
NPM
7X8
5PM
ID FETCH-LOGICAL-c675t-95dcfc4ff9d936ed24df70385f90bf639fc1be40e472c9616b4914e81e0a7a5e3
IEDL.DBID RSV
ISICitedReferencesCount 70
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000496277900007&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1471-2105
IngestDate Fri Oct 03 12:52:57 EDT 2025
Tue Nov 04 01:44:56 EST 2025
Thu Sep 04 19:05:56 EDT 2025
Tue Nov 11 10:23:31 EST 2025
Tue Nov 04 17:55:52 EST 2025
Thu Nov 13 15:23:13 EST 2025
Thu Apr 03 07:02:33 EDT 2025
Sat Nov 29 05:40:04 EST 2025
Tue Nov 18 21:11:07 EST 2025
Sat Sep 06 07:27:26 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Small ORF
Noncoding
lncRNA
sORF
Coding
Machine learning
smORF
Micropeptide
Language English
License Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c675t-95dcfc4ff9d936ed24df70385f90bf639fc1be40e472c9616b4914e81e0a7a5e3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0002-1718-0242
OpenAccessLink https://link.springer.com/10.1186/s12859-019-3033-9
PMID 31703551
PQID 2313365630
PQPubID 23479
PageCount 11
ParticipantIDs doaj_primary_oai_doaj_org_article_bc61cdd7b7db4e8ba883bc126145ed96
pubmedcentral_primary_oai_pubmedcentral_nih_gov_6842143
proquest_miscellaneous_2313365630
gale_infotracmisc_A607356964
gale_infotracacademiconefile_A607356964
gale_incontextgauss_ISR_A607356964
pubmed_primary_31703551
crossref_primary_10_1186_s12859_019_3033_9
crossref_citationtrail_10_1186_s12859_019_3033_9
springer_journals_10_1186_s12859_019_3033_9
PublicationCentury 2000
PublicationDate 2019-11-08
PublicationDateYYYYMMDD 2019-11-08
PublicationDate_xml – month: 11
  year: 2019
  text: 2019-11-08
  day: 08
PublicationDecade 2010
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle BMC bioinformatics
PublicationTitleAbbrev BMC Bioinformatics
PublicationTitleAlternate BMC Bioinformatics
PublicationYear 2019
Publisher BioMed Central
BioMed Central Ltd
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: BMC
References 3033_CR18
L Sun (3033_CR27) 2013; 41
CA Makarewich (3033_CR1) 2017; 27
EG Magny (3033_CR8) 2013; 341
AA Bazzini (3033_CR23) 2014; 33
DR Zerbino (3033_CR34) 2018; 46
J-P Couso (3033_CR3) 2017; 18
C Lee (3033_CR9) 2015; 21
BR Nelson (3033_CR43) 2016; 351
MF Lin (3033_CR28) 2011; 27
DM Anderson (3033_CR6) 2015; 160
A Chugunova (3033_CR2) 2018; 17
Z Ji (3033_CR36) 2015; 4
TL Bailey (3033_CR40) 2009; 37
P Bi (3033_CR44) 2017; 356
H Zhang (3033_CR38) 2013; 4
Y-J Kang (3033_CR25) 2017; 45
J Ruiz-Orera (3033_CR35) 2014; 3
J Crappé (3033_CR22) 2013; 14
NT Ingolia (3033_CR15) 2009; 324
BY Chan (3033_CR41) 2005; 6
CA Makarewich (3033_CR42) 2018; 23
B Cai (3033_CR13) 2017; 8
L Wang (3033_CR26) 2013; 41
S Plaza (3033_CR47) 2017; 33
NT Ingolia (3033_CR17) 2016; 165
L Kong (3033_CR24) 2007; 35
RF Wang (3033_CR11) 1996; 183
K Hanada (3033_CR20) 2010; 26
SD Mackowiak (3033_CR21) 2015; 16
RA Harte (3033_CR32) 2012; 2012
UniProt Consortium (3033_CR30) 2015; 43
KD Pruitt (3033_CR33) 2009; 19
J-Z Huang (3033_CR46) 2017; 68
V Olexiouk (3033_CR4) 2018; 46
NT Ingolia (3033_CR14) 2014; 15
F Yeasmin (3033_CR12) 2018; 9
M Jiang (3033_CR39) 2008; 9
V Olexiouk (3033_CR5) 2016; 44
SM Cohen (3033_CR48) 2014; 33
Y Hao (3033_CR29) 2018; 19
M Guttman (3033_CR37) 2013; 154
JM Mudge (3033_CR16) 2016; 17
CM Farrell (3033_CR31) 2014; 42
A Matsumoto (3033_CR45) 2016; 541
SR Schwab (3033_CR10) 2003; 301
A Skarshewski (3033_CR19) 2014; 15
DM Anderson (3033_CR7) 2016; 9
References_xml – volume: 3
  start-page: e03523
  year: 2014
  ident: 3033_CR35
  publication-title: Elife
  doi: 10.7554/eLife.03523
– volume: 8
  start-page: 230
  year: 2017
  ident: 3033_CR13
  publication-title: Front Physiol
  doi: 10.3389/fphys.2017.00230
– volume: 26
  start-page: 399
  year: 2010
  ident: 3033_CR20
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btp688
– volume: 4
  start-page: 269
  year: 2013
  ident: 3033_CR38
  publication-title: Front Microbiol
  doi: 10.3389/fmicb.2013.00269
– volume: 183
  start-page: 1131 LP
  year: 1996
  ident: 3033_CR11
  publication-title: J Exp Med
  doi: 10.1084/jem.183.3.1131
– volume: 45
  start-page: W12
  year: 2017
  ident: 3033_CR25
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkx428
– volume: 14
  start-page: 648
  year: 2013
  ident: 3033_CR22
  publication-title: BMC Genomics
  doi: 10.1186/1471-2164-14-648
– volume: 9
  start-page: 144
  year: 2018
  ident: 3033_CR12
  publication-title: Front Genet
  doi: 10.3389/fgene.2018.00144
– volume: 37
  start-page: W202
  issue: suppl_2
  year: 2009
  ident: 3033_CR40
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkp335
– volume: 33
  start-page: 391
  year: 2017
  ident: 3033_CR47
  publication-title: Annu Rev Cell Dev Biol
  doi: 10.1146/annurev-cellbio-100616-060516
– volume: 16
  start-page: 179
  year: 2015
  ident: 3033_CR21
  publication-title: Genome Biol
  doi: 10.1186/s13059-015-0742-x
– volume: 301
  start-page: 1367 LP
  year: 2003
  ident: 3033_CR10
  publication-title: Science (80- )
  doi: 10.1126/science.1085650
– volume: 9
  start-page: ra119 LP
  year: 2016
  ident: 3033_CR7
  publication-title: Sci Signal
  doi: 10.1126/scisignal.aaj1460
– volume: 15
  start-page: 205
  year: 2014
  ident: 3033_CR14
  publication-title: Nat Rev Genet.
  doi: 10.1038/nrg3645
– volume: 351
  start-page: 271 LP
  year: 2016
  ident: 3033_CR43
  publication-title: Science (80- )
  doi: 10.1126/science.aad4076
– volume: 165
  start-page: 22
  year: 2016
  ident: 3033_CR17
  publication-title: Cell
  doi: 10.1016/j.cell.2016.02.066
– volume: 19
  start-page: 636
  year: 2018
  ident: 3033_CR29
  publication-title: Brief Bioinform
  doi: 10.1093/bib/bbx005
– volume: 341
  start-page: 1116 LP
  year: 2013
  ident: 3033_CR8
  publication-title: Science (80- )
  doi: 10.1126/science.1238802
– volume: 15
  start-page: 36
  year: 2014
  ident: 3033_CR19
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-15-36
– volume: 33
  start-page: 981 LP
  year: 2014
  ident: 3033_CR23
  publication-title: EMBO J
  doi: 10.1002/embj.201488411
– volume: 35
  start-page: W345
  issue: Web Server issu
  year: 2007
  ident: 3033_CR24
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkm391
– volume: 44
  start-page: D324
  year: 2016
  ident: 3033_CR5
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkv1175
– volume: 6
  start-page: 262
  year: 2005
  ident: 3033_CR41
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-6-262
– volume: 33
  start-page: 937 LP
  year: 2014
  ident: 3033_CR48
  publication-title: EMBO J
  doi: 10.1002/embj.201488303
– volume: 17
  start-page: 1
  year: 2018
  ident: 3033_CR2
  publication-title: J Proteome Res
  doi: 10.1021/acs.jproteome.7b00707
– volume: 42
  start-page: D865
  issue: Database issue
  year: 2014
  ident: 3033_CR31
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkt1059
– volume: 160
  start-page: 595
  year: 2015
  ident: 3033_CR6
  publication-title: Cell
  doi: 10.1016/j.cell.2015.01.009
– volume: 23
  start-page: 3701
  year: 2018
  ident: 3033_CR42
  publication-title: Cell Rep
  doi: 10.1016/j.celrep.2018.05.058
– volume: 9
  start-page: 192
  year: 2008
  ident: 3033_CR39
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-9-192
– volume: 41
  start-page: e74
  year: 2013
  ident: 3033_CR26
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkt006
– volume: 41
  start-page: e166
  year: 2013
  ident: 3033_CR27
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkt646
– volume: 27
  start-page: 685
  year: 2017
  ident: 3033_CR1
  publication-title: Trends Cell Biol
  doi: 10.1016/j.tcb.2017.04.006
– volume: 154
  start-page: 240
  year: 2013
  ident: 3033_CR37
  publication-title: Cell
  doi: 10.1016/j.cell.2013.06.009
– volume: 27
  start-page: i275
  year: 2011
  ident: 3033_CR28
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btr209
– volume: 356
  start-page: 323 LP
  year: 2017
  ident: 3033_CR44
  publication-title: Science (80- )
  doi: 10.1126/science.aam9361
– volume: 68
  start-page: 171
  year: 2017
  ident: 3033_CR46
  publication-title: Mol Cell
  doi: 10.1016/j.molcel.2017.09.015
– volume: 2012
  start-page: bas008
  year: 2012
  ident: 3033_CR32
  publication-title: Database (Oxford)
  doi: 10.1093/database/bas008
– volume: 19
  start-page: 1316
  year: 2009
  ident: 3033_CR33
  publication-title: Genome Res
  doi: 10.1101/gr.080531.108
– ident: 3033_CR18
  doi: 10.7554/eLife.13328
– volume: 18
  start-page: 575
  year: 2017
  ident: 3033_CR3
  publication-title: Nat Rev Mol Cell Biol
  doi: 10.1038/nrm.2017.58
– volume: 324
  start-page: 218 LP
  year: 2009
  ident: 3033_CR15
  publication-title: Science (80- )
  doi: 10.1126/science.1168978
– volume: 4
  start-page: e08890
  year: 2015
  ident: 3033_CR36
  publication-title: Elife
  doi: 10.7554/eLife.08890
– volume: 46
  start-page: D497
  year: 2018
  ident: 3033_CR4
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkx1130
– volume: 17
  start-page: 758
  year: 2016
  ident: 3033_CR16
  publication-title: Nat Rev Genet
  doi: 10.1038/nrg.2016.119
– volume: 21
  start-page: 443
  year: 2015
  ident: 3033_CR9
  publication-title: Cell Metab
  doi: 10.1016/j.cmet.2015.02.009
– volume: 46
  start-page: D754
  year: 2018
  ident: 3033_CR34
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkx1098
– volume: 43
  start-page: D204
  issue: Database issue
  year: 2015
  ident: 3033_CR30
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gku989
– volume: 541
  start-page: 228
  year: 2016
  ident: 3033_CR45
  publication-title: Nature
  doi: 10.1038/nature21034
SSID ssj0017805
Score 2.5618274
Snippet Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally...
Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to...
Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally ignored due to...
Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were traditionally...
Abstract Background Micropeptides are small proteins with length < = 100 amino acids. Short open reading frames that could produces micropeptides were...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 559
SubjectTerms Algorithms
Amino acids
Bioinformatics
Biomedical and Life Sciences
Coding
Computational biology
Computational Biology - methods
Computational Biology/Bioinformatics
Computer Appl. in Life Sciences
Databases, Protein
DNA
DNA sequencing
Genomics
Life Sciences
Machine Learning
Machine Learning and Artificial Intelligence in Bioinformatics
Microarrays
Micropeptide
Noncoding
Open Reading Frames - genetics
Peptides
Peptides - analysis
Proteins
Small ORF
smORF
Software
sORF
SummonAdditionalLinks – databaseName: Open Access: DOAJ - Directory of Open Access Journals
  dbid: DOA
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1bi9UwEA6yKPgi3q2uUkUQlLJNkyaNT67iouAuBy-wb6G5nS2s7bI9R_DfO5PmHLYr6ouPbabQfJkkXzrTbwh57kVNW8lVwQzjBbecF0YG_BJXVWWgUhoX1fU_yaOj5vhYLS6U-sKcsEkeeAJuz1hBrXPSSGe4b0zbNMxYCsSf196pKLYNrGdzmErxA1TqTzFM2oi9kaJOGxybMc7PWKFmu1AU6_99Sb6wJ13Ol7wUNI170cFNciORyHx_evlb5Irvb5NrU1nJn3fIm8Nu4c869zo_xHS7BSauOJ93LmUGxcHIV8NwmmPa-zL_HjMqfZ5KSCzvkm8H77---1CkSgmFBcK_KlTtbLA8BOUUE95V3AWJMb-gShOAhARLjeel57KySlBhuKKAJvVlK9vas3tkpx96_4DkKgjDBPNemobXrFW1BcvatnABXK_JSLlBTtskI47VLE51PE40Qk9gawBbI9haZeTl9pGzSUPjb8ZvcTi2hih_HW-AU-jkFPpfTpGRZziYGgUuesygWbbrcdQfv3zW-wIWtVoowTPyIhmFAXpg2_RDAuCAmlgzy92ZJcxAO2t-uvEZjU2Yttb7YT1qIM-MCZRgy8j9yYe2HQPiVgLZoxmRM--a9Xze0ncnUQAcY6fAczPyauOHOq0845-Bffg_gH1Erlc4i-J39V2yszpf-8fkqv2x6sbzJ3EO_gIW0jON
  priority: 102
  providerName: Directory of Open Access Journals
Title MiPepid: MicroPeptide identification tool using machine learning
URI https://link.springer.com/article/10.1186/s12859-019-3033-9
https://www.ncbi.nlm.nih.gov/pubmed/31703551
https://www.proquest.com/docview/2313365630
https://pubmed.ncbi.nlm.nih.gov/PMC6842143
https://doaj.org/article/bc61cdd7b7db4e8ba883bc126145ed96
Volume 20
WOSCitedRecordID wos000496277900007&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVADU
  databaseName: BioMedCentral
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: RBZ
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.biomedcentral.com/search/
  providerName: BioMedCentral
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: DOA
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: M~E
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVPQU
  databaseName: Advanced Technologies & Aerospace Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: P5Z
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/hightechjournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Biological Science Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: M7P
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/biologicalscijournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Computer Science Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: K7-
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/compscijour
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: 7X7
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: BENPR
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Publicly Available Content Database
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: PIMPY
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/publiccontent
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: SpringerLink Contemporary
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: RSV
  dateStart: 20001201
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3db9MwED-xDSRe-P4IjCogJCRQRFI7dswTG9rEBK2iDtDGixV_pFQaydS0SPz3-Fy3IuNDghdLiS8Pd7mzz76ffwZ4almeVZyKhChCE6opTRSvcSduOEzrjHNlPLv-ez4eFycnogznuLs12n1dkvQjtQ_rgr3sMuRac0tfrNUTkogt2HGzXYHRODn-tCkdIEl_KF_-9rPeBOR5-n8djX-aji5CJS_US_00dHj9vxS4AddC1hnvrdzkJlyyzS24srqH8vtteD2alfZ8Zl7FI8TnlYh0MTaemQAl8n8vXrTtWYw4-Wn81UMwbRzunJjegY-HBx_evE3C1QqJdiuERSJyo2tN61oYQZg1Q2pqjkXCWqSqdllLrTNlaWopH2rBMqaoyKgtMptWvMotuQvbTdvY-xCLminCiLVcFTQnlci1k8x15R5cclhEkK7tLXXgHcfrL86kX38UTK4MI51hJBpGigiebz45X5Fu_E14H3_iRhD5sv2Ldj6VIfyk0izTxnDFjXJqqKooiNKZWz7S3BrBIniCLiCREaNByM20WnadPDqeyD3mRsGcCUYjeBaE6tZpoKtwgsHZAUm0epK7PUkXsrrX_XjtaRK7EOfW2HbZSZdtE8KQsy2CeyvP2yjmMr3UZYdZBLznkz3N-z3N7ItnDMdiq0uMI3ix9kwZhqruz4Z98E_SD-HqEF3b77jvwvZivrSP4LL-tph18wFs8RPu22IAO_sH43Iy8Nsfrn3HkwFCbkvXlvln118ejcrTgQ_rH5poPto
linkProvider Springer Nature
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3db9MwED_BAMEL3x-BAQEhIYEiktqxY54YiGkTbVVtY9qbFX-VSiOZmhaJ_x6f61ZkfEjwmPj8cOc7--z7-WeAF5aVRc2pyIgiNKOa0kxxhydxg0HuCs6VCez6Qz4eVycnYhLvcXdrtPu6JBlm6hDWFXvTFci15re-WKsnJBMX4RL1Cxbi-A4OjzelAyTpj-XL33brLUCBp__X2fin5eg8VPJcvTQsQ7s3_kuBm3A9Zp3pzspNbsEF29yGK6t3KL_fgXej2cSezczbdIT4vAkiXYxNZyZCicLopYu2PU0RJz9NvwYIpk3jmxPTu_B59-PRh70sPq2Qab9DWGSiNNpp6pwwgjBrBtQ4jkVCJ3LlfNbidKEszS3lAy1YwRQVBbVVYfOa16Ul92CraRv7AFLhmCKMWMtVRUtSi1J7yVLX_sMnh1UC-dreUkfecXz-4lSG_UfF5Mow0htGomGkSODVpsvZinTjb8LvcRA3gsiXHX6086mM4SeVZoU2hitulFdD1VVFlC789pGW1giWwHN0AYmMGA1Cbqb1suvk_uGB3GF-FiyZYDSBl1HItV4DXccbDN4OSKLVk9zuSfqQ1b3mZ2tPk9iEOLfGtstO-mybEIacbQncX3neRjGf6eU-OywS4D2f7Gneb2lmXwJjOBZbfWKcwOu1Z8o4VXV_NuzDf5J-Clf3jkZDOdwff3oE1wbo5uH0fRu2FvOlfQyX9bfFrJs_CeH6A1bZOOk
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1Zb9QwEB5BgYoX7iNQICAkJFDUZO3YMU-UY0VFu1pRQH2z4mu7UklWmywS_x5P4l2RckiIx8Tjh5mM7RnPl28AnlqWZyWnIiGK0IRqShPFHd7EjUapyzhXpmPXP-CTSXF8LKahz2mzRruvS5L9Pw3I0lS1uwvj-iVesN0mQ941nwZj3Z6QRJyHCxR7BmG6fvRlU0ZAwv5QyvzttMFh1HH2_7oz_3Q0nYVNnqmddkfS-Op_K3MNroRoNN7r3ec6nLPVDbjU96f8fhNeHc6ndjE3L-NDxO1NEQFjbDw3AWLUfdW4revTGPHzs_hrB820cehFMbsFn8fvPr15n4SWC4n2mUObiNxop6lzwgjCrBlR4zgWD51IlfPRjNOZsjS1lI-0YBlTVGTUFplNS17mltyGraqu7F2IhWOKMGItVwXNSSly7SVzXfoHHzQWEaRr20sd-MixLcap7PKSgsneMNIbRqJhpIjg-WbKoifj-Jvwa_ygG0Hk0e5e1MuZDMtSKs0ybQxX3CivhiqLgiid-bSS5tYIFsETdAeJTBkVQnFm5app5P7RR7nH_O6YM8FoBM-CkKu9BroMfzZ4OyC51kByZyDpl7IeDD9ee53EIcS_VbZeNdJH4YQw5HKL4E7vhRvFfASY-qgxi4AP_HOg-XCkmp90TOJYhPUBcwQv1l4qwxbW_Nmw9_5J-hFsT9-O5cH-5MN9uDxCL-8u5Xdgq12u7AO4qL-182b5sFu5PwDXvkHN
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=MiPepid%3A+MicroPeptide+identification+tool+using+machine+learning&rft.jtitle=BMC+bioinformatics&rft.au=Zhu%2C+Mengmeng&rft.au=Gribskov%2C+Michael&rft.date=2019-11-08&rft.eissn=1471-2105&rft.volume=20&rft.issue=1&rft.spage=559&rft_id=info:doi/10.1186%2Fs12859-019-3033-9&rft_id=info%3Apmid%2F31703551&rft.externalDocID=31703551
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon