An algorithm for peptide de novo sequencing from a group of SILAC labeled MS/MS spectra


Saved in:
Bibliographic details
Published in: Journal of bioinformatics and computational biology, Vol. 23, No. 3, p. 2550007
Main authors: Han, Fang; Zhang, Kaizhong
Format: Journal Article
Language: English
Published: Singapore, 1 June 2025
Subjects:
ISSN: 1757-6334
Online access: More information
Abstract Shotgun proteomics coupled with high-performance liquid chromatography and mass spectrometry has been instrumental in identifying proteins in complex mixtures. Effective computational approaches are required to automate spectrum interpretation and handle the vast amount of data collected in a single Liquid Chromatography-Tandem Mass Spectrometry (LC-MS/MS) run. De novo sequencing from MS/MS spectra has emerged as a vital technology for peptide sequencing in proteomics. To enhance the accuracy and practicality of de novo sequencing, previous algorithms have used multiple spectra to identify peptide sequences. Our study focuses on de novo sequencing from multiple tandem mass spectra of peptides labeled by stable isotope labeling with amino acids in cell culture (SILAC), a technique that incorporates different isotope-labeled amino acids into newly synthesized proteins. When SILAC samples are processed by LC-MS/MS shotgun proteomics, the spectrometer produces multiple MS/MS spectra for the same peptide sequence. Taking into account factors such as retention time and precursor ion mass, we aim to identify the peptide sequence together with its specific SILAC modifications and their locations. To do so, we propose de novo sequencing algorithms that compute candidate peptide sequences using similarity scores, followed by refinement algorithms that evaluate the candidates. We also test the algorithms on real experimental data.
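The abstract says that retention time and precursor ion mass are used to relate spectra of the same peptide. The record does not describe how, so the following is only a minimal sketch of that general idea, not the authors' algorithm. It assumes two common heavy labels (Lys8 and Arg10), neutral precursor masses as input, and hypothetical names and tolerances (`label_counts`, `pair_spectra`, `rt_window`, `tol`) chosen purely for illustration.

```python
from itertools import product

# Monoisotopic mass shifts (Da) of two common heavy SILAC labels.
# Which labels an experiment uses is an assumption, not taken from the paper.
LYS8_SHIFT = 8.014199    # 13C6 15N2 lysine
ARG10_SHIFT = 10.008269  # 13C6 15N4 arginine


def label_counts(light_mass, heavy_mass, max_sites=3, tol=0.01):
    """Return (n_lys, n_arg) pairs whose combined shift explains the mass gap."""
    delta = heavy_mass - light_mass
    hits = []
    for n_lys, n_arg in product(range(max_sites + 1), repeat=2):
        if n_lys + n_arg == 0:
            continue  # an unlabeled pair would not differ in mass
        shift = n_lys * LYS8_SHIFT + n_arg * ARG10_SHIFT
        if abs(shift - delta) <= tol:
            hits.append((n_lys, n_arg))
    return hits


def pair_spectra(spectra, rt_window=0.5, max_sites=3, tol=0.01):
    """Pair spectra that co-elute and differ by a plausible SILAC shift.

    `spectra` is a list of (spectrum_id, neutral_precursor_mass, retention_time_min).
    """
    ordered = sorted(spectra, key=lambda s: s[1])  # lightest precursor first
    pairs = []
    for i, (id_a, mass_a, rt_a) in enumerate(ordered):
        for id_b, mass_b, rt_b in ordered[i + 1:]:
            if abs(rt_a - rt_b) > rt_window:
                continue  # light and heavy forms are expected to co-elute
            for counts in label_counts(mass_a, mass_b, max_sites, tol):
                pairs.append((id_a, id_b, counts))
    return pairs
```

Each returned tuple names a candidate light/heavy spectrum pair and the number of labeled lysines and arginines that would explain the mass gap. A real pipeline would still need to score fragment ions to decide where in the sequence those residues sit.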
Author Han, Fang
Zhang, Kaizhong
Author_xml – sequence: 1
  givenname: Fang
  orcidid: 0009-0009-3874-3348
  surname: Han
  fullname: Han, Fang
  organization: Institute of Applied Mathematics, Hebei Academy of Science, 46 Youyi South Street, Shijiazhuang, Hebei Province, 050081, China
– sequence: 2
  givenname: Kaizhong
  orcidid: 0009-0006-5735-2949
  surname: Zhang
  fullname: Zhang, Kaizhong
  organization: Department of Computer Science, University of Western Ontario, 1151 Richmond Street, London, Ontario, N6A5B7, Canada
BackLink https://www.ncbi.nlm.nih.gov/pubmed/40618198$$D View this record in MEDLINE/PubMed
ContentType Journal Article
DOI 10.1142/S0219720025500076
EISSN 1757-6334
ExternalDocumentID 40618198
Genre Journal Article
ISICitedReferencesCount 0
ISSN 1757-6334
IsPeerReviewed true
IsScholarly true
Issue 3
Keywords mass spectrometry
SILAC
multiple MS/MS spectra
Bioinformatics
computational proteomics
de novo sequencing
Language English
ORCID 0009-0009-3874-3348
0009-0006-5735-2949
PMID 40618198
PublicationDate 20250600
PublicationDateYYYYMMDD 2025-06-01
PublicationDate_xml – month: 06
  year: 2025
  text: 20250600
PublicationDecade 2020
PublicationPlace Singapore
PublicationPlace_xml – name: Singapore
PublicationTitle Journal of bioinformatics and computational biology
PublicationTitleAlternate J Bioinform Comput Biol
PublicationYear 2025
StartPage 2550007
SubjectTerms Algorithms
Amino Acid Sequence
Chromatography, Liquid
Humans
Isotope Labeling - methods
Peptides - chemistry
Proteomics - methods
Sequence Analysis, Protein - methods
Tandem Mass Spectrometry - methods
Title An algorithm for peptide de novo sequencing from a group of SILAC labeled MS/MS spectra
URI https://www.ncbi.nlm.nih.gov/pubmed/40618198
https://www.proquest.com/docview/3227417223
Volume 23