State Ensemble Energy Recognition (SEER): A Hybrid Gas-Phase Molecular Charge State Predictor

Gespeichert in:
Bibliographische Detailangaben
Titel: State Ensemble Energy Recognition (SEER): A Hybrid Gas-Phase Molecular Charge State Predictor
Autoren: Mithony Keng, Kenneth M. Merz
Publikationsjahr: 2025
Schlagwörter: Biophysics, Biochemistry, Medicine, Space Science, Biological Sciences not elsewhere classified, Chemical Sciences not elsewhere classified, Physical Sciences not elsewhere classified, Information Systems not elsewhere classified, warrants additional protonation, relative energy scoring, quantum mechanical optimizations, physiochemical properties relating, overall average number, major adverse impact, improved analyte identification, https :// github, historically achieved success, google colab platform, experimental mass spectrometry, equilibrium charge states, correct charge states, correct charge state, assigning equilibrium structures, analyte impurity testing, achieve higher throughput, new computational software, modeling large systems, derived ccs values, additional user programming, r
Beschreibung: Accurately resolving a three-dimensional structure that corresponds to an experimental mass spectrometry (MS) result is valuable for outcomes such as improved analyte identification, determination of physiochemical properties relating to conformation, analyte impurity testing, and drug chemical integrity analysis. Computational approaches utilizing charge state modeling, conformational sampling, quantum mechanical optimizations, relative energy scoring, and computed ion-neutral collision cross sections (CCS) have historically achieved success at assigning equilibrium structures to ion-mobility MS-derived CCS values. Despite this positive status, there remains a lack of new computational software to achieve higher throughput when modeling large systems. A major adverse impact on computational cost is the general increase in titratable sites with molecular size, which then warrants additional protonation/deprotonation models in order to ensure that the correct charge state is captured. Here, we introduce a user-friendly machine learning program called SEER ( S tate E nsemble E nergy R ecognition) to accurately and efficiently predict the equilibrium charge states of MS-relevant ions. We report that for all systems within the test set, SEER successfully captured the lowest relative energy minimum charge states within its top two predicted candidates from an overall average number of ∼ seven titratable sites. Furthermore, the density functional theory optimized geometries for SEER assigned charge states produced CCS experimental errors that are within the acceptable threshold (i.e., ≤3% error) set for this work. The benchmark study compared SEER to two well-established charge state prediction software packages CREST and Epik classic and found that SEER is either on par or better at consistently locating the correct charge states for the test set with competitive efficiency. SEER requires no additional user programming and is readily accessible through the Google Colab platform at https://github.com/mitkeng/SEER.
Publikationsart: article in journal/newspaper
Sprache: unknown
DOI: 10.1021/acs.jcim.5c00980.s001
Verfügbarkeit: https://doi.org/10.1021/acs.jcim.5c00980.s001
https://figshare.com/articles/journal_contribution/State_Ensemble_Energy_Recognition_SEER_A_Hybrid_Gas-Phase_Molecular_Charge_State_Predictor/29507329
Rights: CC BY-NC 4.0
Dokumentencode: edsbas.D2F2DBE9
Datenbank: BASE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://doi.org/10.1021/acs.jcim.5c00980.s001#
    Name: EDS - BASE (s4221598)
    Category: fullText
    Text: View record from BASE
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Keng%20M
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsbas
DbLabel: BASE
An: edsbas.D2F2DBE9
RelevancyScore: 997
AccessLevel: 3
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 996.707214355469
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: State Ensemble Energy Recognition (SEER): A Hybrid Gas-Phase Molecular Charge State Predictor
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Mithony+Keng%22">Mithony Keng</searchLink><br /><searchLink fieldCode="AR" term="%22Kenneth+M%2E+Merz%22">Kenneth M. Merz</searchLink>
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2025
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Biophysics%22">Biophysics</searchLink><br /><searchLink fieldCode="DE" term="%22Biochemistry%22">Biochemistry</searchLink><br /><searchLink fieldCode="DE" term="%22Medicine%22">Medicine</searchLink><br /><searchLink fieldCode="DE" term="%22Space+Science%22">Space Science</searchLink><br /><searchLink fieldCode="DE" term="%22Biological+Sciences+not+elsewhere+classified%22">Biological Sciences not elsewhere classified</searchLink><br /><searchLink fieldCode="DE" term="%22Chemical+Sciences+not+elsewhere+classified%22">Chemical Sciences not elsewhere classified</searchLink><br /><searchLink fieldCode="DE" term="%22Physical+Sciences+not+elsewhere+classified%22">Physical Sciences not elsewhere classified</searchLink><br /><searchLink fieldCode="DE" term="%22Information+Systems+not+elsewhere+classified%22">Information Systems not elsewhere classified</searchLink><br /><searchLink fieldCode="DE" term="%22warrants+additional+protonation%22">warrants additional protonation</searchLink><br /><searchLink fieldCode="DE" term="%22relative+energy+scoring%22">relative energy scoring</searchLink><br /><searchLink fieldCode="DE" term="%22quantum+mechanical+optimizations%22">quantum mechanical optimizations</searchLink><br /><searchLink fieldCode="DE" term="%22physiochemical+properties+relating%22">physiochemical properties relating</searchLink><br /><searchLink fieldCode="DE" term="%22overall+average+number%22">overall average number</searchLink><br /><searchLink fieldCode="DE" term="%22major+adverse+impact%22">major adverse impact</searchLink><br /><searchLink fieldCode="DE" term="%22improved+analyte+identification%22">improved analyte identification</searchLink><br /><searchLink fieldCode="DE" term="%22https+%3A%2F%2F+github%22">https :// github</searchLink><br /><searchLink fieldCode="DE" term="%22historically+achieved+success%22">historically achieved success</searchLink><br /><searchLink fieldCode="DE" term="%22google+colab+platform%22">google colab platform</searchLink><br /><searchLink fieldCode="DE" term="%22experimental+mass+spectrometry%22">experimental mass spectrometry</searchLink><br /><searchLink fieldCode="DE" term="%22equilibrium+charge+states%22">equilibrium charge states</searchLink><br /><searchLink fieldCode="DE" term="%22correct+charge+states%22">correct charge states</searchLink><br /><searchLink fieldCode="DE" term="%22correct+charge+state%22">correct charge state</searchLink><br /><searchLink fieldCode="DE" term="%22assigning+equilibrium+structures%22">assigning equilibrium structures</searchLink><br /><searchLink fieldCode="DE" term="%22analyte+impurity+testing%22">analyte impurity testing</searchLink><br /><searchLink fieldCode="DE" term="%22achieve+higher+throughput%22">achieve higher throughput</searchLink><br /><searchLink fieldCode="DE" term="%22new+computational+software%22">new computational software</searchLink><br /><searchLink fieldCode="DE" term="%22modeling+large+systems%22">modeling large systems</searchLink><br /><searchLink fieldCode="DE" term="%22derived+ccs+values%22">derived ccs values</searchLink><br /><searchLink fieldCode="DE" term="%22additional+user+programming%22">additional user programming</searchLink><br /><searchLink fieldCode="DE" term="%22r+<%2F+b%22">r </ b</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: Accurately resolving a three-dimensional structure that corresponds to an experimental mass spectrometry (MS) result is valuable for outcomes such as improved analyte identification, determination of physiochemical properties relating to conformation, analyte impurity testing, and drug chemical integrity analysis. Computational approaches utilizing charge state modeling, conformational sampling, quantum mechanical optimizations, relative energy scoring, and computed ion-neutral collision cross sections (CCS) have historically achieved success at assigning equilibrium structures to ion-mobility MS-derived CCS values. Despite this positive status, there remains a lack of new computational software to achieve higher throughput when modeling large systems. A major adverse impact on computational cost is the general increase in titratable sites with molecular size, which then warrants additional protonation/deprotonation models in order to ensure that the correct charge state is captured. Here, we introduce a user-friendly machine learning program called SEER ( S tate E nsemble E nergy R ecognition) to accurately and efficiently predict the equilibrium charge states of MS-relevant ions. We report that for all systems within the test set, SEER successfully captured the lowest relative energy minimum charge states within its top two predicted candidates from an overall average number of ∼ seven titratable sites. Furthermore, the density functional theory optimized geometries for SEER assigned charge states produced CCS experimental errors that are within the acceptable threshold (i.e., ≤3% error) set for this work. The benchmark study compared SEER to two well-established charge state prediction software packages CREST and Epik classic and found that SEER is either on par or better at consistently locating the correct charge states for the test set with competitive efficiency. SEER requires no additional user programming and is readily accessible through the Google Colab platform at https://github.com/mitkeng/SEER.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: article in journal/newspaper
– Name: Language
  Label: Language
  Group: Lang
  Data: unknown
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.1021/acs.jcim.5c00980.s001
– Name: URL
  Label: Availability
  Group: URL
  Data: https://doi.org/10.1021/acs.jcim.5c00980.s001<br />https://figshare.com/articles/journal_contribution/State_Ensemble_Energy_Recognition_SEER_A_Hybrid_Gas-Phase_Molecular_Charge_State_Predictor/29507329
– Name: Copyright
  Label: Rights
  Group: Cpyrght
  Data: CC BY-NC 4.0
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsbas.D2F2DBE9
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.D2F2DBE9
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1021/acs.jcim.5c00980.s001
    Languages:
      – Text: unknown
    Subjects:
      – SubjectFull: Biophysics
        Type: general
      – SubjectFull: Biochemistry
        Type: general
      – SubjectFull: Medicine
        Type: general
      – SubjectFull: Space Science
        Type: general
      – SubjectFull: Biological Sciences not elsewhere classified
        Type: general
      – SubjectFull: Chemical Sciences not elsewhere classified
        Type: general
      – SubjectFull: Physical Sciences not elsewhere classified
        Type: general
      – SubjectFull: Information Systems not elsewhere classified
        Type: general
      – SubjectFull: warrants additional protonation
        Type: general
      – SubjectFull: relative energy scoring
        Type: general
      – SubjectFull: quantum mechanical optimizations
        Type: general
      – SubjectFull: physiochemical properties relating
        Type: general
      – SubjectFull: overall average number
        Type: general
      – SubjectFull: major adverse impact
        Type: general
      – SubjectFull: improved analyte identification
        Type: general
      – SubjectFull: https :// github
        Type: general
      – SubjectFull: historically achieved success
        Type: general
      – SubjectFull: google colab platform
        Type: general
      – SubjectFull: experimental mass spectrometry
        Type: general
      – SubjectFull: equilibrium charge states
        Type: general
      – SubjectFull: correct charge states
        Type: general
      – SubjectFull: correct charge state
        Type: general
      – SubjectFull: assigning equilibrium structures
        Type: general
      – SubjectFull: analyte impurity testing
        Type: general
      – SubjectFull: achieve higher throughput
        Type: general
      – SubjectFull: new computational software
        Type: general
      – SubjectFull: modeling large systems
        Type: general
      – SubjectFull: derived ccs values
        Type: general
      – SubjectFull: additional user programming
        Type: general
      – SubjectFull: r </ b
        Type: general
    Titles:
      – TitleFull: State Ensemble Energy Recognition (SEER): A Hybrid Gas-Phase Molecular Charge State Predictor
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Mithony Keng
      – PersonEntity:
          Name:
            NameFull: Kenneth M. Merz
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Type: published
              Y: 2025
          Identifiers:
            – Type: issn-locals
              Value: edsbas
            – Type: issn-locals
              Value: edsbas.oa
ResultId 1