State Ensemble Energy Recognition (SEER): A Hybrid Gas-Phase Molecular Charge State Predictor
Saved in:
| Title: | State Ensemble Energy Recognition (SEER): A Hybrid Gas-Phase Molecular Charge State Predictor |
|---|---|
| Authors: | Mithony Keng, Kenneth M. Merz |
| Publication Year: | 2025 |
| Subject Terms: | Biophysics, Biochemistry, Medicine, Space Science, Biological Sciences not elsewhere classified, Chemical Sciences not elsewhere classified, Physical Sciences not elsewhere classified, Information Systems not elsewhere classified, warrants additional protonation, relative energy scoring, quantum mechanical optimizations, physiochemical properties relating, overall average number, major adverse impact, improved analyte identification, https :// github, historically achieved success, google colab platform, experimental mass spectrometry, equilibrium charge states, correct charge states, correct charge state, assigning equilibrium structures, analyte impurity testing, achieve higher throughput, new computational software, modeling large systems, derived ccs values, additional user programming, r b |
| Description: | Accurately resolving a three-dimensional structure that corresponds to an experimental mass spectrometry (MS) result is valuable for outcomes such as improved analyte identification, determination of physiochemical properties relating to conformation, analyte impurity testing, and drug chemical integrity analysis. Computational approaches utilizing charge state modeling, conformational sampling, quantum mechanical optimizations, relative energy scoring, and computed ion-neutral collision cross sections (CCS) have historically achieved success at assigning equilibrium structures to ion-mobility MS-derived CCS values. Despite this positive status, there remains a lack of new computational software to achieve higher throughput when modeling large systems. A major adverse impact on computational cost is the general increase in titratable sites with molecular size, which then warrants additional protonation/deprotonation models in order to ensure that the correct charge state is captured. Here, we introduce a user-friendly machine learning program called SEER ( S tate E nsemble E nergy R ecognition) to accurately and efficiently predict the equilibrium charge states of MS-relevant ions. We report that for all systems within the test set, SEER successfully captured the lowest relative energy minimum charge states within its top two predicted candidates from an overall average number of ∼ seven titratable sites. Furthermore, the density functional theory optimized geometries for SEER assigned charge states produced CCS experimental errors that are within the acceptable threshold (i.e., ≤3% error) set for this work. The benchmark study compared SEER to two well-established charge state prediction software packages CREST and Epik classic and found that SEER is either on par or better at consistently locating the correct charge states for the test set with competitive efficiency. SEER requires no additional user programming and is readily accessible through the Google Colab platform at https://github.com/mitkeng/SEER. |
| Document Type: | article in journal/newspaper |
| Language: | unknown |
| DOI: | 10.1021/acs.jcim.5c00980.s001 |
| Availability: | https://doi.org/10.1021/acs.jcim.5c00980.s001 https://figshare.com/articles/journal_contribution/State_Ensemble_Energy_Recognition_SEER_A_Hybrid_Gas-Phase_Molecular_Charge_State_Predictor/29507329 |
| Rights: | CC BY-NC 4.0 |
| Accession Number: | edsbas.D2F2DBE9 |
| Database: | BASE |
| Abstract: | Accurately resolving a three-dimensional structure that corresponds to an experimental mass spectrometry (MS) result is valuable for outcomes such as improved analyte identification, determination of physiochemical properties relating to conformation, analyte impurity testing, and drug chemical integrity analysis. Computational approaches utilizing charge state modeling, conformational sampling, quantum mechanical optimizations, relative energy scoring, and computed ion-neutral collision cross sections (CCS) have historically achieved success at assigning equilibrium structures to ion-mobility MS-derived CCS values. Despite this positive status, there remains a lack of new computational software to achieve higher throughput when modeling large systems. A major adverse impact on computational cost is the general increase in titratable sites with molecular size, which then warrants additional protonation/deprotonation models in order to ensure that the correct charge state is captured. Here, we introduce a user-friendly machine learning program called SEER ( S tate E nsemble E nergy R ecognition) to accurately and efficiently predict the equilibrium charge states of MS-relevant ions. We report that for all systems within the test set, SEER successfully captured the lowest relative energy minimum charge states within its top two predicted candidates from an overall average number of ∼ seven titratable sites. Furthermore, the density functional theory optimized geometries for SEER assigned charge states produced CCS experimental errors that are within the acceptable threshold (i.e., ≤3% error) set for this work. The benchmark study compared SEER to two well-established charge state prediction software packages CREST and Epik classic and found that SEER is either on par or better at consistently locating the correct charge states for the test set with competitive efficiency. SEER requires no additional user programming and is readily accessible through the Google Colab platform at https://github.com/mitkeng/SEER. |
|---|---|
| DOI: | 10.1021/acs.jcim.5c00980.s001 |
Nájsť tento článok vo Web of Science