UV-adVISor: Attention-Based Recurrent Neural Networks to Predict UV-Vis Spectra

Ultraviolet-visible (UV-Vis) absorption spectra are routinely collected as part of high-performance liquid chromatography (HPLC) analysis systems and can be used to identify chemical reaction products by comparison to the reference spectra. Here, we present UV-adVISor as a new computational tool for...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Analytical chemistry (Washington) Ročník 93; číslo 48; s. 16076
Hlavní autoři: Urbina, Fabio, Batra, Kushal, Luebke, Kevin J, White, Jason D, Matsiev, Daniel, Olson, Lori L, Malerich, Jeremiah P, Hupcey, Maggie A Z, Madrid, Peter B, Ekins, Sean
Médium: Journal Article
Jazyk:angličtina
Vydáno: United States 07.12.2021
Témata:
ISSN:1520-6882, 1520-6882
On-line přístup:Zjistit podrobnosti o přístupu
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Ultraviolet-visible (UV-Vis) absorption spectra are routinely collected as part of high-performance liquid chromatography (HPLC) analysis systems and can be used to identify chemical reaction products by comparison to the reference spectra. Here, we present UV-adVISor as a new computational tool for predicting the UV-Vis spectra from a molecule's structure alone. UV-Vis prediction was approached as a sequence-to-sequence problem. We utilized Long-Short Term Memory and attention-based neural networks with Extended Connectivity Fingerprint Diameter 6 or molecule SMILES to generate predictive models for the UV spectra. We have produced two spectrum datasets (dataset I, = 949, and dataset II, = 2222) using different compound collections and spectrum acquisition methods to train, validate, and test our models. We evaluated the prediction accuracy of the complete spectra by the correspondence of wavelengths of absorbance maxima and with a series of statistical measures (the best test set median model parameters are in parentheses for model II), including RMSE (0.064), (0.71), and dynamic time warping (DTW, 0.194) of the entire spectrum curve. Scrambling molecule structures with the experimental spectra during training resulted in a degraded , confirming the utility of the approaches for prediction. UV-adVISor is able to provide fast and accurate predictions for libraries of compounds.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1520-6882
1520-6882
DOI:10.1021/acs.analchem.1c03741