Persistent topological Laplacian analysis of SARS-CoV-2 variants

Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in many science and engineering disciplines. However, persistent homology has limitations, including its inability to handle heterogeneous info...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of computational biophysics and chemistry Jg. 22; H. 5; S. 569
Hauptverfasser: Wei, Xiaoqi, Chen, Jiahui, Wei, Guo-Wei
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Singapore 01.08.2023
Schlagworte:
ISSN:2737-4173, 2737-4173
Online-Zugang:Weitere Angaben
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in many science and engineering disciplines. However, persistent homology has limitations, including its inability to handle heterogeneous information, such as multiple types of geometric objects; being qualitative rather than quantitative, e.g., counting a 5-member ring the same as a 6-member ring, and a failure to describe non-topological changes, such as homotopic changes in protein-protein binding. Persistent topological Laplacians (PTLs), such as persistent Laplacian and persistent sheaf Laplacian, were proposed to overcome the limitations of persistent homology. In this work, we examine the modeling and analysis power of PTLs in the study of the protein structures of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike receptor binding domain (RBD). First, we employ PTLs to study how the RBD mutation-induced structural changes of RBD-angiotensin-converting enzyme 2 (ACE2) binding complexes are captured in the changes of spectra of the PTLs among SARS-CoV-2 variants. Additionally, we use PTLs to analyze the binding of RBD and ACE2-induced structural changes of various SARS-CoV-2 variants. Finally, we explore the impacts of computationally generated RBD structures on a topological deep learning paradigm and predictions of deep mutational scanning datasets for the SARS-CoV-2 Omicron BA.2 variant. Our results indicate that PTLs have advantages over persistent homology in analyzing protein structural changes and provide a powerful new TDA tool for data science.
AbstractList Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in many science and engineering disciplines. However, persistent homology has limitations, including its inability to handle heterogeneous information, such as multiple types of geometric objects; being qualitative rather than quantitative, e.g., counting a 5-member ring the same as a 6-member ring, and a failure to describe non-topological changes, such as homotopic changes in protein-protein binding. Persistent topological Laplacians (PTLs), such as persistent Laplacian and persistent sheaf Laplacian, were proposed to overcome the limitations of persistent homology. In this work, we examine the modeling and analysis power of PTLs in the study of the protein structures of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike receptor binding domain (RBD). First, we employ PTLs to study how the RBD mutation-induced structural changes of RBD-angiotensin-converting enzyme 2 (ACE2) binding complexes are captured in the changes of spectra of the PTLs among SARS-CoV-2 variants. Additionally, we use PTLs to analyze the binding of RBD and ACE2-induced structural changes of various SARS-CoV-2 variants. Finally, we explore the impacts of computationally generated RBD structures on a topological deep learning paradigm and predictions of deep mutational scanning datasets for the SARS-CoV-2 Omicron BA.2 variant. Our results indicate that PTLs have advantages over persistent homology in analyzing protein structural changes and provide a powerful new TDA tool for data science.
Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in many science and engineering disciplines. However, persistent homology has limitations, including its inability to handle heterogeneous information, such as multiple types of geometric objects; being qualitative rather than quantitative, e.g., counting a 5-member ring the same as a 6-member ring, and a failure to describe non-topological changes, such as homotopic changes in protein-protein binding. Persistent topological Laplacians (PTLs), such as persistent Laplacian and persistent sheaf Laplacian, were proposed to overcome the limitations of persistent homology. In this work, we examine the modeling and analysis power of PTLs in the study of the protein structures of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike receptor binding domain (RBD). First, we employ PTLs to study how the RBD mutation-induced structural changes of RBD-angiotensin-converting enzyme 2 (ACE2) binding complexes are captured in the changes of spectra of the PTLs among SARS-CoV-2 variants. Additionally, we use PTLs to analyze the binding of RBD and ACE2-induced structural changes of various SARS-CoV-2 variants. Finally, we explore the impacts of computationally generated RBD structures on a topological deep learning paradigm and predictions of deep mutational scanning datasets for the SARS-CoV-2 Omicron BA.2 variant. Our results indicate that PTLs have advantages over persistent homology in analyzing protein structural changes and provide a powerful new TDA tool for data science.Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in many science and engineering disciplines. However, persistent homology has limitations, including its inability to handle heterogeneous information, such as multiple types of geometric objects; being qualitative rather than quantitative, e.g., counting a 5-member ring the same as a 6-member ring, and a failure to describe non-topological changes, such as homotopic changes in protein-protein binding. Persistent topological Laplacians (PTLs), such as persistent Laplacian and persistent sheaf Laplacian, were proposed to overcome the limitations of persistent homology. In this work, we examine the modeling and analysis power of PTLs in the study of the protein structures of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike receptor binding domain (RBD). First, we employ PTLs to study how the RBD mutation-induced structural changes of RBD-angiotensin-converting enzyme 2 (ACE2) binding complexes are captured in the changes of spectra of the PTLs among SARS-CoV-2 variants. Additionally, we use PTLs to analyze the binding of RBD and ACE2-induced structural changes of various SARS-CoV-2 variants. Finally, we explore the impacts of computationally generated RBD structures on a topological deep learning paradigm and predictions of deep mutational scanning datasets for the SARS-CoV-2 Omicron BA.2 variant. Our results indicate that PTLs have advantages over persistent homology in analyzing protein structural changes and provide a powerful new TDA tool for data science.
Author Wei, Xiaoqi
Wei, Guo-Wei
Chen, Jiahui
Author_xml – sequence: 1
  givenname: Xiaoqi
  surname: Wei
  fullname: Wei, Xiaoqi
  organization: Department of Mathematics, Michigan State University, MI 48824, USA
– sequence: 2
  givenname: Jiahui
  surname: Chen
  fullname: Chen, Jiahui
  organization: Department of Mathematics, Michigan State University, MI 48824, USA
– sequence: 3
  givenname: Guo-Wei
  surname: Wei
  fullname: Wei, Guo-Wei
  organization: Department of Biochemistry and Molecular Biology, Michigan State University, MI 48824, USA
BackLink https://www.ncbi.nlm.nih.gov/pubmed/37829318$$D View this record in MEDLINE/PubMed
BookMark eNpNj01LxDAYhIOs6Lr6A7xIjl6izXd6c1n8goLiqteSpm-kkDa1aYX991ZcwdMMMw8Dc4IWXewAoXOaXVEq2HVimmtBlWRcZhnT5gAtfyIiqOaLf_4IHXNtWM6pWaKbZxhSk0boRjzGPob40TgbcGH7YF1jO2w7G3YzgqPH2_XLlmziO2H4yw5zO6ZTdOhtSHC21xV6u7t93TyQ4un-cbMuiBMqN6SurZSV87muKBhQjispvc54LaRwXgulHa8MdQA5mFwzLzxlTknqZW2sZCt0-bvbD_FzgjSWbZMchGA7iFMqmdGaG6OkmtGLPTpVLdRlPzStHXbl32v2DU9pV-k
CitedBy_id crossref_primary_10_1016_j_jpha_2024_101081
crossref_primary_10_3934_fods_2025011
crossref_primary_10_3390_math13020208
crossref_primary_10_1093_pnasnexus_pgae158
ContentType Journal Article
DBID NPM
7X8
DOI 10.1142/s2737416523500278
DatabaseName PubMed
MEDLINE - Academic
DatabaseTitle PubMed
MEDLINE - Academic
DatabaseTitleList PubMed
MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
EISSN 2737-4173
ExternalDocumentID 37829318
Genre Journal Article
GrantInformation_xml – fundername: NIGMS NIH HHS
  grantid: R01 GM126189
– fundername: NIAID NIH HHS
  grantid: R01 AI164266
GroupedDBID NPM
7X8
ID FETCH-LOGICAL-c4698-dda55bcf97b1e8e6c3655f703d454cf7467c3b81cee9e8972f4f12c651f5d8a52
IEDL.DBID 7X8
ISICitedReferencesCount 8
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001005647300002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 2737-4173
IngestDate Fri Jul 11 11:26:30 EDT 2025
Thu Jan 02 22:38:03 EST 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 5
Keywords Persistent Laplacian
Mutation and binding induced protein structural changes
Spectral data analysis
Topological data analysis
Persistent sheaf Laplacian
Topological deep learning
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c4698-dda55bcf97b1e8e6c3655f703d454cf7467c3b81cee9e8972f4f12c651f5d8a52
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
OpenAccessLink https://pmc.ncbi.nlm.nih.gov/articles/PMC10569362/pdf/nihms-1888545.pdf
PMID 37829318
PQID 2877388656
PQPubID 23479
ParticipantIDs proquest_miscellaneous_2877388656
pubmed_primary_37829318
PublicationCentury 2000
PublicationDate 20230800
PublicationDateYYYYMMDD 2023-08-01
PublicationDate_xml – month: 08
  year: 2023
  text: 20230800
PublicationDecade 2020
PublicationPlace Singapore
PublicationPlace_xml – name: Singapore
PublicationTitle Journal of computational biophysics and chemistry
PublicationTitleAlternate J Comput Biophys Chem
PublicationYear 2023
References 36748007 - ArXiv. 2023 Apr 6:arXiv:2301.10865v2.
References_xml – reference: 36748007 - ArXiv. 2023 Apr 6:arXiv:2301.10865v2.
Score 2.2985034
Snippet Topological data analysis (TDA) is an emerging field in mathematics and data science. Its central technique, persistent homology, has had tremendous success in...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 569
Title Persistent topological Laplacian analysis of SARS-CoV-2 variants
URI https://www.ncbi.nlm.nih.gov/pubmed/37829318
https://www.proquest.com/docview/2877388656
Volume 22
WOSCitedRecordID wos001005647300002&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1JSwMxGA1qPXhxwa1uRPAa2myT5KSlWDzUUqyW3oZMJgEvM9Wp_f1-mZniSRC85BAIhG_LS75HHkJ3mQg1bCY54FEijGfEwhzxlinpArWuZvnOx2oy0YuFmbYPblVLq9zUxLpQ56WLb-Q9QPaKaw3w4375QaJqVOyuthIa26jDAcrEqFaLWoROcUUEVbxtZFLBelWchI3B1UvWDbffQWV9uIwO_rutQ7Tfwko8aOLgCG354hg9RHp7dGOxwqtGDCG6BI9tZGJBXGDbfkmCy4Bng5cZGZZzwvAaLtCRH3OC3kaPr8Mn0iomEBeVIEmeWykzF4zKqNc-cTyRMkBS50IKF6K0iOOZpnAyGq-NYkEEylwiaZC5tpKdop2iLPw5wo5JrrxyLBNcaJubTAtDQ2Z931rP-110uzFIChEZ2wy28OVXlf6YpIvOGqumy-brjJQDIDFQRi7-sPoS7UVt94Ztd4U6AfLRX6Ndt169V583tathnEyfvwEkIrR1
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Persistent+topological+Laplacian+analysis+of+SARS-CoV-2+variants&rft.jtitle=Journal+of+computational+biophysics+and+chemistry&rft.au=Wei%2C+Xiaoqi&rft.au=Chen%2C+Jiahui&rft.au=Wei%2C+Guo-Wei&rft.date=2023-08-01&rft.eissn=2737-4173&rft.volume=22&rft.issue=5&rft.spage=569&rft_id=info:doi/10.1142%2Fs2737416523500278&rft_id=info%3Apmid%2F37829318&rft_id=info%3Apmid%2F37829318&rft.externalDocID=37829318
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2737-4173&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2737-4173&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2737-4173&client=summon