Fast & Space-Efficient Approximations of Language Edit Distance and RNA Folding: An Amnesic Dynamic Programming Approach
Dynamic programming is a basic, and one of the most systematic techniques for developing polynomial time algorithms with overwhelming applications. However, it often suffers from having high running time and space complexity due to (a) maintaining a table of solutions for a large number of sub-insta...
Uloženo v:
| Vydáno v: | Annual Symposium on Foundations of Computer Science s. 295 - 306 |
|---|---|
| Hlavní autor: | |
| Médium: | Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
IEEE
01.10.2017
|
| Témata: | |
| ISSN: | 0272-5428 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Dynamic programming is a basic, and one of the most systematic techniques for developing polynomial time algorithms with overwhelming applications. However, it often suffers from having high running time and space complexity due to (a) maintaining a table of solutions for a large number of sub-instances, and (b) combining/comparing these solutions to successively solve larger sub-instances. In this paper, we consider a canonical cubic time and quadratic space dynamic programming, and show how improvements in both its time and space uses are possible. As a result, we obtain fast small-space approximation algorithms for the fundamental problems of context free grammar recognition (the basic computer science problem of parsing), the language edit distance (a significant generalization of string edit distance and parsing), and RNA folding (a classical problem in bioinformatics). For these problems, ours are the first algorithms that break the cubic-time barrier of any combinatorial algorithm, and quadratic-space barrier of "any" algorithm significantly improving upon their long-standing space and time complexities. Our technique applies to many other problems as well including string edit distance computation, and finding longest increasing subsequence. Our improvements come from directly grinding the dynamic programming and looking through the lens of language edit distance which generalizes both context free grammar recognition, and RNA folding. From known conditional lower bound results, neither of these problems can have an exact combinatorial algorithm (one that does not use fast matrix multiplication) running in truly subcubic time. Moreover, for language edit distance such an algorithm cannot exist even when nontrivial multiplicative approximation is allowed. We overcome this hurdle by designing an additive-approximation algorithm that for any parameter k > 0, uses O(nk log n) space and O(n 2 k log n) time and provides an additive O(nk log n)approximation. In particular, in Õ(n) 1 space and Õ(n 2 ) time it can solve deterministically whether a string belongs to a context free grammar, or ϵ-far from it for any constant ϵ > 0. We also improve the above results to obtain an algorithm that outputs an ϵ · n-additive approximation to the above problems with space complexity O(n2/3 log n). The space complexity remains sublinear in n, as long as ϵ = o(n -1 / 4 ). Moreover, we provide the first MapReduce and streaming algorithms for them with multiple passes and sublinear space complexity. |
|---|---|
| AbstractList | Dynamic programming is a basic, and one of the most systematic techniques for developing polynomial time algorithms with overwhelming applications. However, it often suffers from having high running time and space complexity due to (a) maintaining a table of solutions for a large number of sub-instances, and (b) combining/comparing these solutions to successively solve larger sub-instances. In this paper, we consider a canonical cubic time and quadratic space dynamic programming, and show how improvements in both its time and space uses are possible. As a result, we obtain fast small-space approximation algorithms for the fundamental problems of context free grammar recognition (the basic computer science problem of parsing), the language edit distance (a significant generalization of string edit distance and parsing), and RNA folding (a classical problem in bioinformatics). For these problems, ours are the first algorithms that break the cubic-time barrier of any combinatorial algorithm, and quadratic-space barrier of "any" algorithm significantly improving upon their long-standing space and time complexities. Our technique applies to many other problems as well including string edit distance computation, and finding longest increasing subsequence. Our improvements come from directly grinding the dynamic programming and looking through the lens of language edit distance which generalizes both context free grammar recognition, and RNA folding. From known conditional lower bound results, neither of these problems can have an exact combinatorial algorithm (one that does not use fast matrix multiplication) running in truly subcubic time. Moreover, for language edit distance such an algorithm cannot exist even when nontrivial multiplicative approximation is allowed. We overcome this hurdle by designing an additive-approximation algorithm that for any parameter k > 0, uses O(nk log n) space and O(n 2 k log n) time and provides an additive O(nk log n)approximation. In particular, in Õ(n) 1 space and Õ(n 2 ) time it can solve deterministically whether a string belongs to a context free grammar, or ϵ-far from it for any constant ϵ > 0. We also improve the above results to obtain an algorithm that outputs an ϵ · n-additive approximation to the above problems with space complexity O(n2/3 log n). The space complexity remains sublinear in n, as long as ϵ = o(n -1 / 4 ). Moreover, we provide the first MapReduce and streaming algorithms for them with multiple passes and sublinear space complexity. |
| Author | Saha, Barna |
| Author_xml | – sequence: 1 givenname: Barna surname: Saha fullname: Saha, Barna email: barna@cs.umass.edu organization: Coll. of Inf. & Comput. Sci., Univ. of Massachusetts Amherst, Amherst, MA, USA |
| BookMark | eNotkMFLwzAchSNMcJ3evHnJyVtr0jRp6q1sqwrDidPzSJNfamRNS9PB9t9bmKfv8HgfvBehme88IHRPSUIpKZ6q7XKXpITmCeNXKKKcScEykbEZmpM0T2OepfIGRSH8EpIRTtgcnSoVRvyId73SEK-tddqBH3HZ90N3cq0aXecD7izeKN8cVQN4bdyIVy6MymvAyhv8-V7iqjsY55tnXHpcth6C03h19qqd-DF0zaDadsovYqV_btG1VYcAd_9coO9q_bV8jTfbl7dluYkdzfkYp6wGygtBlU0ZABgDhYRCy6ImtiY5EKpzCrUBIzSzBjgVlgsqmJQmE8AW6OHidVN73w_TpOG8l3Q6QOTsD_72XWg |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IH CBEJK RIE RIO |
| DOI | 10.1109/FOCS.2017.35 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Mathematics |
| EISBN | 1538634643 9781538634646 |
| EndPage | 306 |
| ExternalDocumentID | 8104067 |
| Genre | orig-research |
| GroupedDBID | -~X 23M 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ACNCT ADZIZ AFFNX AI. ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IPLJI OCL RIE RIL RIO RNS VH1 |
| ID | FETCH-LOGICAL-i175t-23be15961af23eeedde98e9c89b0fb07e01c71ebded6c3fde516f5616388d46e3 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 11 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000417425300026&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0272-5428 |
| IngestDate | Wed Aug 27 02:33:43 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i175t-23be15961af23eeedde98e9c89b0fb07e01c71ebded6c3fde516f5616388d46e3 |
| PageCount | 12 |
| ParticipantIDs | ieee_primary_8104067 |
| PublicationCentury | 2000 |
| PublicationDate | 2017-Oct. |
| PublicationDateYYYYMMDD | 2017-10-01 |
| PublicationDate_xml | – month: 10 year: 2017 text: 2017-Oct. |
| PublicationDecade | 2010 |
| PublicationTitle | Annual Symposium on Foundations of Computer Science |
| PublicationTitleAbbrev | SFCS |
| PublicationYear | 2017 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0040503 |
| Score | 1.785325 |
| Snippet | Dynamic programming is a basic, and one of the most systematic techniques for developing polynomial time algorithms with overwhelming applications. However, it... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 295 |
| SubjectTerms | Algorithm design and analysis Approximation algorithms Context Free Grammar Dynamic programming Edit Distance Grammar Heuristic algorithms Language Edit Distance Light emitting diodes Parsing RNA RNA-Folding |
| Title | Fast & Space-Efficient Approximations of Language Edit Distance and RNA Folding: An Amnesic Dynamic Programming Approach |
| URI | https://ieeexplore.ieee.org/document/8104067 |
| WOSCitedRecordID | wos000417425300026&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELZKxQALjxbxlgfERNq8bbNFbSMGKBUFqVvl2GeRoSlqUtSfj-2UwMDClod0SWzl_N35vu8QuvEFkzExFGUV-U5IQulQFsb6VESKByC4UrbZBBmP6WzGJi1013BhAMAWn0HPHNq9fLkUa5Mq61MdO2jvuoN2CCE1V-vb64ZG18TmU4jR9vdpU-TO-unzYGqKuEjPNHX71UTFriHpwf-efoi6P2Q8PGmWmSPUguIY7T81cqtlB21SXlb4Fk91AAzOyKpCaFs4MYLhm7xmJ5Z4qfDjNj2JRzKv8NCAR2OfFxK_jBOc1ntR9zgpcLLQXjAXeFj3rDfvYCq5Fvp-bZiL9y56S0evgwdn21LByTVOqBw_yEADmNjjyg_0F2rnxigwQVnmqswl4HqCeJBJkLEIlITIi5WGWPovpTKMIThB7WJZwCnCwLlkSsMXj_JQScYiqaPJKIp4kFHlijPUMWM4_6hVM-bb4Tv_-_IF2jMzVJfJXaJ2tVrDFdoVn1Verq7tVH8B4qWrEw |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELZKQQIWHgXxxgNiIm3esdmitlERbahokbpVjn0WGZqiJkX9-dhJKQwsbHlIl8RWzt-d7_sOoTubU-EHmqIsPdtwA1cYhLq-OuWeZA5wJmXZbCKIYzKZ0GENPWy4MABQFp9BUx-We_lizpc6VdYiKnZQ3nULbXuua1sVW-vb77pa2aTMqARa3d8mmzJ32ope2iNdxhU0dVu3X21UylUkOvjf8w_RyQ8dDw83C80RqkF2jPYHG8HVvIFWEcsLfI9HKgQGo1vqQihbONSS4au04ifmeC5xf52gxF2RFrij4aO2zzKBX-MQR9Vu1CMOMxzOlB9MOe5UXev1O-harpm6Xxlm_P0EvUXdcbtnrJsqGKlCCoVhOwkoCONbTNqO-kLl3igByglNTJmYAZgWDyxIBAifO1KAZ_lSgSz1nxLh-uCcono2z-AMYWBMUKkAjEWYKwWlnlDxpOd5zEmINPk5augxnH5UuhnT9fBd_H35Fu32xoP-tP8UP1-iPT1bVdHcFaoXiyVcox3-WaT54qac9i_1Ta5a |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Annual+Symposium+on+Foundations+of+Computer+Science&rft.atitle=Fast+%26+Space-Efficient+Approximations+of+Language+Edit+Distance+and+RNA+Folding%3A+An+Amnesic+Dynamic+Programming+Approach&rft.au=Saha%2C+Barna&rft.date=2017-10-01&rft.pub=IEEE&rft.issn=0272-5428&rft.spage=295&rft.epage=306&rft_id=info:doi/10.1109%2FFOCS.2017.35&rft.externalDocID=8104067 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0272-5428&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0272-5428&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0272-5428&client=summon |