High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs
Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern in accordance with the number of insert...
Saved in:
| Published in: | Proceedings - International Conference on Parallel and Distributed Systems pp. 570 - 575 |
|---|---|
| Main Authors: | , |
| Format: | Conference Proceeding Journal Article |
| Language: | English |
| Published: |
IEEE
01.12.2015
|
| Subjects: | |
| ISSN: | 1521-9097 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern in accordance with the number of insertion, deletion, and substitution operations. Among the proposed algorithms, the bit-parallel algorithms are considered to be the best and highly efficient algorithms. However, the traditional bit-parallel algorithms lacks the ability of identifying the start and end positions of a matched pattern. Furthermore, acceleration of the bit-parallel algorithms has become a crucial issue for processing big data nowadays. In this paper, we propose two kinds of parallel location-aware algorithms called data-segmented parallelism and high-degree parallelism as means to accelerate approximate string matching using graphic processing units. Experimental results show that the high-degree parallelism on GPUs achieves significant improvement in system and kernel throughputs compared to CPU counterparts. Compared to state-of-the-art approaches, the proposed high-degree parallelism achieves 11 to 105 times improvement. |
|---|---|
| AbstractList | Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern in accordance with the number of insertion, deletion, and substitution operations. Among the proposed algorithms, the bit-parallel algorithms are considered to be the best and highly efficient algorithms. However, the traditional bit-parallel algorithms lacks the ability of identifying the start and end positions of a matched pattern. Furthermore, acceleration of the bit-parallel algorithms has become a crucial issue for processing big data nowadays. In this paper, we propose two kinds of parallel location-aware algorithms called data-segmented parallelism and high-degree parallelism as means to accelerate approximate string matching using graphic processing units. Experimental results show that the high-degree parallelism on GPUs achieves significant improvement in system and kernel throughputs compared to CPU counterparts. Compared to state-of-the-art approaches, the proposed high-degree parallelism achieves 11 to 105 times improvement. |
| Author | Cheng-Hung Lin Chun-Cheng Huang |
| Author_xml | – sequence: 1 givenname: Cheng-Hung surname: Lin fullname: Lin, Cheng-Hung – sequence: 2 givenname: Chun-Cheng surname: Huang fullname: Huang, Chun-Cheng |
| BookMark | eNotjzFPwzAUhA0CibawsrBkZEl5jms7HqMCbaUiIpVODNGr-9IGJXGxUwH_nqAy3Z306XQ3ZBeta4mxWw5jzsE8LKZ59rgaJ8DlWOszNgStjJQ6leqcDbhMeGzA6Cs2DOEDIAEhYcDe59VuH-fkS-cbbC1FOXqsa6qjpbPYVa6Nsy_0FGX1zvmq2zch6tkoOxy8-64a7Chadb5qd9ELdnb_Z1wbzfJ1uGaXJdaBbv51xNbPT2_Tebx8nS2m2TKuEki7WKOxRnFMSlSwNYLEBoW0qg-kjBZyS1iWcjsBUoBYAmxsAqVMob9mbSlG7P7U2y_6PFLoiqYKluoaW3LHUHBtRJKkSqU9endCKyIqDr7f738KLdKJmID4BcZLYjY |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding Journal Article |
| DBID | 6IE 6IL CBEJK RIE RIL 7SC 8FD JQ2 L7M L~C L~D |
| DOI | 10.1109/ICPADS.2015.77 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present Computer and Information Systems Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
| DatabaseTitle | Computer and Information Systems Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | Computer and Information Systems Abstracts |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISBN | 0769557856 9780769557854 |
| EISSN | 1521-9097 |
| EndPage | 575 |
| ExternalDocumentID | 7384340 |
| Genre | orig-research |
| GroupedDBID | 23M 29O 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI OCL RIE RIL RNS 7SC 8FD JQ2 L7M L~C L~D |
| ID | FETCH-LOGICAL-i208t-7a9c961a2fa60d93e3ba35c660de69735deaff5d40e60aaf00bc20f580856ccf3 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 0 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000381621400069&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Sun Nov 09 10:14:30 EST 2025 Wed Aug 27 02:36:42 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i208t-7a9c961a2fa60d93e3ba35c660de69735deaff5d40e60aaf00bc20f580856ccf3 |
| Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Conference-1 ObjectType-Feature-3 content type line 23 SourceType-Conference Papers & Proceedings-2 |
| PQID | 1793228668 |
| PQPubID | 23500 |
| PageCount | 6 |
| ParticipantIDs | ieee_primary_7384340 proquest_miscellaneous_1793228668 |
| PublicationCentury | 2000 |
| PublicationDate | 20151201 |
| PublicationDateYYYYMMDD | 2015-12-01 |
| PublicationDate_xml | – month: 12 year: 2015 text: 20151201 day: 01 |
| PublicationDecade | 2010 |
| PublicationTitle | Proceedings - International Conference on Parallel and Distributed Systems |
| PublicationTitleAbbrev | PADSW |
| PublicationYear | 2015 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0020350 ssib030099276 |
| Score | 1.930796 |
| Snippet | Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and... |
| SourceID | proquest ieee |
| SourceType | Aggregation Database Publisher |
| StartPage | 570 |
| SubjectTerms | Acceleration Algorithm design and analysis Algorithms Approximate string matching Approximation algorithms bit-parallel algorithm Business process re-engineering Computer networks Data mining Digital signal processing graphic processing units Insertion Levenshtein distance nondeterministic finite automaton parallel algorithm Parallel processing Pattern matching Position (location) Registers Searching |
| Title | High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs |
| URI | https://ieeexplore.ieee.org/document/7384340 https://www.proquest.com/docview/1793228668 |
| WOSCitedRecordID | wos000381621400069&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8NAEF5q8eCpahXrixU8GrvNZh85lmpV0BJQQfBQNptZLdRE0lT9-e7mUQ968ZaFBJbZyczszDfzIXQ6AMOscWReomPp2fifeiGzB6J9rWOeGKaEKskmxGQin57CqIXOVr0wAFCCz-DcPZa1_CTTS5cq6wsqAxrYC_qaELzq1Wp0h7pQx3eutb5suYpZPaRxQML-zSgaXtw7KBc7Fw2Vyi_7WzqVced_29lEOz_deTha-Z0t1IJ0G3UaegZc_61d9OwwHF700xmAI5U76pQ5vs2qTJ03_FQ54OH8JctnxevbAtt38dDNGf-a2VgW8H3hEn_4zlpsl6vCWYqvosfFDnocXz6Mrr2aS8Gb-UQWnlChDvlA-UZxkoQUaKwo09wugIeCsgSUMSwJCHCilCEk1j4xTNqQjGtt6C5qp1kKewhDGMtgYKNykqhAGZABMybhxqGzFBO6h7pOVtP3alzGtBZTD500wp5aFXZ1CZVCtlxMnY3wfcm53P_70wO04Q6uQpEconaRL-EIreuPYrbIj0s9-AYiL7UE |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3dS8MwEA8yBX3yYxO_jeCjnVnTpOnjUKfiNgrbYOBDSdOLDrSVrlP_fJOu2x70xbcGWgiX693l7nf3Q-iyBZoZ48icRMXCMfE_dQJmDkS5SsU80Uz6siSb8Pt9MR4H4Rq6WvbCAEAJPoOmfSxr-UmmZjZVdu1T4VHPXNDXmee5ZN6ttdAeaoMd1zrX6rpla2bVmMYWCa4fb8L27cCCuVjTX5Cp_LLApVvpbP9vQzuoserPw-HS8-yiNUj30PaCoAFX_2sdPVsUhxOuegNwKHNLnvKGu9k8V-e0v2QOuP32kuWT4vV9is27uG0njX9PTDQLeFDY1B_uGZtts1U4S_F9OJo20KhzN7x5cCo2BWfiElE4vgxUwFvS1ZKTJKBAY0mZ4mYBPPApS0BqzRKPACdSakJi5RLNhAnKuFKa7qNamqVwgDAEsfBaJi4nifSkBuExrROuLT5LMl8dorqVVfQxH5gRVWI6RBcLYUdGiW1lQqaQzaaRtRKuKzgXR39_eo42H4a9btR97D8doy17iHNMyQmqFfkMTtGG-iwm0_ys1Ikftoi4Sw |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Proceedings+-+International+Conference+on+Parallel+and+Distributed+Systems&rft.atitle=High-Performance+Parallel+Location-Aware+Algorithms+for+Approximate+String+Matching+on+GPUs&rft.au=Cheng-Hung+Lin&rft.au=Chun-Cheng+Huang&rft.date=2015-12-01&rft.pub=IEEE&rft.eissn=1521-9097&rft.spage=570&rft.epage=575&rft_id=info:doi/10.1109%2FICPADS.2015.77&rft.externalDocID=7384340 |