High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs

Bibliographic Details
Published in: Proceedings - International Conference on Parallel and Distributed Systems, pp. 570-575
Main Authors: Lin, Cheng-Hung; Huang, Chun-Cheng
Format: Conference Proceeding; Journal Article
Language: English
Published: IEEE, 01.12.2015
ISSN: 1521-9097
Abstract: Approximate string matching is widely used in applications such as deoxyribonucleic acid (DNA) sequence searching, spell checking, text mining, and spam filtering. It finds all locations in a text where a string matches a pattern to within a given number of insertion, deletion, and substitution operations. Among the algorithms proposed for this task, bit-parallel algorithms are considered among the best and most efficient. However, traditional bit-parallel algorithms lack the ability to identify the start and end positions of a matched pattern. Furthermore, accelerating bit-parallel algorithms has become crucial for processing big data. In this paper, we propose two parallel location-aware algorithms, called data-segmented parallelism and high-degree parallelism, to accelerate approximate string matching on graphics processing units (GPUs). Experimental results show that high-degree parallelism on GPUs achieves significant improvements in system and kernel throughput compared to its CPU counterparts. Compared to state-of-the-art approaches, the proposed high-degree parallelism achieves an 11- to 105-fold improvement.
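For readers unfamiliar with the bit-parallel approach the abstract refers to, the following is a minimal sketch of the classic Wu-Manber style scheme in Python. It is not the authors' location-aware algorithm or their GPU implementation, which this record does not describe in detail; the function name `bitap_end_positions` and its parameters are hypothetical. The sketch illustrates one known limitation of the classic scheme: it reports where an approximate match ends but not where it starts.

```python
def bitap_end_positions(pattern, text, k):
    """Return 0-based end indices in `text` where `pattern` matches
    with at most `k` insertions, deletions, or substitutions.

    Illustrative sketch of a classic bit-parallel scheme; not the
    algorithm proposed in the paper.
    """
    m = len(pattern)
    if m == 0:
        raise ValueError("pattern must be non-empty")
    full = (1 << m) - 1      # keep only the m state bits
    accept = 1 << (m - 1)    # bit set when the whole pattern has matched

    # masks[c] has bit i set when pattern[i] == c.
    masks = {}
    for i, ch in enumerate(pattern):
        masks[ch] = masks.get(ch, 0) | (1 << i)

    # rows[d], bit i: pattern[:i+1] matches some suffix of the text read
    # so far with at most d edits. Before any text is read, a prefix of
    # length <= d can be matched by skipping its characters.
    rows = [((1 << d) - 1) & full for d in range(k + 1)]

    ends = []
    for pos, ch in enumerate(text):
        b = masks.get(ch, 0)
        old = rows[:]
        rows[0] = ((old[0] << 1) | 1) & b
        for d in range(1, k + 1):
            match = ((old[d] << 1) | 1) & b        # characters agree
            insertion = old[d - 1]                 # extra character in the text
            substitution = (old[d - 1] << 1) | 1   # characters differ
            deletion = (rows[d - 1] << 1) | 1      # pattern character skipped
            rows[d] = (match | insertion | substitution | deletion) & full
        if rows[k] & accept:
            ends.append(pos)  # only the end position is known here
    return ends


print(bitap_end_positions("abc", "xxabcxx", 0))  # [4]
print(bitap_end_positions("abc", "abd", 1))      # [1, 2]
```

Each text character costs a constant number of word operations per error level, which is why such algorithms are attractive candidates for GPU acceleration. Recovering start positions requires extra bookkeeping that this sketch deliberately omits.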
CODEN IEEPAD
DOI 10.1109/ICPADS.2015.77
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Discipline Computer Science
EISBN 0769557856
9780769557854
EISSN 1521-9097
IsPeerReviewed false
IsScholarly true
PublicationTitleAbbrev PADSW
SubjectTerms Acceleration
Algorithm design and analysis
Algorithms
Approximate string matching
Approximation algorithms
bit-parallel algorithm
Business process re-engineering
Computer networks
Data mining
Digital signal processing
graphic processing units
Insertion
Levenshtein distance
nondeterministic finite automaton
parallel algorithm
Parallel processing
Pattern matching
Position (location)
Registers
Searching
URI https://ieeexplore.ieee.org/document/7384340
https://www.proquest.com/docview/1793228668