High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs

Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern in accordance with the number of insert...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Proceedings - International Conference on Parallel and Distributed Systems s. 570 - 575
Hlavní autori: Lin, Cheng-Hung, Huang, Chun-Cheng
Médium: Konferenčný príspevok.. Journal Article
Jazyk:English
Vydavateľské údaje: IEEE 01.12.2015
Predmet:
ISSN:1521-9097
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern in accordance with the number of insertion, deletion, and substitution operations. Among the proposed algorithms, the bit-parallel algorithms are considered to be the best and highly efficient algorithms. However, the traditional bit-parallel algorithms lacks the ability of identifying the start and end positions of a matched pattern. Furthermore, acceleration of the bit-parallel algorithms has become a crucial issue for processing big data nowadays. In this paper, we propose two kinds of parallel location-aware algorithms called data-segmented parallelism and high-degree parallelism as means to accelerate approximate string matching using graphic processing units. Experimental results show that the high-degree parallelism on GPUs achieves significant improvement in system and kernel throughputs compared to CPU counterparts. Compared to state-of-the-art approaches, the proposed high-degree parallelism achieves 11 to 105 times improvement.
AbstractList Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and spam filters. The method is designed to find all locations of strings that approximately match a pattern in accordance with the number of insertion, deletion, and substitution operations. Among the proposed algorithms, the bit-parallel algorithms are considered to be the best and highly efficient algorithms. However, the traditional bit-parallel algorithms lacks the ability of identifying the start and end positions of a matched pattern. Furthermore, acceleration of the bit-parallel algorithms has become a crucial issue for processing big data nowadays. In this paper, we propose two kinds of parallel location-aware algorithms called data-segmented parallelism and high-degree parallelism as means to accelerate approximate string matching using graphic processing units. Experimental results show that the high-degree parallelism on GPUs achieves significant improvement in system and kernel throughputs compared to CPU counterparts. Compared to state-of-the-art approaches, the proposed high-degree parallelism achieves 11 to 105 times improvement.
Author Cheng-Hung Lin
Chun-Cheng Huang
Author_xml – sequence: 1
  givenname: Cheng-Hung
  surname: Lin
  fullname: Lin, Cheng-Hung
– sequence: 2
  givenname: Chun-Cheng
  surname: Huang
  fullname: Huang, Chun-Cheng
BookMark eNotjzFPwzAUhA0CibawsrBkZEl5jms7HqMCbaUiIpVODNGr-9IGJXGxUwH_nqAy3Z306XQ3ZBeta4mxWw5jzsE8LKZ59rgaJ8DlWOszNgStjJQ6leqcDbhMeGzA6Cs2DOEDIAEhYcDe59VuH-fkS-cbbC1FOXqsa6qjpbPYVa6Nsy_0FGX1zvmq2zch6tkoOxy8-64a7Chadb5qd9ELdnb_Z1wbzfJ1uGaXJdaBbv51xNbPT2_Tebx8nS2m2TKuEki7WKOxRnFMSlSwNYLEBoW0qg-kjBZyS1iWcjsBUoBYAmxsAqVMob9mbSlG7P7U2y_6PFLoiqYKluoaW3LHUHBtRJKkSqU9endCKyIqDr7f738KLdKJmID4BcZLYjY
CODEN IEEPAD
ContentType Conference Proceeding
Journal Article
DBID 6IE
6IL
CBEJK
RIE
RIL
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1109/ICPADS.2015.77
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Xplore Digital Library
IEEE Proceedings Order Plans (POP All) 1998-Present
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList
Computer and Information Systems Abstracts
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore Digital Library
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 0769557856
9780769557854
EISSN 1521-9097
EndPage 575
ExternalDocumentID 7384340
Genre orig-research
GroupedDBID 23M
29O
6IE
6IF
6IH
6IK
6IL
6IM
6IN
AAJGR
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
OCL
RIE
RIL
RNS
7SC
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-i208t-7a9c961a2fa60d93e3ba35c660de69735deaff5d40e60aaf00bc20f580856ccf3
IEDL.DBID RIE
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000381621400069&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Sun Nov 09 10:14:30 EST 2025
Wed Aug 27 02:36:42 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i208t-7a9c961a2fa60d93e3ba35c660de69735deaff5d40e60aaf00bc20f580856ccf3
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Conference-1
ObjectType-Feature-3
content type line 23
SourceType-Conference Papers & Proceedings-2
PQID 1793228668
PQPubID 23500
PageCount 6
ParticipantIDs ieee_primary_7384340
proquest_miscellaneous_1793228668
PublicationCentury 2000
PublicationDate 20151201
PublicationDateYYYYMMDD 2015-12-01
PublicationDate_xml – month: 12
  year: 2015
  text: 20151201
  day: 01
PublicationDecade 2010
PublicationTitle Proceedings - International Conference on Parallel and Distributed Systems
PublicationTitleAbbrev PADSW
PublicationYear 2015
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0020350
ssib030099276
Score 1.9309021
Snippet Approximate string matching has been widely used in many applications, including deoxyribonucleic acid sequence searching, spell checking, text mining, and...
SourceID proquest
ieee
SourceType Aggregation Database
Publisher
StartPage 570
SubjectTerms Acceleration
Algorithm design and analysis
Algorithms
Approximate string matching
Approximation algorithms
bit-parallel algorithm
Business process re-engineering
Computer networks
Data mining
Digital signal processing
graphic processing units
Insertion
Levenshtein distance
nondeterministic finite automaton
parallel algorithm
Parallel processing
Pattern matching
Position (location)
Registers
Searching
Title High-Performance Parallel Location-Aware Algorithms for Approximate String Matching on GPUs
URI https://ieeexplore.ieee.org/document/7384340
https://www.proquest.com/docview/1793228668
WOSCitedRecordID wos000381621400069&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8QwEA4qHjytjxXXFxE8Wk2bNo_j4htUCroieFjSNNGCttJ21Z9vpg896MVbC20p0-l88_pmENp3mOhANXFhCQ21F1qpPeU0CxrXksTXytcNF-b-it_ciIcHGc-hg28ujDGmaT4zh3DY1PLTQs8gVXbEqQhp6AL0ec5Zy9XqdYeCqxMAtHbBFlTMuiGNPpFHl8fx-OQWWrmiQ96vUvllfxtQORv873WW0fCHnYfjb9xZQXMmX0WDfj0D7v7WNfQIPRxe_MMMwLEqYXXKC74q2kydN_5QpcHjl6eizOrn1wq7a_EY5ox_Zs6XNfi2hsQfvnYWG3JVuMjxeTyphmhydnp3fOF1uxS8LCCi9riSWjJfBVYxkkpqaKJopJk7MUxyGqVGWRulITGMKGUJSXRAbCScS8a0tnQdLeRFbjYQ5u5BPPKT1ILzQoJERko4lE-ZIEYKf4TWQFbTt3ZcxrQT0wjt9cKeOhWGuoTKTTGrpmAjgkAwJjb_vnULLcGHa7tIttFCXc7MDlrU73VWlbuNHnwBIgGzgg
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT4NAEN40aqKn-oz1uSYepV1YWHaPTbW2sTYkWtPEA1mWRZtUMJSqP98dSu1BL94gAUKGYb55fTMIXRpMNKAambCEuspyE6EsaTQLGteiyFbSViUX5mngD4d8PBZBDV39cGG01mXzmW7CYVnLjzM1h1RZy6fcpa4J0Nc913XIgq211B4Kzo4D4FqFW1Azq8Y02kS0-p2gff0AzVxe018uU_llgUtY6db_90LbaH_Fz8PBD_LsoJpOd1F9uaABV__rHnqGLg4rWHEDcCBzWJ4yxYNskauz2p8y17g9fcnySfH6NsPmWtyGSeNfE-PNavxQQOoP3xubDdkqnKX4NhjN9tGoe_PY6VnVNgVr4hBeWL4USjBbOolkJBZU00hSTzFzopnwqRdrmSRe7BLNiJQJIZFySOJx45QxpRJ6gNbSLNWHCPvmQb5nR3EC7gtxIuFJbnA-Zpxowe0G2gNZhe-LgRlhJaYGulgKOzRKDJUJmepsPgvBSjgOZ4wf_X3rOdrsPd4PwkF_eHeMtuAjLnpKTtBakc_1KdpQH8Vklp-VOvENovK2yQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Proceedings+-+International+Conference+on+Parallel+and+Distributed+Systems&rft.atitle=High-Performance+Parallel+Location-Aware+Algorithms+for+Approximate+String+Matching+on+GPUs&rft.au=Cheng-Hung+Lin&rft.au=Chun-Cheng+Huang&rft.date=2015-12-01&rft.pub=IEEE&rft.eissn=1521-9097&rft.spage=570&rft.epage=575&rft_id=info:doi/10.1109%2FICPADS.2015.77&rft.externalDocID=7384340