A tile-based parallel Viterbi algorithm for biological sequence alignment on GPU with CUDA
The Viterbi algorithm is the compute-intensive kernel in Hidden Markov Model (HMM) based sequence alignment applications. In this paper, we investigate extending several parallel methods, such as the wave-front and streaming methods for the Smith-Waterman algorithm, to achieve a significant speed-up...
Saved in:
| Published in: | 2010 IEEE International Symposium on Parallel and Distributed Processing, Workshops and Phd Forum pp. 1 - 8 |
|---|---|
| Main Authors: | , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
IEEE
01.04.2010
|
| Subjects: | |
| ISBN: | 9781424465330, 1424465338 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | The Viterbi algorithm is the compute-intensive kernel in Hidden Markov Model (HMM) based sequence alignment applications. In this paper, we investigate extending several parallel methods, such as the wave-front and streaming methods for the Smith-Waterman algorithm, to achieve a significant speed-up on a GPU. The wave-front method can take advantage of the computing power of the GPU but it cannot handle long sequences because of the physical GPU memory limit. On the other hand, the streaming method can process long sequences but with increased overhead due to the increased data transmission between CPU and GPU. To further improve the performance on GPU, we propose a new tile-based parallel algorithm. We take advantage of the homological segments to divide long sequences into many short pieces and each piece pair (tile) can be fully held in the GPU's memory. By reorganizing the computational kernel of the Viterbi algorithm, the basic computing unit can be divided into two parts: independent and dependent parts. All of the independent parts are executed with a balanced load in an optimized coalesced memory-accessing manner, which significantly improves the Viterbi algorithm's performance on GPU. The experimental results show that our new tile-based parallel Viterbi algorithm can outperform the wave-front and the streaming methods. Especially for the long sequence alignment problem, the best performance of tile-based algorithm is on average about an order magnitude faster than the serial Viterbi algorithm. |
|---|---|
| AbstractList | The Viterbi algorithm is the compute-intensive kernel in Hidden Markov Model (HMM) based sequence alignment applications. In this paper, we investigate extending several parallel methods, such as the wave-front and streaming methods for the Smith-Waterman algorithm, to achieve a significant speed-up on a GPU. The wave-front method can take advantage of the computing power of the GPU but it cannot handle long sequences because of the physical GPU memory limit. On the other hand, the streaming method can process long sequences but with increased overhead due to the increased data transmission between CPU and GPU. To further improve the performance on GPU, we propose a new tile-based parallel algorithm. We take advantage of the homological segments to divide long sequences into many short pieces and each piece pair (tile) can be fully held in the GPU's memory. By reorganizing the computational kernel of the Viterbi algorithm, the basic computing unit can be divided into two parts: independent and dependent parts. All of the independent parts are executed with a balanced load in an optimized coalesced memory-accessing manner, which significantly improves the Viterbi algorithm's performance on GPU. The experimental results show that our new tile-based parallel Viterbi algorithm can outperform the wave-front and the streaming methods. Especially for the long sequence alignment problem, the best performance of tile-based algorithm is on average about an order magnitude faster than the serial Viterbi algorithm. |
| Author | Zhihui Du Zhaoming Yin Bader, David A |
| Author_xml | – sequence: 1 surname: Zhihui Du fullname: Zhihui Du email: duzh@tsinghua.edu.cn organization: Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China – sequence: 2 surname: Zhaoming Yin fullname: Zhaoming Yin email: zhaoming_leon@pku.edu.cn organization: Sch. of Software & Microelectron., Peking Univ., Beijing, China – sequence: 3 givenname: David A surname: Bader fullname: Bader, David A organization: Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA |
| BookMark | eNo1UM1Kw0AYXFFBW_MEvewLpH7Zv2yOIdVaKBiwVfFSNsm3cWWT1CQivr0B61yGGYZhmBm5aLsWCVlEsIwiSG43-Sp_elkymAwpYkiAn5FZJJgQSnL2ek6CJNb_msMVCYbhAyYIyRSIa_KW0tF5DAszYEWPpjfeo6fPbsS-cNT4uuvd-N5Q2_W0cJ3valcaTwf8_MK2xCnh6rbBdqRdS9f5nn5PcZrtV-kNubTGDxiceE5293e77CHcPq43WboNXQJjaCM1DdEKoFRCgS0LjCXaQlfCFparkoHW1jITM1RKaommkiau4jISnKPmc7L4q3WIeDj2rjH9z-H0Bv8FMIRVfg |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/IPDPSW.2010.5470903 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore Digital Library IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore Digital Library url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 142446532X 9781424465347 1424465346 9781424465323 |
| EndPage | 8 |
| ExternalDocumentID | 5470903 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AAWTH ADFMO ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK IEGSK IERZE OCL RIB RIC RIE RIL |
| ID | FETCH-LOGICAL-i90t-f166048600c6460fcbe75efb8d4fbf36c2088ff2a72e66585ead5a7d7c1433e83 |
| IEDL.DBID | RIE |
| ISBN | 9781424465330 1424465338 |
| IngestDate | Wed Aug 27 02:22:30 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i90t-f166048600c6460fcbe75efb8d4fbf36c2088ff2a72e66585ead5a7d7c1433e83 |
| PageCount | 8 |
| ParticipantIDs | ieee_primary_5470903 |
| PublicationCentury | 2000 |
| PublicationDate | 2010-April |
| PublicationDateYYYYMMDD | 2010-04-01 |
| PublicationDate_xml | – month: 04 year: 2010 text: 2010-April |
| PublicationDecade | 2010 |
| PublicationTitle | 2010 IEEE International Symposium on Parallel and Distributed Processing, Workshops and Phd Forum |
| PublicationTitleAbbrev | IPDPSW |
| PublicationYear | 2010 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0000452604 |
| Score | 1.5303143 |
| Snippet | The Viterbi algorithm is the compute-intensive kernel in Hidden Markov Model (HMM) based sequence alignment applications. In this paper, we investigate... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | Algorithm design and analysis Biology computing Concurrent computing Decoding GPGPU Graphics Heuristic algorithms hidden Markov model Hidden Markov models Kernel Tiles Viterbi algorithm |
| Title | A tile-based parallel Viterbi algorithm for biological sequence alignment on GPU with CUDA |
| URI | https://ieeexplore.ieee.org/document/5470903 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwELbaioEJUIt4ywMjpnn4lbFqKbBUkWihYqmS-FwihQSFlN-PnaRFSCxstmVZll93On_fdwhde5QJCBgQrXyPUK6ASM4UUb52hVZBJD1VJ5sQs5lcLoOwg252XBgAqMFncGuL9V--KpKNDZUNGRU2rNBFXSF4w9XaxVOsNDh36Ja7xY0bI7eSTm3daVWHXCcYPoaT8OmlgXa1w_7Kr1Kbl-nB_yZ2iAY_PD0c7izQEepA3kevI1yZq06sfVLYSntnGWT42XKN4xRH2boo0-rtHRt3FTcaTHaj8BZUbXqk6xojgIsc34cLbGO1eLyYjAZoPr2bjx9Im0GBpIFTEe1yXmeZchJOuaOTGAQDHUtFdax9nnjmjdHai4QH3LgizBwrFgklEuNF-SD9Y9TLixxOEJZCKT_yHa3jgErNIsoibaklAaMgqXuK-nZZVh-NRsaqXZGzv5vP0X7zC28RMBeoV5UbuER7yVeVfpZX9cZ-AyvyoDM |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1bT8IwFG4QTfRJDRjv9sFHK2XrbY8ERIhIlghKfCHb2uKSuRkc_n7bbWBMfPGtbZqm6e2cnH7fdwC4dgjlyqMKaek6iDCpkGBUIunqNtfSC4Qji2QTfDwWs5nn18DNhgujlCrAZ-rWFou_fJlFKxsqa1HCbVhhC2xTQhxcsrU2ERUrDs4wWbO3mHFkxFrUqarjSneojb3W0O_5Ty8luKsa-FeGlcLA9Pf_N7UD0Pxh6kF_Y4MOQU2lDfDagbm57MhaKAmtuHeSqAQ-W7ZxGMMgWWTLOH97h8ZhhaUKk90quIZVmx7xokAJwCyF9_4U2mgt7E57nSaY9O8m3QGqciig2MM50m3GijxTOGKEYR2FilOlQyGJDrXLIse8Mlo7AXcUM84INQeLBlzyyPhRrhLuEainWaqOARRcSjdwsdahR4SmAaGBtuQSjxIlSPsENOyyzD9KlYx5tSKnfzdfgd3B5HE0Hw3HD2dgr_yTt3iYc1DPlyt1AXairzz-XF4Wm_wNtamjeg |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2010+IEEE+International+Symposium+on+Parallel+and+Distributed+Processing%2C+Workshops+and+Phd+Forum&rft.atitle=A+tile-based+parallel+Viterbi+algorithm+for+biological+sequence+alignment+on+GPU+with+CUDA&rft.au=Zhihui+Du&rft.au=Zhaoming+Yin&rft.au=Bader%2C+David+A&rft.date=2010-04-01&rft.pub=IEEE&rft.isbn=9781424465330&rft.spage=1&rft.epage=8&rft_id=info:doi/10.1109%2FIPDPSW.2010.5470903&rft.externalDocID=5470903 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424465330/lc.gif&client=summon&freeimage=true |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424465330/mc.gif&client=summon&freeimage=true |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781424465330/sc.gif&client=summon&freeimage=true |

