GOP-level parallelization of the H.264 decoder without a start-code scanner
Recent research on the parallelization of H.264 video decoders has focused on fine-grained methods. This work has led to designs with very short latencies and good memory usage. However, these designs could not reach the scalability of Group of Pictures (GOP) level approaches, even when assuming a well-designed entrop...
| Published in: | 2010 2nd International Conference on Signal Processing Systems Vol. 3; pp. V3-627 - V3-630 |
|---|---|
| Main Authors: | Gurhanli, Ahmet; Chen, Charlie Chung-Ping; Shih-Hao Hung |
| Format: | Conference Proceeding |
| Language: | English |
| Published: | IEEE, 01.07.2010 |
| Subjects: | |
| ISBN: | 9781424468928, 1424468922 |
| Abstract | Recent research on the parallelization of H.264 video decoders has focused on fine-grained methods. This work has led to designs with very short latencies and good memory usage. However, these designs could not reach the scalability of Group of Pictures (GOP) level approaches, even when assuming a well-designed entropy decoder that can feed the increasing number of cores working in parallel. We introduce a GOP-level approach because of its high scalability and describe solutions to its well-known latency and memory issues. Our design removes the need for a GOP start-code scanner, which earlier methods used, so all cores can work on the decoding task. Although performance on shared-memory systems leaves room for improvement, we observed one-to-one linear speedup across nodes working in parallel. We tested our method on a cluster of 5 machines, each with 2 processors of 4 cores. Decoding is 5 times faster when we run only one process on each machine; that is, we saw one-to-one linear speedup when there is no memory shortage. We observed a maximum speedup of 11 times when using all 40 cores distributed among the 5 machines. |
|---|---|
| Author | Gurhanli, Ahmet; Chen, Charlie Chung-Ping; Shih-Hao Hung |
| Author_xml | 1. Gurhanli, Ahmet (ahmet@ntu.edu.tw), Grad. Inst. of Electron. Eng., Nat. Taiwan Univ., Taipei, Taiwan; 2. Chen, Charlie Chung-Ping (cchen@cc.ee.ntu.edu.tw), Grad. Inst. of Electron. Eng., Nat. Taiwan Univ., Taipei, Taiwan; 3. Shih-Hao Hung (hungsh@csie.ntu.edu.tw), Grad. Inst. of Networking & Multimedia, Nat. Taiwan Univ., Taipei, Taiwan |
| DOI | 10.1109/ICSPS.2010.5555416 |
| EISBN | 9781424468935 1424468930 |
| EndPage | V3-630 |
| PublicationDate | 2010-July |
| PublicationTitle | 2010 2nd International Conference on Signal Processing Systems |
| PublicationTitleAbbrev | ICSPS |
| StartPage | V3-627 |
| SubjectTerms | Decoding; Entropy; H.264 Decoder; High-Performance Computing; Image/Video Processing and Coding; Magnetic cores; Parallel and Distributed Processing; Parallel processing; Program processors; Scalability; Streaming media; Video Compression |
| URI | https://ieeexplore.ieee.org/document/5555416 |
| Volume | 3 |


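The speedup figures reported in the abstract can be put in terms of parallel efficiency (speedup divided by the number of workers). The short sketch below only restates the abstract's numbers; the function name `parallel_efficiency` and the variable names are illustrative assumptions and do not come from the paper.

```python
def parallel_efficiency(speedup: float, workers: int) -> float:
    """Return speedup per worker (1.0 means ideal linear scaling)."""
    if workers <= 0:
        raise ValueError("workers must be positive")
    return speedup / workers


# Cluster described in the abstract: 5 machines, 2 processors each, 4 cores each.
machines = 5
processors_per_machine = 2
cores_per_processor = 4
total_cores = machines * processors_per_machine * cores_per_processor

# One process per machine: 5x speedup on 5 workers.
efficiency_one_per_machine = parallel_efficiency(5.0, machines)

# All cores in use: maximum observed speedup of 11x on 40 workers.
efficiency_all_cores = parallel_efficiency(11.0, total_cores)

print(f"total cores: {total_cores}")
print(f"efficiency, one process per machine: {efficiency_one_per_machine:.3f}")
print(f"efficiency, all cores: {efficiency_all_cores:.3f}")
```

Read this way, the one-process-per-machine run scales ideally, while the all-cores run reaches roughly a quarter of ideal scaling, which is consistent with the abstract's remark that shared-memory performance has room for improvement.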