Superword-Level Parallelism in the Presence of Control Flow
| Published in: | International Symposium on Code Generation and Optimization : CGO 2005 : 20-23 March, 2005 : San Jose, California, pp. 165 - 175 |
|---|---|
| Main authors: | Shin, Jaewook; Hall, Mary; Chame, Jacqueline |
| Format: | Conference proceeding |
| Language: | English |
| Published: | Washington, DC, USA: IEEE Computer Society; IEEE, 20 March 2005 |
| Series: | ACM Conferences |
| Subjects: | |
| ISBN: | 9780769522982, 076952298X |
| Online access: | Full text |
| Abstract | In this paper, we describe how to extend the concept of superword-level parallelization (SLP), used for multimedia extension architectures, so that it can be applied in the presence of control flow constructs. Superword-level parallelization involves identifying scalar instructions in a large basic block that perform the same operation, and, if dependences do not prevent it, combining them into a superword operation on a multi-word object. A key insight is that we can use techniques related to optimizations for architectures supporting predicated execution, even for multimedia ISAs that do not provide hardware predication. We derive large basic blocks with predicated instructions to which SLP can be applied. We describe how to minimize overheads for superword predicates and re-introduce control flow for scalar operations. We discuss other extensions to SLP to address common features of real multimedia codes. We present automatically-generated performance results on 8 multimedia codes to demonstrate the power of this approach. We observe speedups ranging from 1.97X to 15.07X as compared to both sequential execution and SLP alone. |
|---|---|
| Author | Shin, Jaewook; Hall, Mary; Chame, Jacqueline |
| Author_xml | 1. Shin, Jaewook (University of Southern California, Marina del Rey); 2. Hall, Mary (University of Southern California, Marina del Rey); 3. Chame, Jacqueline (University of Southern California, Marina del Rey) |
| ContentType | Conference Proceeding |
| DOI | 10.1109/CGO.2005.33 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings; IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume; IEEE Xplore All Conference Proceedings; IEEE Electronic Library (IEL); IEEE Proceedings Order Plans (POP All) 1998-Present; Computer and Information Systems Abstracts; Technology Research Database; ProQuest Computer Science Collection; Advanced Technologies Database with Aerospace; Computer and Information Systems Abstracts Academic; Computer and Information Systems Abstracts Professional |
| DatabaseTitle | Computer and Information Systems Abstracts; Technology Research Database; Computer and Information Systems Abstracts – Academic; Advanced Technologies Database with Aerospace; ProQuest Computer Science Collection; Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | Computer and Information Systems Abstracts |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EndPage | 175 |
| ExternalDocumentID | 1402086 |
| Genre | orig-research Conference Paper |
| ISBN | 9780769522982 076952298X |
| ISICitedReferencesCount | 50 |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| MeetingName | CGO05: 3rd Annual IEEE / ACM International Symposium on Code Generation and Optimization |
| PageCount | 11 |
| PublicationCentury | 2000 |
| PublicationDate | 20050320 20050000 |
| PublicationDateYYYYMMDD | 2005-03-20 2005-01-01 |
| PublicationDate_xml | – month: 03 year: 2005 text: 20050320 day: 20 |
| PublicationDecade | 2000 |
| PublicationPlace | Washington, DC, USA |
| PublicationPlace_xml | – name: Washington, DC, USA |
| PublicationSeriesTitle | ACM Conferences |
| PublicationTitle | International Symposium on Code Generation and Optimization : CGO 2005 : 20-23 March, 2005 : San Jose, California |
| PublicationTitleAbbrev | CGO |
| PublicationYear | 2005 |
| Publisher | IEEE Computer Society IEEE |
| Publisher_xml | – name: IEEE Computer Society – name: IEEE |
| SourceID | proquest ieee acm |
| SourceType | Aggregation Database Publisher |
| StartPage | 165 |
| SubjectTerms | Aggregates; Appropriate technology; Automatic control; Computer architecture; Computing methodologies -- Parallel computing methodologies -- Parallel programming languages; Concurrent computing; Hardware; Instruction sets; Microprocessors; Open loop systems; Parallel processing; Security and privacy -- Systems security -- Information flow control; Security and privacy -- Systems security -- Operating systems security; Software and its engineering -- Software notations and tools -- General programming languages -- Language types -- Parallel programming languages; Theory of computation -- Models of computation -- Concurrency; Theory of computation -- Models of computation -- Concurrency -- Parallel computing models |
| Title | Superword-Level Parallelism in the Presence of Control Flow |
| URI | https://ieeexplore.ieee.org/document/1402086 https://www.proquest.com/docview/31589605 |


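The abstract describes converting control flow into predicated straight-line code, packing the resulting scalar operations into superword operations, and merging results under a superword predicate on ISAs without hardware predication. The following is a minimal illustrative sketch of that idea in portable C, not the authors' implementation. The `superword` struct, the `sw_*` helpers, the four-lane width, and the thresholding loop are all hypothetical stand-ins chosen for illustration; a real compiler would emit multimedia-extension instructions instead of the per-lane loops shown here.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define LANES 4  /* assumed superword width: four 32-bit elements */

/* Hypothetical stand-in for a hardware superword register. */
typedef struct { uint32_t lane[LANES]; } superword;

static superword sw_load(const uint32_t *p) {
    superword v;
    for (int k = 0; k < LANES; k++) v.lane[k] = p[k];
    return v;
}

static void sw_store(uint32_t *p, superword v) {
    for (int k = 0; k < LANES; k++) p[k] = v.lane[k];
}

static superword sw_splat(uint32_t x) {
    superword v;
    for (int k = 0; k < LANES; k++) v.lane[k] = x;
    return v;
}

static superword sw_sub(superword a, superword b) {
    superword r;
    for (int k = 0; k < LANES; k++) r.lane[k] = a.lane[k] - b.lane[k];
    return r;
}

/* Superword predicate: all-ones in lanes where a > b, zero elsewhere. */
static superword sw_cmpgt(superword a, superword b) {
    superword r;
    for (int k = 0; k < LANES; k++)
        r.lane[k] = (a.lane[k] > b.lane[k]) ? UINT32_MAX : 0u;
    return r;
}

/* Merge two superwords lane by lane under a predicate mask. */
static superword sw_select(superword pred, superword if_true, superword if_false) {
    superword r;
    for (int k = 0; k < LANES; k++)
        r.lane[k] = (pred.lane[k] & if_true.lane[k]) |
                    (~pred.lane[k] & if_false.lane[k]);
    return r;
}

/* Scalar loop with control flow: the branch splits the body into small blocks. */
void threshold_scalar(const uint32_t *a, uint32_t *b, size_t n, uint32_t t) {
    for (size_t i = 0; i < n; i++) {
        if (a[i] > t) b[i] = a[i] - t;
        else          b[i] = 0;
    }
}

/* If-converted and packed form: both sides are computed, then merged by select. */
void threshold_superword(const uint32_t *a, uint32_t *b, size_t n, uint32_t t) {
    assert(a != NULL && b != NULL);
    superword vt = sw_splat(t);
    superword vzero = sw_splat(0);
    size_t i = 0;
    for (; i + LANES <= n; i += LANES) {
        superword va = sw_load(a + i);
        superword pred = sw_cmpgt(va, vt);      /* the branch condition as a mask */
        superword then_val = sw_sub(va, vt);    /* computed in every lane; may wrap */
        sw_store(b + i, sw_select(pred, then_val, vzero));
    }
    /* Leftover elements keep the scalar control flow. */
    for (; i < n; i++) b[i] = (a[i] > t) ? a[i] - t : 0;
}
```

In this sketch the subtraction runs in every lane, including lanes where the condition is false; unsigned wraparound keeps that well defined, and the select discards those lanes. Calling `threshold_scalar` and `threshold_superword` on the same input should produce identical output arrays.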