Superword-Level Parallelism in the Presence of Control Flow

In this paper, we describe how to extend the concept of superword-level parallelization (SLP), used for multimedia extension architectures, so that it can be applied in the presence of control flow constructs. Superword-level parallelization involves identifying scalar instructions in a large basic...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International Symposium on Code Generation and Optimization : CGO 2005 : 20-23 March, 2005 : San Jose, California S. 165 - 175
Hauptverfasser: Shin, Jaewook, Hall, Mary, Chame, Jacqueline
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: Washington, DC, USA IEEE Computer Society 20.03.2005
IEEE
Schriftenreihe:ACM Conferences
Schlagworte:
ISBN:9780769522982, 076952298X
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract In this paper, we describe how to extend the concept of superword-level parallelization (SLP), used for multimedia extension architectures, so that it can be applied in the presence of control flow constructs. Superword-level parallelization involves identifying scalar instructions in a large basic block that perform the same operation, and, if dependences do not prevent it, combining them into a superword operation on a multi-word object. A key insight is that we can use techniques related to optimizations for architectures supporting predicated execution, even for multimedia ISAs that do not provide hardware predication. We derive large basic blocks with predicated instructions to which SLP can be applied. We describe how to minimize overheads for superword predicates and re-introduce control flow for scalar operations. We discuss other extensions to SLP to address common features of real multimedia codes. We present automatically-generated performance results on 8 multimedia codes to demonstrate the power of this approach. We observe speedups ranging from 1.97X to 15.07X as compared to both sequential execution and SLP alone.
AbstractList In this paper, we describe how to extend the concept of superword-level parallelization (SLP), used for multimedia extension architectures, so that it can be applied in the presence of control flow constructs. Superword-level parallelization involves identifying scalar instructions in a large basic block that perform the same operation, and, if dependences do not prevent it, combining them into a superword operation on a multi-word object. A key insight is that we can use techniques related to optimizations for architectures supporting predicated execution, even for multimedia ISAs that do not provide hardware predication. We derive large basic blocks with predicated instructions to which SLP can be applied. We describe how to minimize overheads for superword predicates and re-introduce control flow for scalar operations. We discuss other extensions to SLP to address common features of real multimedia codes. We present automatically-generated performance results on 8 multimedia codes to demonstrate the power of this approach. We observe speedups ranging from 1.97X to 15.07X as compared to both sequential execution and SLP alone.
Author Shin, Jaewook
Hall, Mary
Chame, Jacqueline
Author_xml – sequence: 1
  givenname: Jaewook
  surname: Shin
  fullname: Shin, Jaewook
  organization: University of Southern California, Marina del Rey
– sequence: 2
  givenname: Mary
  surname: Hall
  fullname: Hall, Mary
  organization: University of Southern California, Marina del Rey
– sequence: 3
  givenname: Jacqueline
  surname: Chame
  fullname: Chame, Jacqueline
  organization: University of Southern California, Marina del Rey
BookMark eNqNkEFLwzAYhgMqqHMnj156EA9C55ekbRI8SXFTGGygnkOafsVq2sykc_jv7Zg_wPfyHt6H9_Cck-Pe90jIJYUZpaDuysVqxgDyGedHZKqEBFGonDEl2SmZxvgBY7jiooAzcv-y3WDY-VCnS_xGl6xNMM6ha2OXtH0yvGOyDhixt5j4Jil9PwTvkrnzuwty0hgXcfrXE_I2f3wtn9LlavFcPixTwwQMqaLAkUpkoqgzzilaNLVQVSOt5AqUaWwGylY2k1axoq7zOm8kz2kmZCWk4RNyc_jdBP-1xTjoro0WnTM9-m3UnOZSFZCP4NUBbBFRb0LbmfCjaQYMZDGut4fV2E5X3n9GTUHvlelRmd4r05zrKrTYjPD1P2D-Cylna60
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1109/CGO.2005.33
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList
Computer and Information Systems Abstracts

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EndPage 175
ExternalDocumentID 1402086
Genre orig-research
Conference Paper
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AARBI
ACM
ADPZR
ALMA_UNASSIGNED_HOLDINGS
APO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
GUFHI
IERZE
OCL
RIB
RIC
RIE
RIL
AAWTH
LHSKQ
7SC
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-a270t-9103e18e276d4331ecead79bf8c83909afc409cbc48c926dd5d5f8351478b78a3
IEDL.DBID RIE
ISBN 9780769522982
076952298X
ISICitedReferencesCount 50
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000228332300013&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Fri Jul 11 10:57:12 EDT 2025
Wed Aug 27 02:14:15 EDT 2025
Wed Jan 31 06:46:24 EST 2024
Wed Jan 31 06:45:08 EST 2024
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MeetingName CGO05: 3rd Annual IEEE / ACM International Symposium on Code Generation and Optimization
MergedId FETCHMERGED-LOGICAL-a270t-9103e18e276d4331ecead79bf8c83909afc409cbc48c926dd5d5f8351478b78a3
Notes SourceType-Conference Papers & Proceedings-1
ObjectType-Conference Paper-1
content type line 25
PQID 31589605
PQPubID 23500
PageCount 11
ParticipantIDs acm_books_10_1109_CGO_2005_33_brief
acm_books_10_1109_CGO_2005_33
ieee_primary_1402086
proquest_miscellaneous_31589605
PublicationCentury 2000
PublicationDate 20050320
20050000
PublicationDateYYYYMMDD 2005-03-20
2005-01-01
PublicationDate_xml – month: 03
  year: 2005
  text: 20050320
  day: 20
PublicationDecade 2000
PublicationPlace Washington, DC, USA
PublicationPlace_xml – name: Washington, DC, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle International Symposium on Code Generation and Optimization : CGO 2005 : 20-23 March, 2005 : San Jose, California
PublicationTitleAbbrev CGO
PublicationYear 2005
Publisher IEEE Computer Society
IEEE
Publisher_xml – name: IEEE Computer Society
– name: IEEE
SSID ssj0000393760
Score 1.8864771
Snippet In this paper, we describe how to extend the concept of superword-level parallelization (SLP), used for multimedia extension architectures, so that it can be...
SourceID proquest
ieee
acm
SourceType Aggregation Database
Publisher
StartPage 165
SubjectTerms Aggregates
Appropriate technology
Automatic control
Computer architecture
Computing methodologies -- Parallel computing methodologies -- Parallel programming languages
Concurrent computing
Hardware
Instruction sets
Microprocessors
Open loop systems
Parallel processing
Security and privacy -- Systems security -- Information flow control
Security and privacy -- Systems security -- Operating systems security
Software and its engineering -- Software notations and tools -- General programming languages -- Language types -- Parallel programming languages
Theory of computation -- Models of computation -- Concurrency
Theory of computation -- Models of computation -- Concurrency -- Parallel computing models
Title Superword-Level Parallelism in the Presence of Control Flow
URI https://ieeexplore.ieee.org/document/1402086
https://www.proquest.com/docview/31589605
WOSCitedRecordID wos000228332300013&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8QwEB5c8eDJN67PgB6tm77ywOPi6kF0QQVvoU0msFDbZR_6902y3fWgCN5aSkqZTJPvm8w3A3Apdc5EwUyEDgw5gsJsVNBMR9waljEhssSGkvkP_PFRvL3J4RpcrbQwiBiSz_DaX4azfNPouQ-V9WJPdgTrQIdzttBqreIpXmPKGV0wc-lQhRTLAjvL-6TV58VU9vp3T4uAiu-Y2yn0e9td5ceSHPaZwdb_vnAb9r8Fe2S42op2YA3rXdhadmwg7Q-8BzfP8zFOPh3hjCqfLUR85e-qwmo0fSejmjg0SMZBkORe2FjSJrITWzWf-_A6uH3p30dt-4SoSDiduWWMphgLTDgzXheF2nkNl6UV2qEiKgurHbnTpc6ElgkzJje5FT6zn4uSiyI9gPW6qfEQiIhzw2lsEmllpt0cFtqWMRp0YEpIqrtw6kyoPC-YqkArqFTOxL7JZa7StAsXfz5X5WSEtgt73qJqvKizoVpjduF8OSXK-b4_0ChqbOZTlca5cAwsP_p94DFshkKrIWByAuuzyRxPYUN_zEbTyVlwny_8or7u
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSyQxEC50FNbT-FqcddXAerQ1_coDj-ILZ0dBBW-hO6nAQNs9zEP__iaZntmDInjrpknTVKqT76vUVwVwLHXORMFMhA4MOYLCbFTQTEfcGpYxIbLEhpL5fT4YiJcX-bACJ0stDCKG5DM89ZfhLN80euZDZWexJzuCrcJanmUJnau1lhEVrzLljM65uXS4QopFiZ3FfdIq9GIqzy6u7-chFd8zd7XQr21_lQ-Lcthprrrf-8ZN2P0v2SMPy81oC1aw3obuomcDaX_hHTh_nI1w_O4oZ1T5fCHia39XFVbDySsZ1sThQTIKkiT3wsaSNpWd2Kp534Xnq8uni5uobaAQFQmnU7eQ0RRjgQlnxiujUDu_4bK0QjtcRGVhtaN3utSZ0DJhxuQmt8Ln9nNRclGkP6FTNzXuARFxbjiNTSKtzLSbxULbMkaDDk4JSXUPDpwJlWcGExWIBZXKmdi3ucxVmvbgz5fPVTkeou3BjreoGs0rbajWmD04WkyJct7vjzSKGpvZRKVxLhwHy399PvAIftw8_e2r_u3gbh82QtnVED75DZ3peIYHsK7fpsPJ-DC40j8bcsI1
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=International+Symposium+on+Code+Generation+and+Optimization+%3A+CGO+2005+%3A+20-23+March%2C+2005+%3A+San+Jose%2C+California&rft.atitle=Superword-level+parallelism+in+the+presence+of+control+flow&rft.au=Shin%2C+J.&rft.au=Hall%2C+M.&rft.au=Chame%2C+J.&rft.date=2005-01-01&rft.pub=IEEE&rft.isbn=9780769522982&rft.spage=165&rft.epage=175&rft_id=info:doi/10.1109%2FCGO.2005.33&rft.externalDocID=1402086
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780769522982/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780769522982/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9780769522982/sc.gif&client=summon&freeimage=true