Optimizing shared cache behavior of chip multiprocessors
One of the critical problems associated with emerging chip multiprocessors (CMPs) is the management of on-chip shared cache space. Unfortunately, single processor centric data locality optimization schemes may not work well in the CMP case as data accesses from multiple cores can create conflicts in...
Saved in:
| Published in: | 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) pp. 505 - 516 |
|---|---|
| Main Authors: | , , , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
New York, NY, USA
ACM
12.12.2009
IEEE |
| Series: | ACM Conferences |
| Subjects: | |
| ISBN: | 9781605587981, 1605587982 |
| ISSN: | 1072-4451 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | One of the critical problems associated with emerging chip multiprocessors (CMPs) is the management of on-chip shared cache space. Unfortunately, single processor centric data locality optimization schemes may not work well in the CMP case as data accesses from multiple cores can create conflicts in the shared cache space. The main contribution of this paper is a compiler directed code restructuring scheme for enhancing locality of shared data in CMPs. The proposed scheme targets the last level shared cache that exist in many commercial CMPs and has two components, namely, allocation, which determines the set of loop iterations assigned to each core, and scheduling, which determines the order in which the iterations assigned to a core are executed. Our scheme restructures the application code such that the different cores operate on shared data blocks at the same time, to the extent allowed by data dependencies. This helps to reduce reuse distances for the shared data and improves on-chip cache performance. We evaluated our approach using the Splash-2 and Parsec applications through both simulations and experiments on two commercial multi-core machines. Our experimental evaluation indicates that the proposed data locality optimization scheme improves inter-core conflict misses in the shared cache by 67% on average when both allocation and scheduling are used. Also, the execution time improvements we achieve (29% on average) are very close to the optimal savings that could be achieved using a hypothetical scheme. |
|---|---|
| AbstractList | One of the critical problems associated with emerging chip multiprocessors (CMPs) is the management of on-chip shared cache space. Unfortunately, single processor centric data locality optimization schemes may not work well in the CMP case as data accesses from multiple cores can create conflicts in the shared cache space. The main contribution of this paper is a compiler directed code restructuring scheme for enhancing locality of shared data in CMPs. The proposed scheme targets the last level shared cache that exist in many commercial CMPs and has two components, namely, allocation, which determines the set of loop iterations assigned to each core, and scheduling, which determines the order in which the iterations assigned to a core are executed. Our scheme restructures the application code such that the different cores operate on shared data blocks at the same time, to the extent allowed by data dependencies. This helps to reduce reuse distances for the shared data and improves on-chip cache performance. We evaluated our approach using the Splash-2 and Parsec applications through both simulations and experiments on two commercial multi-core machines. Our experimental evaluation indicates that the proposed data locality optimization scheme improves inter-core conflict misses in the shared cache by 67% on average when both allocation and scheduling are used. Also, the execution time improvements we achieve (29% on average) are very close to the optimal savings that could be achieved using a hypothetical scheme. |
| Author | Muralidhara, Sai Prashanth Kandemir, Mahmut Narayanan, Sri Hari Krishna Zhang, Yuanrui Ozturk, Ozcan |
| Author_xml | – sequence: 1 givenname: Mahmut surname: Kandemir fullname: Kandemir, Mahmut organization: Pennsylvania State University – sequence: 2 givenname: Sai Prashanth surname: Muralidhara fullname: Muralidhara, Sai Prashanth organization: Pennsylvania State University – sequence: 3 givenname: Sri Hari Krishna surname: Narayanan fullname: Narayanan, Sri Hari Krishna organization: Pennsylvania State University – sequence: 4 givenname: Yuanrui surname: Zhang fullname: Zhang, Yuanrui organization: Pennsylvania State University – sequence: 5 givenname: Ozcan surname: Ozturk fullname: Ozturk, Ozcan organization: Bilkent University |
| BookMark | eNqNkLtOAzEURC0BEiGkpqDZkmaDr98uUcRLipQGasv22qwhG6_sgARfT0LyAUwzxRxNcS7Q6SZvAkJXgOcAjN-CEBqAzP9aihM001KBwJwrqRWcoglgSVrGOJyjWa3veJcdqymbILUat2lIP2nz1tTeltA13vo-NC709ivl0uTY-D6NzfC53qaxZB9qzaVeorNo1zXMjj1Frw_3L4undrl6fF7cLVsLgmxboDRSG7sOK0VIoIE67QnuQALhQCPhrNM6UuWEddJ6rTqwlhJqCXVCSzpF14ffFEIwY0mDLd-GU8kZ2683h9X6wbicP6oBbPZazFGLOWrZofN_osaVFCL9Ba7UYUc |
| ContentType | Conference Proceeding |
| Copyright | 2009 ACM |
| Copyright_xml | – notice: 2009 ACM |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1145/1669112.1669176 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EndPage | 516 |
| ExternalDocumentID | 5375447 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AARBI ACM ADPZR ALMA_UNASSIGNED_HOLDINGS APO BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK GUFHI IERZE OCL RIB RIC RIE RIL -~X 123 29O AAWTH ADZIZ AFFNX CHZPO IPLJI M43 RNS |
| ID | FETCH-LOGICAL-a162t-133f3afdd08822e3e3b9c20d1712513f254d99f38b6ab7ac98d1aa323a23b6973 |
| IEDL.DBID | RIE |
| ISBN | 9781605587981 1605587982 |
| ISSN | 1072-4451 |
| IngestDate | Wed Aug 27 02:52:36 EDT 2025 Wed Jan 31 06:44:13 EST 2024 Wed Jan 31 06:43:54 EST 2024 |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| License | Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org |
| LinkModel | DirectLink |
| MeetingName | Micro-42: The 42nd Annual IEEE/ACM International Symposium on Microarchitecture |
| MergedId | FETCHMERGED-LOGICAL-a162t-133f3afdd08822e3e3b9c20d1712513f254d99f38b6ab7ac98d1aa323a23b6973 |
| PageCount | 12 |
| ParticipantIDs | ieee_primary_5375447 acm_books_10_1145_1669112_1669176 acm_books_10_1145_1669112_1669176_brief |
| PublicationCentury | 2000 |
| PublicationDate | 20091212 2009-Dec. |
| PublicationDateYYYYMMDD | 2009-12-12 2009-12-01 |
| PublicationDate_xml | – month: 12 year: 2009 text: 20091212 day: 12 |
| PublicationDecade | 2000 |
| PublicationPlace | New York, NY, USA |
| PublicationPlace_xml | – name: New York, NY, USA |
| PublicationSeriesTitle | ACM Conferences |
| PublicationTitle | 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) |
| PublicationTitleAbbrev | MICRO |
| PublicationYear | 2009 |
| Publisher | ACM IEEE |
| Publisher_xml | – name: ACM – name: IEEE |
| SSID | ssj0000669934 ssj0008695 |
| Score | 1.8774037 |
| Snippet | One of the critical problems associated with emerging chip multiprocessors (CMPs) is the management of on-chip shared cache space. Unfortunately, single... |
| SourceID | ieee acm |
| SourceType | Publisher |
| StartPage | 505 |
| SubjectTerms | Algorithm Algorithm design and analysis Computer languages Costs Design Energy consumption Experimentation Hardware -- Integrated circuits -- Semiconductor memory -- Dynamic memory Job shop scheduling Laboratories Manufacturing Permission Processor scheduling Software and its engineering -- Software notations and tools -- Compilers Yarn |
| Title | Optimizing shared cache behavior of chip multiprocessors |
| URI | https://ieeexplore.ieee.org/document/5375447 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NS8NAEB3a4sFT1VasX6wgeDFtspvNZs9i8VR7UOgt7CfNoU1JWw_-eneTNCII4ikfLCE8sjszm3nvAdwLjoXEUgdUUhLEgrspZYUNXDmkubGcG6kqswk2m6WLBZ934LHlwhhjquYzM_an1b98Xai93yqbUO_XGrMudBlLaq5Wu5_iQqcLtXG7CqdJ5bjiqhsceBEuT-pyqTtNGU9xo_V0uI4azZ8oppPIPcflIOPqWCmRCLX6Yb1SRZ5p_3_vfALDbwofmrfB6RQ6Zn0G_YOHA2qm9ADSV7dmrPJPNwhtl74bHSmv8YwO_H1UWKSW-QbVrYc1r6Aot0N4nz6_Pb0EjZtCIKIEe895YomwWvukGhtiiOQKhzpiPsch1lWKmnNLUpkIyYTiqY6EIJgITGTCGTmH3rpYmwtAXsTQuOCnlBExDbVIKbXYhqEVMjKcj-DOgZb5MmGb1cxnmjXAZg2wI3j4c0wmy9zYEQw8rNmmlt_IGkQvf799Bce4sXgIo2vo7cq9uYEj9bHLt-Vt9c18Af_YtfE |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3NS8MwFH_MKehp6ibOzwiCFzvbpGmbszgmzrnDhN1KPlkPW8c-PPjXm7TdRBDEU9ISQnk0ee8l7_f7AdxyhrnAQnlUUOKFnNklZbjxbDqkmDaMaSELsYl4MEjGYzaswf0WC6O1LorPdMd1i7t8lcu1Oyp7oE6vNYx3YNc22C_RWtsTFes8rbMNt_twEhWaKza_wZ6j4XKwLhu80yRmCa7YnjbPQcX6E4T0IbDz2CikU7QFFwmX0x_iK4Xv6Tb-99WH0PoG8aHh1j0dQU3PjqGxUXFA1aJuQvJmd41p9mkHoeXE1aMj6Vie0QbBj3KD5CSbo7L4sEQW5ItlC967T6PHnlfpKXg8iLBTnSeGcKOUC6uxJpoIJrGvgthFOcTYXFExZkgiIi5iLlmiAs4JJhwTEbGYnEB9ls_0KSBHY6it-5NS85D6iieUGmx833ARaMbacGONlrpEYZmW2GeaVoZNK8O24e7PMalYZNq0oenMms5LAo60sujZ76-vYb83eu2n_efByzkc4ErwwQ8uoL5arPUl7MmPVbZcXBX_zxewZLk4 |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2009+42nd+Annual+IEEE%2FACM+International+Symposium+on+Microarchitecture+%28MICRO%29&rft.atitle=Optimizing+shared+cache+behavior+of+chip+multiprocessors&rft.au=Kandemir%2C+M.&rft.au=Muralidhara%2C+S.P.&rft.au=Narayanan%2C+S.H.K.&rft.au=Yuanrui+Zhang&rft.date=2009-12-01&rft.pub=IEEE&rft.isbn=9781605587981&rft.issn=1072-4451&rft.spage=505&rft.epage=516&rft_id=info:doi/10.1145%2F1669112.1669176&rft.externalDocID=5375447 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1072-4451&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1072-4451&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1072-4451&client=summon |

