A scalable processing-in-memory accelerator for parallel graph processing

The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly important. In particular, large-scale graph processing is gaining attention due to its broad applicability from social science to machine learning....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings - International Symposium on Computer Architecture S. 105 - 117
Hauptverfasser: Ahn, Junwhan, Hong, Sungpack, Yoo, Sungjoo, Mutlu, Onur, Choi, Kiyoung
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: IEEE 01.06.2015
Schlagworte:
ISSN:1063-6897
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly important. In particular, large-scale graph processing is gaining attention due to its broad applicability from social science to machine learning. However, scalable hardware design that can efficiently process large graphs in main memory is still an open problem. Ideally, cost-effective and scalable graph processing systems can be realized by building a system whose performance increases proportionally with the sizes of graphs that can be stored in the system, which is extremely challenging in conventional systems due to severe memory bandwidth limitations. In this work, we argue that the conventional concept of processing-in-memory (PIM) can be a viable solution to achieve such an objective. The key modern enabler for PIM is the recent advancement of the 3D integration technology that facilitates stacking logic and memory dies in a single package, which was not available when the PIM concept was originally examined. In order to take advantage of such a new technology to enable memory-capacity-proportional performance, we design a programmable PIM accelerator for large-scale graph processing called Tesseract. Tesseract is composed of (1) a new hardware architecture that fully utilizes the available memory bandwidth, (2) an efficient method of communication between different memory partitions, and (3) a programming interface that reflects and exploits the unique hardware design. It also includes two hardware prefetchers specialized for memory access patterns of graph processing, which operate based on the hints provided by our programming model. Our comprehensive evaluations using five state-of-the-art graph processing workloads with large real-world graphs show that the proposed architecture improves average system performance by a factor of ten and achieves 87% average energy reduction over conventional systems.
AbstractList The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly important. In particular, large-scale graph processing is gaining attention due to its broad applicability from social science to machine learning. However, scalable hardware design that can efficiently process large graphs in main memory is still an open problem. Ideally, cost-effective and scalable graph processing systems can be realized by building a system whose performance increases proportionally with the sizes of graphs that can be stored in the system, which is extremely challenging in conventional systems due to severe memory bandwidth limitations. In this work, we argue that the conventional concept of processing-in-memory (PIM) can be a viable solution to achieve such an objective. The key modern enabler for PIM is the recent advancement of the 3D integration technology that facilitates stacking logic and memory dies in a single package, which was not available when the PIM concept was originally examined. In order to take advantage of such a new technology to enable memory-capacity-proportional performance, we design a programmable PIM accelerator for large-scale graph processing called Tesseract. Tesseract is composed of (1) a new hardware architecture that fully utilizes the available memory bandwidth, (2) an efficient method of communication between different memory partitions, and (3) a programming interface that reflects and exploits the unique hardware design. It also includes two hardware prefetchers specialized for memory access patterns of graph processing, which operate based on the hints provided by our programming model. Our comprehensive evaluations using five state-of-the-art graph processing workloads with large real-world graphs show that the proposed architecture improves average system performance by a factor of ten and achieves 87% average energy reduction over conventional systems.
Author Ahn, Junwhan
Choi, Kiyoung
Yoo, Sungjoo
Hong, Sungpack
Mutlu, Onur
Author_xml – sequence: 1
  givenname: Junwhan
  surname: Ahn
  fullname: Ahn, Junwhan
  email: junwhan@snu.ac.kr
  organization: Seoul National University, Korea
– sequence: 2
  givenname: Sungpack
  surname: Hong
  fullname: Hong, Sungpack
  email: sungpack.hong@oracle.com
  organization: Oracle Labs, USA
– sequence: 3
  givenname: Sungjoo
  surname: Yoo
  fullname: Yoo, Sungjoo
  email: sungjoo.yoo@gmail.com
  organization: Seoul National University, Korea
– sequence: 4
  givenname: Onur
  surname: Mutlu
  fullname: Mutlu, Onur
  email: onur@cmu.edu
  organization: Carnegie Mellon University, USA
– sequence: 5
  givenname: Kiyoung
  surname: Choi
  fullname: Choi, Kiyoung
  email: kchoi@snu.ac.kr
  organization: Seoul National University, Korea
BookMark eNpNjjtPwzAURo1UJNrSmYElfyDFj1z7eqwqHpUqscBc3TjXJchNIrtL_z2VYGD4dKZz9C3EbBgHFuJBybVSDTxp1_jG-rV2IA3aG7HyDhVIL6UGa2dirqQ1tUXv7sSilG8plfdg52K3qUqgRG3iaspj4FL64Vj3Q33i05gvFYXAiTOdx1zF6ybKlBKn6php-vrn3IvbSKnw6o9L8fny_LF9q_fvr7vtZl-TUeZcR9LggYAiMCC2iOyxkyqGttUaFXVeGcsYNbDBVnbXqyYoluhBdY0zS_H42-2Z-TDl_kT5cnAaGwne_AB7004z
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1145/2749469.2750386
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9781509002566
1509002561
EndPage 117
ExternalDocumentID 7284059
Genre orig-research
GroupedDBID 23M
29F
29O
6IE
6IF
6IH
6IK
6IL
6IM
6IN
AAJGR
AAWTH
ACGFS
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
APO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IJVOP
IPLJI
M43
OCL
RIE
RIL
RIO
ZY4
ID FETCH-LOGICAL-a313t-fa2595a5af5e588b88e98d01fcbb2281ad9136e8f25e38b0d0013c1e08951d473
IEDL.DBID RIE
ISICitedReferencesCount 383
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000380455700009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1063-6897
IngestDate Wed Jun 04 06:02:04 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a313t-fa2595a5af5e588b88e98d01fcbb2281ad9136e8f25e38b0d0013c1e08951d473
PageCount 13
ParticipantIDs ieee_primary_7284059
PublicationCentury 2000
PublicationDate 20150601
PublicationDateYYYYMMDD 2015-06-01
PublicationDate_xml – month: 06
  year: 2015
  text: 20150601
  day: 01
PublicationDecade 2010
PublicationTitle Proceedings - International Symposium on Computer Architecture
PublicationTitleAbbrev ISCA
PublicationYear 2015
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0019956
ssib048751269
Score 2.528052
Snippet The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly...
SourceID ieee
SourceType Publisher
StartPage 105
SubjectTerms Internet
Lead
Out of order
Parallel processing
Prefetching
System-on-chip
Title A scalable processing-in-memory accelerator for parallel graph processing
URI https://ieeexplore.ieee.org/document/7284059
WOSCitedRecordID wos000380455700009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEB7a4sFT1VZ8k4NH0242m032KGJRkNKDSm8lj1ko1Lb0IfjvTbLb6sGLt7AksGSSnW925vsG4FZpTKS0gpqk9AGKcYIWDnPKtXQSRSlkVLx5f5HDoRqPi1ED7vZcGESMxWfYC8OYy3cLuw2_yvrSf0s9HGhCU8q84mrtzk7A3Sw2y64zCIGxGTOdOae5KmQt68My0feBWOGjwl6QNueBRP2rr0p0K4P2_17oCLo__Dwy2nueY2jg_ATauwYNpL6vHXi-J2tvg8COIsuKEeDn0-mcfoQC2y-irfVuJ2baiUevJAiBz2Y4I1HH-teaLrwNHl8fnmjdO4FqzviGltrHNUILXQoUShmlsFAuYaU1Jk0V065gPEdVpgK5MokLWNAyTJSHXC6T_BRa88Ucz4Ck0l9sb8nMWZdZyzTLU4OCYap5Kgw_h07YmcmykseY1Jty8ffjSzj0mENU1VZX0NqstngNB_ZzM12vbqJNvwFN3KA_
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LTwIxEG4QTfSECsa3PXi0sN1ut92jMRKISDig4Ub6mE1MEAgPE_-9bXdBDl68NZs22XTanW925vsGoXupIBLCcKKj3AUo2nKSWUgJU8IK4DkXQfHmvSf6fTkaZYMKethyYQAgFJ9B0w9DLt_OzNr_KmsJ9y11cGAP7fMkiaOCrbU5PR5509Auu8wheM5myHWmjKQyE6WwD014y4VimYsLm17cnHka9U5nleBY2rX_vdIxavwy9PBg63tOUAWmp6i2adGAyxtbR91HvHRW8PwoPC84AW4--ZiST19i-42VMc7xhFw7dvgVeynwyQQmOChZ76xpoLf28_CpQ8ruCUQxylYkVy6y4YqrnAOXUksJmbQRzY3WcSypshllKcg85sCkjqxHg4ZCJB3osolgZ6g6nU3hHOFYuKvtbJlYYxNjqKJprIFTiBWLuWYXqO53ZjwvBDLG5aZc_v34Dh12hq-9ca_bf7lCRw6B8KL26hpVV4s13KAD87X6WC5ug31_ANXwo4Y
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+-+International+Symposium+on+Computer+Architecture&rft.atitle=A+scalable+processing-in-memory+accelerator+for+parallel+graph+processing&rft.au=Ahn%2C+Junwhan&rft.au=Hong%2C+Sungpack&rft.au=Yoo%2C+Sungjoo&rft.au=Mutlu%2C+Onur&rft.date=2015-06-01&rft.pub=IEEE&rft.issn=1063-6897&rft.spage=105&rft.epage=117&rft_id=info:doi/10.1145%2F2749469.2750386&rft.externalDocID=7284059
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-6897&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-6897&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-6897&client=summon