A scalable processing-in-memory accelerator for parallel graph processing

The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly important. In particular, large-scale graph processing is gaining attention due to its broad applicability from social science to machine learning....

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Proceedings - International Symposium on Computer Architecture S. 105 - 117
Hauptverfasser:	Ahn, Junwhan, Hong, Sungpack, Yoo, Sungjoo, Mutlu, Onur, Choi, Kiyoung
Format:	Tagungsbericht
Sprache:	Englisch
Veröffentlicht:	IEEE 01.06.2015
Schlagworte:	Internet Lead Out of order Parallel processing Prefetching System-on-chip
ISSN:	1063-6897
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Abstract	The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly important. In particular, large-scale graph processing is gaining attention due to its broad applicability from social science to machine learning. However, scalable hardware design that can efficiently process large graphs in main memory is still an open problem. Ideally, cost-effective and scalable graph processing systems can be realized by building a system whose performance increases proportionally with the sizes of graphs that can be stored in the system, which is extremely challenging in conventional systems due to severe memory bandwidth limitations. In this work, we argue that the conventional concept of processing-in-memory (PIM) can be a viable solution to achieve such an objective. The key modern enabler for PIM is the recent advancement of the 3D integration technology that facilitates stacking logic and memory dies in a single package, which was not available when the PIM concept was originally examined. In order to take advantage of such a new technology to enable memory-capacity-proportional performance, we design a programmable PIM accelerator for large-scale graph processing called Tesseract. Tesseract is composed of (1) a new hardware architecture that fully utilizes the available memory bandwidth, (2) an efficient method of communication between different memory partitions, and (3) a programming interface that reflects and exploits the unique hardware design. It also includes two hardware prefetchers specialized for memory access patterns of graph processing, which operate based on the hints provided by our programming model. Our comprehensive evaluations using five state-of-the-art graph processing workloads with large real-world graphs show that the proposed architecture improves average system performance by a factor of ten and achieves 87% average energy reduction over conventional systems.
AbstractList	The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly important. In particular, large-scale graph processing is gaining attention due to its broad applicability from social science to machine learning. However, scalable hardware design that can efficiently process large graphs in main memory is still an open problem. Ideally, cost-effective and scalable graph processing systems can be realized by building a system whose performance increases proportionally with the sizes of graphs that can be stored in the system, which is extremely challenging in conventional systems due to severe memory bandwidth limitations. In this work, we argue that the conventional concept of processing-in-memory (PIM) can be a viable solution to achieve such an objective. The key modern enabler for PIM is the recent advancement of the 3D integration technology that facilitates stacking logic and memory dies in a single package, which was not available when the PIM concept was originally examined. In order to take advantage of such a new technology to enable memory-capacity-proportional performance, we design a programmable PIM accelerator for large-scale graph processing called Tesseract. Tesseract is composed of (1) a new hardware architecture that fully utilizes the available memory bandwidth, (2) an efficient method of communication between different memory partitions, and (3) a programming interface that reflects and exploits the unique hardware design. It also includes two hardware prefetchers specialized for memory access patterns of graph processing, which operate based on the hints provided by our programming model. Our comprehensive evaluations using five state-of-the-art graph processing workloads with large real-world graphs show that the proposed architecture improves average system performance by a factor of ten and achieves 87% average energy reduction over conventional systems.
Author	Ahn, Junwhan Choi, Kiyoung Yoo, Sungjoo Hong, Sungpack Mutlu, Onur
Author_xml	– sequence: 1 givenname: Junwhan surname: Ahn fullname: Ahn, Junwhan email: junwhan@snu.ac.kr organization: Seoul National University, Korea – sequence: 2 givenname: Sungpack surname: Hong fullname: Hong, Sungpack email: sungpack.hong@oracle.com organization: Oracle Labs, USA – sequence: 3 givenname: Sungjoo surname: Yoo fullname: Yoo, Sungjoo email: sungjoo.yoo@gmail.com organization: Seoul National University, Korea – sequence: 4 givenname: Onur surname: Mutlu fullname: Mutlu, Onur email: onur@cmu.edu organization: Carnegie Mellon University, USA – sequence: 5 givenname: Kiyoung surname: Choi fullname: Choi, Kiyoung email: kchoi@snu.ac.kr organization: Seoul National University, Korea
BookMark	eNpNjjtPwzAURo1UJNrSmYElfyDFj1z7eqwqHpUqscBc3TjXJchNIrtL_z2VYGD4dKZz9C3EbBgHFuJBybVSDTxp1_jG-rV2IA3aG7HyDhVIL6UGa2dirqQ1tUXv7sSilG8plfdg52K3qUqgRG3iaspj4FL64Vj3Q33i05gvFYXAiTOdx1zF6ybKlBKn6php-vrn3IvbSKnw6o9L8fny_LF9q_fvr7vtZl-TUeZcR9LggYAiMCC2iOyxkyqGttUaFXVeGcsYNbDBVnbXqyYoluhBdY0zS_H42-2Z-TDl_kT5cnAaGwne_AB7004z
ContentType	Conference Proceeding
DBID	6IE 6IH CBEJK RIE RIO
DOI	10.1145/2749469.2750386
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISBN	9781509002566 1509002561
EndPage	117
ExternalDocumentID	7284059
Genre	orig-research
GroupedDBID	23M 29F 29O 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS APO BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP IPLJI M43 OCL RIE RIL RIO ZY4
ID	FETCH-LOGICAL-a313t-fa2595a5af5e588b88e98d01fcbb2281ad9136e8f25e38b0d0013c1e08951d473
IEDL.DBID	RIE
ISICitedReferencesCount	383
ISICitedReferencesURI	http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000380455700009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN	1063-6897
IngestDate	Wed Jun 04 06:02:04 EDT 2025
IsPeerReviewed	false
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-a313t-fa2595a5af5e588b88e98d01fcbb2281ad9136e8f25e38b0d0013c1e08951d473
PageCount	13
ParticipantIDs	ieee_primary_7284059
PublicationCentury	2000
PublicationDate	20150601
PublicationDateYYYYMMDD	2015-06-01
PublicationDate_xml	– month: 06 year: 2015 text: 20150601 day: 01
PublicationDecade	2010
PublicationTitle	Proceedings - International Symposium on Computer Architecture
PublicationTitleAbbrev	ISCA
PublicationYear	2015
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0019956 ssib048751269
Score	2.528052
Snippet	The explosion of digital data and the ever-growing need for fast data analysis have made in-memory big-data processing in computer systems increasingly...
SourceID	ieee
SourceType	Publisher
StartPage	105
SubjectTerms	Internet Lead Out of order Parallel processing Prefetching System-on-chip
Title	A scalable processing-in-memory accelerator for parallel graph processing
URI	https://ieeexplore.ieee.org/document/7284059
WOSCitedRecordID	wos000380455700009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEB7a4sFT1VZ8k4NH0242m032KGJRkNKDSm8lj1ko1Lb0IfjvTbLb6sGLt7AksGSSnW925vsG4FZpTKS0gpqk9AGKcYIWDnPKtXQSRSlkVLx5f5HDoRqPi1ED7vZcGESMxWfYC8OYy3cLuw2_yvrSf0s9HGhCU8q84mrtzk7A3Sw2y64zCIGxGTOdOae5KmQt68My0feBWOGjwl6QNueBRP2rr0p0K4P2_17oCLo__Dwy2nueY2jg_ATauwYNpL6vHXi-J2tvg8COIsuKEeDn0-mcfoQC2y-irfVuJ2baiUevJAiBz2Y4I1HH-teaLrwNHl8fnmjdO4FqzviGltrHNUILXQoUShmlsFAuYaU1Jk0V065gPEdVpgK5MokLWNAyTJSHXC6T_BRa88Ucz4Ck0l9sb8nMWZdZyzTLU4OCYap5Kgw_h07YmcmykseY1Jty8ffjSzj0mENU1VZX0NqstngNB_ZzM12vbqJNvwFN3KA_
linkProvider	IEEE
linkToHtml	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LTwIxEG4QTfSECsa3PXi0sN1ut92jMRKISDig4Ub6mE1MEAgPE_-9bXdBDl68NZs22XTanW925vsGoXupIBLCcKKj3AUo2nKSWUgJU8IK4DkXQfHmvSf6fTkaZYMKethyYQAgFJ9B0w9DLt_OzNr_KmsJ9y11cGAP7fMkiaOCrbU5PR5509Auu8wheM5myHWmjKQyE6WwD014y4VimYsLm17cnHka9U5nleBY2rX_vdIxavwy9PBg63tOUAWmp6i2adGAyxtbR91HvHRW8PwoPC84AW4--ZiST19i-42VMc7xhFw7dvgVeynwyQQmOChZ76xpoLf28_CpQ8ruCUQxylYkVy6y4YqrnAOXUksJmbQRzY3WcSypshllKcg85sCkjqxHg4ZCJB3osolgZ6g6nU3hHOFYuKvtbJlYYxNjqKJprIFTiBWLuWYXqO53ZjwvBDLG5aZc_v34Dh12hq-9ca_bf7lCRw6B8KL26hpVV4s13KAD87X6WC5ug31_ANXwo4Y
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+-+International+Symposium+on+Computer+Architecture&rft.atitle=A+scalable+processing-in-memory+accelerator+for+parallel+graph+processing&rft.au=Ahn%2C+Junwhan&rft.au=Hong%2C+Sungpack&rft.au=Yoo%2C+Sungjoo&rft.au=Mutlu%2C+Onur&rft.date=2015-06-01&rft.pub=IEEE&rft.issn=1063-6897&rft.spage=105&rft.epage=117&rft_id=info:doi/10.1145%2F2749469.2750386&rft.externalDocID=7284059
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-6897&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-6897&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-6897&client=summon