A Parallel Graph Environment for Real-World Data Analytics Workflows

Economic competitiveness and national security depend increasingly on the insightful analysis of large data sets. The diversity of real-world data sources and analytic workflows impose challenging hardware and software requirements for parallel graph platforms. The irregular nature of graph methods...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Proceedings - Design, Automation, and Test in Europe Conference and Exhibition s. 1313 - 1318
Hlavní autoři: Castellana, Vito Giovanni, Drocco, Maurizio, Feo, John, Firoz, Jesun, Kanewala, Thejaka, Lumsdaine, Andrew, Manzano, Joseph, Marquez, Andres, Minutoli, Marco, Suetterlein, Joshua, Tumeo, Antonino, Zalewski, Marcin
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: EDAA 01.03.2019
Témata:
ISSN:1558-1101
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract Economic competitiveness and national security depend increasingly on the insightful analysis of large data sets. The diversity of real-world data sources and analytic workflows impose challenging hardware and software requirements for parallel graph platforms. The irregular nature of graph methods is not supported well by the deep memory hierarchies of conventional distributed systems, requiring new processor and runtime system designs to tolerate memory and synchronization latencies. Moreover, the efficiency of relational table operations and matrix computations are not attainable when data is stored in common graph data structures. In this paper, we present HAGGLE, a high-performance, scalable data analytics platform. The platform's hybrid data model supports a variety of distributed, thread-safe data structures, parallel programming constructs, and persistent and streaming data. An abstract runtime layer enables us to map the stack to conventional, distributed computer systems with accelerators. The runtime uses multithreading, active messages, and data aggregation to hide memory and synchronization latencies on large-scale systems.
AbstractList Economic competitiveness and national security depend increasingly on the insightful analysis of large data sets. The diversity of real-world data sources and analytic workflows impose challenging hardware and software requirements for parallel graph platforms. The irregular nature of graph methods is not supported well by the deep memory hierarchies of conventional distributed systems, requiring new processor and runtime system designs to tolerate memory and synchronization latencies. Moreover, the efficiency of relational table operations and matrix computations are not attainable when data is stored in common graph data structures. In this paper, we present HAGGLE, a high-performance, scalable data analytics platform. The platform's hybrid data model supports a variety of distributed, thread-safe data structures, parallel programming constructs, and persistent and streaming data. An abstract runtime layer enables us to map the stack to conventional, distributed computer systems with accelerators. The runtime uses multithreading, active messages, and data aggregation to hide memory and synchronization latencies on large-scale systems.
Author Tumeo, Antonino
Manzano, Joseph
Drocco, Maurizio
Feo, John
Firoz, Jesun
Castellana, Vito Giovanni
Minutoli, Marco
Lumsdaine, Andrew
Zalewski, Marcin
Kanewala, Thejaka
Suetterlein, Joshua
Marquez, Andres
Author_xml – sequence: 1
  givenname: Vito Giovanni
  surname: Castellana
  fullname: Castellana, Vito Giovanni
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 2
  givenname: Maurizio
  surname: Drocco
  fullname: Drocco, Maurizio
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 3
  givenname: John
  surname: Feo
  fullname: Feo, John
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 4
  givenname: Jesun
  surname: Firoz
  fullname: Firoz, Jesun
  organization: School of Informatics, Computing, and Engineering Indiana University, Bloomington, IN, USA
– sequence: 5
  givenname: Thejaka
  surname: Kanewala
  fullname: Kanewala, Thejaka
  organization: School of Informatics, Computing, and Engineering Indiana University, Bloomington, IN, USA
– sequence: 6
  givenname: Andrew
  surname: Lumsdaine
  fullname: Lumsdaine, Andrew
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 7
  givenname: Joseph
  surname: Manzano
  fullname: Manzano, Joseph
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 8
  givenname: Andres
  surname: Marquez
  fullname: Marquez, Andres
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 9
  givenname: Marco
  surname: Minutoli
  fullname: Minutoli, Marco
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 10
  givenname: Joshua
  surname: Suetterlein
  fullname: Suetterlein, Joshua
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 11
  givenname: Antonino
  surname: Tumeo
  fullname: Tumeo, Antonino
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
– sequence: 12
  givenname: Marcin
  surname: Zalewski
  fullname: Zalewski, Marcin
  organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA
BookMark eNotj9FKwzAUhqMouE5fQG_yAq3nJE2TXJZ1m8JAkYmX4zRNsZq1Iy3K3t6Bu_rhu_j4_oRd9UPvGbtHyIS0aB-rcrvMBKDNjEaFtrhgibQGrSikkJdshkqZFBHwhiXj-AUASgo7Y1XJXylSCD7wdaTDJ1_2P10c-r3vJ94Okb95CunHEEPDK5qIlz2F49S5kZ_gdxuG3_GWXbcURn933jl7Xy23i6d087J-XpSbtBMgprSxptayBQdKI7nGqxoU1kLr2oNu81MSuhxzyo3TSprGEBS6tUCanKhRztnDv7fz3u8OsdtTPO7Oj-UfHWJLYQ
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.23919/DATE.2019.8715196
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISBN 3981926323
9783981926323
EISSN 1558-1101
EndPage 1318
ExternalDocumentID 8715196
Genre orig-research
GroupedDBID 123
29F
29O
6IE
6IF
6IH
6IK
6IL
6IN
AAJGR
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
FEDTE
IEGSK
IPLJI
KZ1
LMP
M43
OCL
RIE
RIL
RNS
ID FETCH-LOGICAL-i202t-d98b73f0c0571acde5b051b277be07f40531c414a48c7538d8a067f90a7ac2b13
IEDL.DBID RIE
ISICitedReferencesCount 4
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000470666100244&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 02:47:06 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i202t-d98b73f0c0571acde5b051b277be07f40531c414a48c7538d8a067f90a7ac2b13
PageCount 6
ParticipantIDs ieee_primary_8715196
PublicationCentury 2000
PublicationDate 2019-March
PublicationDateYYYYMMDD 2019-03-01
PublicationDate_xml – month: 03
  year: 2019
  text: 2019-March
PublicationDecade 2010
PublicationTitle Proceedings - Design, Automation, and Test in Europe Conference and Exhibition
PublicationTitleAbbrev DATE
PublicationYear 2019
Publisher EDAA
Publisher_xml – name: EDAA
SSID ssj0005329
Score 2.095987
Snippet Economic competitiveness and national security depend increasingly on the insightful analysis of large data sets. The diversity of real-world data sources and...
SourceID ieee
SourceType Publisher
StartPage 1313
SubjectTerms Attributed Graphs
Data analysis
Data models
Data structures
Graph Analytics
Libraries
Runtime
Subspace constraints
Title A Parallel Graph Environment for Real-World Data Analytics Workflows
URI https://ieeexplore.ieee.org/document/8715196
WOSCitedRecordID wos000470666100244&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NTwIxEJ0A8aAXFDB-pwePruxH2W6PREBPhBhMuJFpt01ICBhY9O877a6giRdvm012m_TtdKbdee8B3NtcxyLq2SDjqhdw1Egxx1MChPC2SFsfab3ZhBiPs9lMTmrwsOfCGGN885l5dJf-X36-1jt3VNal4p4KjrQOdSHSkqt1aOdIYlmSYuj1kewO-tOh69yiT6F86pd9is8eo-b_xj2FzoGGxyb7BHMGNbNqQfPbh4FVYdmCkx-igm0Y9NkEN84iZcmenRw1Gx7IbIxqVPZKxWHgu2jYAAtkXpjEyTUzd3Rul-vPbQfeRsPp00tQeSUEiziMiyCXmRKJDTXVXxHq3PQUhZuKhVAmFJa7WNM84sgzTTuULM-Q8pSVIQrUsYqSc2is1itzASwO81AabVGlKbepIcAE4cZpLUgQjbyEtpui-XsphzGvZufq79vXcOxQKNu2bqBRbHbmFo70R7HYbu48hl_Jap0c
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PT8IwFH5BNFEvKGD8bQ8enXRdt65HIiBGJMRgwo10XZuQEDAw9N_3dUPQxIu3ZcnSpF_f3mv7vu8DuLWpZsIPrRfzJPS40gpjjkcICOJtFW59pM3NJkS_H49GclCCuw0XxhiTN5-Ze_eY3-Wnc71yR2UNLO6x4Ih2YDfknNGCrbVt6AiYLGgxOIAvG63msO16t3AxFN_9MlDJ80en8r-Rj6C-JeKRwSbFHEPJzKpQ-XZiIOvArMLhD1nBGrSaZKAWziRlSh6dIDVpb-lsBKtU8orloZf30ZCWyhTJpUmcYDNxh-d2Ov9c1uGt0x4-dL21W4I3YZRlXirjRASWaqzAfKVTEyYYcAkTIjFUWO6iTXOfKx5r3KPEaawwU1lJlVCaJX5wAuXZfGZOgTCaUmm0VUkUcRsZhEwgchz_BoFSRp5BzU3R-L0QxBivZ-f879c3sN8dvvTGvaf-8wUcOESKJq5LKGeLlbmCPf2RTZaL6xzPLyVFoGM
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Proceedings+-+Design%2C+Automation%2C+and+Test+in+Europe+Conference+and+Exhibition&rft.atitle=A+Parallel+Graph+Environment+for+Real-World+Data+Analytics+Workflows&rft.au=Castellana%2C+Vito+Giovanni&rft.au=Drocco%2C+Maurizio&rft.au=Feo%2C+John&rft.au=Firoz%2C+Jesun&rft.date=2019-03-01&rft.pub=EDAA&rft.eissn=1558-1101&rft.spage=1313&rft.epage=1318&rft_id=info:doi/10.23919%2FDATE.2019.8715196&rft.externalDocID=8715196