A Parallel Graph Environment for Real-World Data Analytics Workflows
Economic competitiveness and national security depend increasingly on the insightful analysis of large data sets. The diversity of real-world data sources and analytic workflows impose challenging hardware and software requirements for parallel graph platforms. The irregular nature of graph methods...
Uloženo v:
| Vydáno v: | Proceedings - Design, Automation, and Test in Europe Conference and Exhibition s. 1313 - 1318 |
|---|---|
| Hlavní autoři: | , , , , , , , , , , , |
| Médium: | Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
EDAA
01.03.2019
|
| Témata: | |
| ISSN: | 1558-1101 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | Economic competitiveness and national security depend increasingly on the insightful analysis of large data sets. The diversity of real-world data sources and analytic workflows impose challenging hardware and software requirements for parallel graph platforms. The irregular nature of graph methods is not supported well by the deep memory hierarchies of conventional distributed systems, requiring new processor and runtime system designs to tolerate memory and synchronization latencies. Moreover, the efficiency of relational table operations and matrix computations are not attainable when data is stored in common graph data structures. In this paper, we present HAGGLE, a high-performance, scalable data analytics platform. The platform's hybrid data model supports a variety of distributed, thread-safe data structures, parallel programming constructs, and persistent and streaming data. An abstract runtime layer enables us to map the stack to conventional, distributed computer systems with accelerators. The runtime uses multithreading, active messages, and data aggregation to hide memory and synchronization latencies on large-scale systems. |
|---|---|
| AbstractList | Economic competitiveness and national security depend increasingly on the insightful analysis of large data sets. The diversity of real-world data sources and analytic workflows impose challenging hardware and software requirements for parallel graph platforms. The irregular nature of graph methods is not supported well by the deep memory hierarchies of conventional distributed systems, requiring new processor and runtime system designs to tolerate memory and synchronization latencies. Moreover, the efficiency of relational table operations and matrix computations are not attainable when data is stored in common graph data structures. In this paper, we present HAGGLE, a high-performance, scalable data analytics platform. The platform's hybrid data model supports a variety of distributed, thread-safe data structures, parallel programming constructs, and persistent and streaming data. An abstract runtime layer enables us to map the stack to conventional, distributed computer systems with accelerators. The runtime uses multithreading, active messages, and data aggregation to hide memory and synchronization latencies on large-scale systems. |
| Author | Tumeo, Antonino Manzano, Joseph Drocco, Maurizio Feo, John Firoz, Jesun Castellana, Vito Giovanni Minutoli, Marco Lumsdaine, Andrew Zalewski, Marcin Kanewala, Thejaka Suetterlein, Joshua Marquez, Andres |
| Author_xml | – sequence: 1 givenname: Vito Giovanni surname: Castellana fullname: Castellana, Vito Giovanni organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 2 givenname: Maurizio surname: Drocco fullname: Drocco, Maurizio organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 3 givenname: John surname: Feo fullname: Feo, John organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 4 givenname: Jesun surname: Firoz fullname: Firoz, Jesun organization: School of Informatics, Computing, and Engineering Indiana University, Bloomington, IN, USA – sequence: 5 givenname: Thejaka surname: Kanewala fullname: Kanewala, Thejaka organization: School of Informatics, Computing, and Engineering Indiana University, Bloomington, IN, USA – sequence: 6 givenname: Andrew surname: Lumsdaine fullname: Lumsdaine, Andrew organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 7 givenname: Joseph surname: Manzano fullname: Manzano, Joseph organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 8 givenname: Andres surname: Marquez fullname: Marquez, Andres organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 9 givenname: Marco surname: Minutoli fullname: Minutoli, Marco organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 10 givenname: Joshua surname: Suetterlein fullname: Suetterlein, Joshua organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 11 givenname: Antonino surname: Tumeo fullname: Tumeo, Antonino organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA – sequence: 12 givenname: Marcin surname: Zalewski fullname: Zalewski, Marcin organization: High Performance Computing, Pacific Northwest National Laboratory, Richland, WA, USA |
| BookMark | eNotj9FKwzAUhqMouE5fQG_yAq3nJE2TXJZ1m8JAkYmX4zRNsZq1Iy3K3t6Bu_rhu_j4_oRd9UPvGbtHyIS0aB-rcrvMBKDNjEaFtrhgibQGrSikkJdshkqZFBHwhiXj-AUASgo7Y1XJXylSCD7wdaTDJ1_2P10c-r3vJ94Okb95CunHEEPDK5qIlz2F49S5kZ_gdxuG3_GWXbcURn933jl7Xy23i6d087J-XpSbtBMgprSxptayBQdKI7nGqxoU1kLr2oNu81MSuhxzyo3TSprGEBS6tUCanKhRztnDv7fz3u8OsdtTPO7Oj-UfHWJLYQ |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.23919/DATE.2019.8715196 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering Computer Science |
| EISBN | 3981926323 9783981926323 |
| EISSN | 1558-1101 |
| EndPage | 1318 |
| ExternalDocumentID | 8715196 |
| Genre | orig-research |
| GroupedDBID | 123 29F 29O 6IE 6IF 6IH 6IK 6IL 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO FEDTE IEGSK IPLJI KZ1 LMP M43 OCL RIE RIL RNS |
| ID | FETCH-LOGICAL-i202t-d98b73f0c0571acde5b051b277be07f40531c414a48c7538d8a067f90a7ac2b13 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 4 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000470666100244&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Wed Aug 27 02:47:06 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i202t-d98b73f0c0571acde5b051b277be07f40531c414a48c7538d8a067f90a7ac2b13 |
| PageCount | 6 |
| ParticipantIDs | ieee_primary_8715196 |
| PublicationCentury | 2000 |
| PublicationDate | 2019-March |
| PublicationDateYYYYMMDD | 2019-03-01 |
| PublicationDate_xml | – month: 03 year: 2019 text: 2019-March |
| PublicationDecade | 2010 |
| PublicationTitle | Proceedings - Design, Automation, and Test in Europe Conference and Exhibition |
| PublicationTitleAbbrev | DATE |
| PublicationYear | 2019 |
| Publisher | EDAA |
| Publisher_xml | – name: EDAA |
| SSID | ssj0005329 |
| Score | 2.095987 |
| Snippet | Economic competitiveness and national security depend increasingly on the insightful analysis of large data sets. The diversity of real-world data sources and... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1313 |
| SubjectTerms | Attributed Graphs Data analysis Data models Data structures Graph Analytics Libraries Runtime Subspace constraints |
| Title | A Parallel Graph Environment for Real-World Data Analytics Workflows |
| URI | https://ieeexplore.ieee.org/document/8715196 |
| WOSCitedRecordID | wos000470666100244&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NTwIxEJ0A8aAXFDB-pwePruxH2W6PREBPhBhMuJFpt01ICBhY9O877a6giRdvm012m_TtdKbdee8B3NtcxyLq2SDjqhdw1Egxx1MChPC2SFsfab3ZhBiPs9lMTmrwsOfCGGN885l5dJf-X36-1jt3VNal4p4KjrQOdSHSkqt1aOdIYlmSYuj1kewO-tOh69yiT6F86pd9is8eo-b_xj2FzoGGxyb7BHMGNbNqQfPbh4FVYdmCkx-igm0Y9NkEN84iZcmenRw1Gx7IbIxqVPZKxWHgu2jYAAtkXpjEyTUzd3Rul-vPbQfeRsPp00tQeSUEiziMiyCXmRKJDTXVXxHq3PQUhZuKhVAmFJa7WNM84sgzTTuULM-Q8pSVIQrUsYqSc2is1itzASwO81AabVGlKbepIcAE4cZpLUgQjbyEtpui-XsphzGvZufq79vXcOxQKNu2bqBRbHbmFo70R7HYbu48hl_Jap0c |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PT8IwFH5BNFEvKGD8bQ8enXRdt65HIiBGJMRgwo10XZuQEDAw9N_3dUPQxIu3ZcnSpF_f3mv7vu8DuLWpZsIPrRfzJPS40gpjjkcICOJtFW59pM3NJkS_H49GclCCuw0XxhiTN5-Ze_eY3-Wnc71yR2UNLO6x4Ih2YDfknNGCrbVt6AiYLGgxOIAvG63msO16t3AxFN_9MlDJ80en8r-Rj6C-JeKRwSbFHEPJzKpQ-XZiIOvArMLhD1nBGrSaZKAWziRlSh6dIDVpb-lsBKtU8orloZf30ZCWyhTJpUmcYDNxh-d2Ov9c1uGt0x4-dL21W4I3YZRlXirjRASWaqzAfKVTEyYYcAkTIjFUWO6iTXOfKx5r3KPEaawwU1lJlVCaJX5wAuXZfGZOgTCaUmm0VUkUcRsZhEwgchz_BoFSRp5BzU3R-L0QxBivZ-f879c3sN8dvvTGvaf-8wUcOESKJq5LKGeLlbmCPf2RTZaL6xzPLyVFoGM |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Proceedings+-+Design%2C+Automation%2C+and+Test+in+Europe+Conference+and+Exhibition&rft.atitle=A+Parallel+Graph+Environment+for+Real-World+Data+Analytics+Workflows&rft.au=Castellana%2C+Vito+Giovanni&rft.au=Drocco%2C+Maurizio&rft.au=Feo%2C+John&rft.au=Firoz%2C+Jesun&rft.date=2019-03-01&rft.pub=EDAA&rft.eissn=1558-1101&rft.spage=1313&rft.epage=1318&rft_id=info:doi/10.23919%2FDATE.2019.8715196&rft.externalDocID=8715196 |