Výsledky vyhľadávania - Computing methodologies Distributed computing methodologies Distributed programming languages

1

Načíta sa…

MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems Autor Hsia, Samuel, Golden, Alicia, Acun, Bilge, Ardalani, Newsha, DeVito, Zachary, Wei, Gu-Yeon, Brooks, David, Wu, Carole-Jean

Vydavateľské údaje: IEEE 29.06.2024

Vydané v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“…Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high operational costs…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
2

Načíta sa…

Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs Autor Wang, Pengyu, Li, Chao, Wang, Jing, Wang, Taolei, Zhang, Lu, Leng, Jingwen, Chen, Quan, Guo, Minyi

Vydavateľské údaje: IEEE 01.09.2021

Vydané v 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)
“…Graph sampling and random walk operations, capturing the structural properties of graphs, are playing an important role today as we cannot directly adopt computing-intensive algorithms on large-scale graphs…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
3

Načíta sa…

Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics Autor Dathathri, Roshan, Gill, Gurbinder, Hoang, Loc, Jatala, Vishwesh, Pingali, Keshav, Nandivada, V. Krishna, Dang, Hoang-Vu, Snir, Marc

ISSN: 2641-7936

Vydavateľské údaje: IEEE 01.09.2019

Vydané v Proceedings / International Conference on Parallel Architectures and Compilation Techniques (01.09.2019)
“…Distributed graph analytics systems for CPUs, like D-Galois and Gemini, and for GPUs, like D-IrGL and Lux, use a bulk-synchronous parallel (BSP…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
4

Načíta sa…

AdaGL: Adaptive Learning for Agile Distributed Training of Gigantic GNNs Autor Zhang, Ruisi, Javaheripi, Mojan, Ghodsi, Zahra, Bleiweiss, Amit, Koushanfar, Farinaz

Vydavateľské údaje: IEEE 09.07.2023

Vydané v 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)
“…Distributed GNN training on contemporary massive and densely connected graphs requires information aggregation from all neighboring nodes, which leads to an explosion of inter-server communications…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
5

Načíta sa…

Auto-parallelizing stateful distributed streaming applications Autor Schneider, Scott, Hirzel, Martin, Gedik, Bugra, Wu, Kun-Lung

Vydavateľské údaje: ACM 01.09.2012

Vydané v PACT'12 : proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques, September 19-23, Minneapolis, Minnesota, USA (01.09.2012)
“… They are comprised of operator graphs that produce and consume data tuples. The streaming programming model naturally exposes task and pipeline parallelism, enabling it to exploit parallel systems of all kinds, including large clusters…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
6

Načíta sa…

Optimizing Distributed ML Communication with Fused Computation-Collective Operations Autor Punniyamurthy, Kishore, Hamidouche, Khaled, Beckmann, Bradford M.

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…Machine learning models are distributed across multiple nodes using numerous parallelism strategies…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
7

Načíta sa…

Legate Sparse: Distributed Sparse Computing in Python Autor Yadav, Rohan, Lee, Wonchan, Elibol, Melih, Patti, Taylor Lee, Papadakis, Manolis, Garland, Michael, Aiken, Alex, Kjolstad, Fredrik, Bauer, Michael

ISSN: 2167-4337

Vydavateľské údaje: ACM 11.11.2023

Vydané v International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“… The standard implementation of SciPy is restricted to a single CPU and cannot take advantage of modern distributed and accelerated computing resources…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
8

Načíta sa…

Introduction to Parallel and Distributed Programming using N-Body Simulations Autor Van Craen, Alexander, Breyer, Marcel, Pfluger, Dirk

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…This paper describes how we use n-body simulations as an interesting and visually compelling way to teach efficient, parallel, and distributed programming…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
9

Načíta sa…

Legate NumPy: Accelerated and Distributed Array Computing Autor Bauer, Michael, Garland, Michael

ISSN: 2167-4337

Vydavateľské údaje: ACM 17.11.2019

Vydané v SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)
“… Legate works by translating NumPy programs to the Legion programming model and then leverages the scalability of the Legion runtime system to distribute data and computations across an arbitrary sized machine. Compared to similar programs written in the distributed Dask array library in Python, Legate achieves speed-ups of up to 10X on 1280 CPUs and 100X on 256 GPUs…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
10

Načíta sa…

Managing Workflow Malleability in Urgent Computing for Earthquake Alerts Autor Ejarque, Jorge, Monterrubio-Velasco, Marisol, Bhihe, Cedric, Pienkowska, Marta, De La Puente, Josep, Badia, Rosa M.

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… UCIS4EQ is an urgent computing platform that estimates ground shaking based on high-performance parallel 3D simulations…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
11

Načíta sa…

Enabling Low-Overhead HT-HPC Workflows at Extreme Scale using GNU Parallel Autor Maheshwari, Ketan, Arndt, William, Karimi, Ahmad Maroof, Yin, Junqi, Suter, Frederic, Johnson, Seth, Da Silva, Rafael Ferreira

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…GNU Parallel is a versatile and powerful tool for process parallelization widely used in scientific computing…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
12

Načíta sa…

A Sparsity-Aware Distributed-Memory Algorithm for Sparse-Sparse Matrix Multiplication Autor Hong, Yuxi, Buluc, Aydin

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… Distributed-memory parallel algorithms for SpGEMM have mainly focused on sparsity-oblivious approaches that use 2D and 3D partitioning…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
13

Načíta sa…

Towards an Optimized Heterogeneous Distributed Task Scheduler in OpenMP Cluster Autor Neveu, Remy, Ceccato, Rodrigo, Leite, Gustavo, Araujo, Guido, Diaz, Jose M. Monsalve, Yviquel, Herve

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…This paper addresses the challenges of optimizing task scheduling for a distributed, task-based execution model in OpenMP for cluster computing environments…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
14

Načíta sa…

CUDASTF: Bridging the Gap Between CUDA and Task Parallelism Autor Augonnet, Cedric, Alexandrescu, Andrei, Sidelnik, Albert, Garland, Michael

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…Organizing computation as asynchronous tasks with data-driven dependencies is a simple and efficient model for single- and multi-GPU programs. Sequential Task…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
15

Načíta sa…

Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling Autor Barwey, Shivam, Balin, Riccardo, Lusch, Bethany, Patel, Saumil, Balakrishnan, Ramesh, Pal, Pinaki, Maulik, Romit, Vishwanath, Venkatram

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…This work develops a distributed graph neural network (GNN) methodology for mesh-based modeling applications using a consistent neural message passing layer…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
16

Načíta sa…

Accelerating Communications in Federated Applications with Transparent Object Proxies Autor Pauloski, J. Gregory, Hayot-Sasson, Valerie, Ward, Logan, Hudson, Nathaniel, Sabino, Charlie, Baughman, Matt, Chard, Kyle, Foster, Ian

ISSN: 2167-4337

Vydavateľské údaje: ACM 11.11.2023

Vydané v International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“… Here, we overcome this obstacle with a new programming paradigm that decouples control flow from data flow by extending the pass-by-reference model to distributed applications…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
17

Načíta sa…

Distributed-Memory Parallel Algorithms for Sparse Matrix and Sparse Tall-and-Skinny Matrix Multiplication Autor Ranawaka, Isuru, Hussain, Md Taufique, Block, Charles, Gerogiannis, Gerasimos, Torrellas, Josep, Azad, Ariful

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… Unfortunately, popular distributed algorithms like sparse SUMMA deliver suboptimal performance for TS-SpGEMM…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
18

Načíta sa…

Reshaping High Energy Physics Applications for Near-Interactive Execution Using TaskVine Autor Sly-Delgado, Barry, Tovar, Ben, Zhou, Jin, Thain, Douglas

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…High energy physics experiments produce petabytes of data annually that must be reduced to gain insight into the laws of nature. Early-stage reduction executes…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
19

Načíta sa…

Enhance the Strong Scaling of LAMMPS on Fugaku Autor Li, Jianxiong, Zhao, Tong, Guo, Zhuoqiang, Shi, Shunchen, Liu, Lijun, Tan, Guangming, Jia, Weile, Yuan, Guojun, Wang, Zhan

ISSN: 2167-4337

Vydavateľské údaje: ACM 11.11.2023

Vydané v International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“…Physical phenomenon such as protein folding requires simulation up to microseconds of physical time, which directly corresponds to the strong scaling of…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:
20

Načíta sa…

MPI Progress For All Autor Zhou, Hui, Latham, Robert, Raffenetti, Ken, Guo, Yanfei, Thakur, Rajeev

Vydavateľské údaje: IEEE 17.11.2024

Vydané v SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“… The opaque nature of MPI progress poses significant challenges in advancing MPI within modern high-performance computing practices…”

Získať plný text

Konferenčný príspevok..

Pridať do obľúbených

Uložené v:

Výsledky vyhľadávania - Computing methodologies Distributed computing methodologies Distributed programming languages

MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems Autor Hsia, Samuel, Golden, Alicia, Acun, Bilge, Ardalani, Newsha, DeVito, Zachary, Wei, Gu-Yeon, Brooks, David, Wu, Carole-Jean

Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs Autor Wang, Pengyu, Li, Chao, Wang, Jing, Wang, Taolei, Zhang, Lu, Leng, Jingwen, Chen, Quan, Guo, Minyi

Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics Autor Dathathri, Roshan, Gill, Gurbinder, Hoang, Loc, Jatala, Vishwesh, Pingali, Keshav, Nandivada, V. Krishna, Dang, Hoang-Vu, Snir, Marc

AdaGL: Adaptive Learning for Agile Distributed Training of Gigantic GNNs Autor Zhang, Ruisi, Javaheripi, Mojan, Ghodsi, Zahra, Bleiweiss, Amit, Koushanfar, Farinaz

Auto-parallelizing stateful distributed streaming applications Autor Schneider, Scott, Hirzel, Martin, Gedik, Bugra, Wu, Kun-Lung

Optimizing Distributed ML Communication with Fused Computation-Collective Operations Autor Punniyamurthy, Kishore, Hamidouche, Khaled, Beckmann, Bradford M.

Legate Sparse: Distributed Sparse Computing in Python Autor Yadav, Rohan, Lee, Wonchan, Elibol, Melih, Patti, Taylor Lee, Papadakis, Manolis, Garland, Michael, Aiken, Alex, Kjolstad, Fredrik, Bauer, Michael

Introduction to Parallel and Distributed Programming using N-Body Simulations Autor Van Craen, Alexander, Breyer, Marcel, Pfluger, Dirk

Legate NumPy: Accelerated and Distributed Array Computing Autor Bauer, Michael, Garland, Michael

Managing Workflow Malleability in Urgent Computing for Earthquake Alerts Autor Ejarque, Jorge, Monterrubio-Velasco, Marisol, Bhihe, Cedric, Pienkowska, Marta, De La Puente, Josep, Badia, Rosa M.

Enabling Low-Overhead HT-HPC Workflows at Extreme Scale using GNU Parallel Autor Maheshwari, Ketan, Arndt, William, Karimi, Ahmad Maroof, Yin, Junqi, Suter, Frederic, Johnson, Seth, Da Silva, Rafael Ferreira

A Sparsity-Aware Distributed-Memory Algorithm for Sparse-Sparse Matrix Multiplication Autor Hong, Yuxi, Buluc, Aydin

Towards an Optimized Heterogeneous Distributed Task Scheduler in OpenMP Cluster Autor Neveu, Remy, Ceccato, Rodrigo, Leite, Gustavo, Araujo, Guido, Diaz, Jose M. Monsalve, Yviquel, Herve

CUDASTF: Bridging the Gap Between CUDA and Task Parallelism Autor Augonnet, Cedric, Alexandrescu, Andrei, Sidelnik, Albert, Garland, Michael

Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling Autor Barwey, Shivam, Balin, Riccardo, Lusch, Bethany, Patel, Saumil, Balakrishnan, Ramesh, Pal, Pinaki, Maulik, Romit, Vishwanath, Venkatram

Accelerating Communications in Federated Applications with Transparent Object Proxies Autor Pauloski, J. Gregory, Hayot-Sasson, Valerie, Ward, Logan, Hudson, Nathaniel, Sabino, Charlie, Baughman, Matt, Chard, Kyle, Foster, Ian

Distributed-Memory Parallel Algorithms for Sparse Matrix and Sparse Tall-and-Skinny Matrix Multiplication Autor Ranawaka, Isuru, Hussain, Md Taufique, Block, Charles, Gerogiannis, Gerasimos, Torrellas, Josep, Azad, Ariful

Reshaping High Energy Physics Applications for Near-Interactive Execution Using TaskVine Autor Sly-Delgado, Barry, Tovar, Ben, Zhou, Jin, Thain, Douglas

Enhance the Strong Scaling of LAMMPS on Fugaku Autor Li, Jianxiong, Zhao, Tong, Guo, Zhuoqiang, Shi, Shunchen, Liu, Lijun, Tan, Guangming, Jia, Weile, Yuan, Guojun, Wang, Zhan

MPI Progress For All Autor Zhou, Hui, Latham, Robert, Raffenetti, Ken, Guo, Yanfei, Thakur, Rajeev

Vyhľadávacie nástroje:

Upresniť hľadanie

Médium

Predmetová oblasť

Téma

Jazyk

Rok vydania