Search Results - "Computing methodologies Distributed computing methodologies Distributed programming languages"
1
Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs
Published: IEEE 01.09.2021
Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)
“…Graph sampling and random walk operations, capturing the structural properties of graphs, are playing an important role today as we cannot directly adopt…”
Conference Proceeding
2
MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems
Published: IEEE 29.06.2024
Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“…Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high…”
Conference Proceeding
3
AdaGL: Adaptive Learning for Agile Distributed Training of Gigantic GNNs
Published: IEEE 09.07.2023
Published in 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)
“…Distributed GNN training on contemporary massive and densely connected graphs requires information aggregation from all neighboring nodes, which leads to an…”
Conference Proceeding
4
Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics
ISSN: 2641-7936
Published: IEEE 01.09.2019
Published in Proceedings / International Conference on Parallel Architectures and Compilation Techniques (01.09.2019)
“…Distributed graph analytics systems for CPUs, like D-Galois and Gemini, and for GPUs, like D-IrGL and Lux, use a bulk-synchronous parallel (BSP) programming…”
Conference Proceeding
5
Auto-parallelizing stateful distributed streaming applications
Published: ACM 01.09.2012
Published in PACT'12 : proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques, September 19-23, Minneapolis, Minnesota, USA (01.09.2012)
“…Streaming applications transform possibly infinite streams of data and often have both high throughput and low latency requirements. They are comprised of…”
Conference Proceeding
6
Legate NumPy: Accelerated and Distributed Array Computing
ISSN: 2167-4337
Published: ACM 17.11.2019
Published in SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)
“…NumPy is a popular Python library used for performing array-based numerical computations. The canonical implementation of NumPy used by most programmers runs…”
Conference Proceeding
7
Risk and Mitigation of Nondeterminism in Distributed Cyber-Physical Systems
ISSN: 2832-6520
Published: ACM 21.09.2023
Published in Proceedings (ACM and IEEE International Conference on Formal Methods and Models for Co-Design) (21.09.2023)
“…Asynchronous frameworks for distributed embedded systems, like ROS and MQTT, are increasingly used in safety-critical applications such as autonomous driving,…”
Conference Proceeding
8
Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling
Published: IEEE 17.11.2024
Published in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…This work develops a distributed graph neural network (GNN) methodology for mesh-based modeling applications using a consistent neural message passing layer…”
Conference Proceeding
9
Optimizing Distributed ML Communication with Fused Computation-Collective Operations
Published: IEEE 17.11.2024
Published in SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…Machine learning models are distributed across multiple nodes using numerous parallelism strategies. The resulting collective communication is often on the…”
Conference Proceeding
10
Stratified sampling for even workload partitioning
Published: ACM 01.08.2014
Published in PACT '14 : proceedings of the 23rd International Conference on Parallel Architectures and Compilation Techniques : August 24-27, 2014, Edmonton, AB, Canada (01.08.2014)
“…This work presents a novel algorithm, Workload Partitioning and Scheduling (WPS), for evenly partitioning the computational workload of large…”
Conference Proceeding
11
Unifying Artifacts and Activities in a Visual Tool for Distributed Software Development Teams
ISBN: 9780769521633, 0769521630
ISSN: 0270-5257
Published: Washington, DC, USA IEEE Computer Society 23.05.2004
Published in International Conference on Software Engineering: Proceedings of the 26th International Conference on Software Engineering; 23-28 May 2004 (23.05.2004)
“…In large projects, software developers struggle with two sources of complexity: the complexity of the code itself, and the complexity of the process of…”
Conference Proceeding
12
Legate Sparse: Distributed Sparse Computing in Python
ISSN: 2167-4337
Published: ACM 11.11.2023
Published in International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“…The sparse module of the popular SciPy Python library is widely used across applications in scientific computing, data analysis and machine learning. The…”
Conference Proceeding
13
Accelerating Communications in Federated Applications with Transparent Object Proxies
ISSN: 2167-4337
Published: ACM 11.11.2023
Published in International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)
“…Advances in networks, accelerators, and cloud services encourage programmers to reconsider where to compute-such as when fast networks make it cost-effective…”
Conference Proceeding
14
CUDASTF: Bridging the Gap Between CUDA and Task Parallelism
Published: IEEE 17.11.2024
Published in SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…Organizing computation as asynchronous tasks with data-driven dependencies is a simple and efficient model for single- and multi-GPU programs. Sequential Task…”
Conference Proceeding
15
Managing Workflow Malleability in Urgent Computing for Earthquake Alerts
Published: IEEE 17.11.2024
Published in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…When large earthquakes happen, first responders need fast and accurate information regarding their impact. UCIS4EQ is an urgent computing platform that…”
Conference Proceeding
16
Parallelized Multi-Agent Bayesian Optimization in Lava
Published: IEEE 30.07.2024
Published in 2024 International Conference on Neuromorphic Systems (ICONS) (30.07.2024)
“…In parallel with the continuously increasing parameter space dimensionality, search and optimization algorithms should support distributed parameter…”
Conference Proceeding
17
Multi-layer faults in the architectures of mobile, context-aware adaptive applications: a position paper
ISBN: 1605580228, 9781605580227
ISSN: 0270-5257
Published: 10.05.2008
Published in International Conference on Software Engineering 2008 (10.05.2008)
“…Five cellphones are sold every second, and there are four times more cellphones than computers, meaning there are some billions of mobile handheld devices in…”
Journal Article
18
Asynchronous Distributed-Memory Parallel Algorithms for Influence Maximization
Published: IEEE 17.11.2024
Published in SC24: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…Influence maximization (IM) is the problem of finding the k most influential nodes in a graph. We propose distributed-memory parallel algorithms for the two…”
Conference Proceeding
19
Enabling Low-Overhead HT-HPC Workflows at Extreme Scale using GNU Parallel
Published: IEEE 17.11.2024
Published in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…GNU Parallel is a versatile and powerful tool for process parallelization widely used in scientific computing. This paper demonstrates its effective…”
Conference Proceeding
20
Parsl+CWL: Towards Combining the Python and CWL Ecosystems
Published: IEEE 17.11.2024
Published in SC24-W: Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2024)
“…The Common Workflow Language (CWL) is a widely adopted language for defining and sharing computational workflows. It is designed to be independent of the…”
Conference Proceeding

