Search Results - "• Computing methodologies → Distributed algorithms"
-
1
Submodularity of Distributed Join Computation
ISSN: 0730-8078Published: United States 01.06.2018Published in Proceedings - ACM-SIGMOD International Conference on Management of Data (01.06.2018)“…We study distributed equi-join computation in the presence of join-attribute skew, which causes load imbalance. Skew can be addressed by more fine-grained…”
Get more information
Journal Article -
2
Centralized Training and Decentralized Control through the Actor-Critic Paradigm for Highly Optimized Multicores
Published: IEEE 22.06.2025Published in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“…While distributed, neural-network-based resource controllers represent the state of the art for their ability to cope with the ever-expanding decision space,…”
Get full text
Conference Proceeding -
3
HADFL: Heterogeneity-aware Decentralized Federated Learning Framework
Published: IEEE 05.12.2021Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“…Federated learning (FL) supports training models on geographically distributed devices. However, traditional FL systems adopt a centralized synchronous…”
Get full text
Conference Proceeding -
4
DeepScaler: Holistic Autoscaling for Microservices Based on Spatiotemporal GNN with Adaptive Graph Learning
ISSN: 2643-1572Published: IEEE 11.09.2023Published in IEEE/ACM International Conference on Automated Software Engineering : [proceedings] (11.09.2023)“…Autoscaling functions provide the foundation for achieving elasticity in the modern cloud computing paradigm. It enables dynamic provisioning or…”
Get full text
Conference Proceeding -
5
MMDFL: Multi-Model-based Decentralized Federated Learning for Resource-Constrained AIoT Systems
Published: IEEE 22.06.2025Published in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“…Along with the prosperity of Artificial Intelligence (AI) techniques, more and more Artificial Intelligence of Things (AIoT) applications adopt Federated…”
Get full text
Conference Proceeding -
6
MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high…”
Get full text
Conference Proceeding -
7
Derm: SLA-aware Resource Management for Highly Dynamic Microservices
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Ensuring efficient resource allocation while providing service level agreement (SLA) guarantees for end-to-end (E2E) latency is crucial for microservice…”
Get full text
Conference Proceeding -
8
FedAT: A High-Performance and Communication-Efficient Federated Learning System with Asynchronous Tiers
ISSN: 2167-4337Published: ACM 14.11.2021Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)“…Federated learning (FL) involves training a model over massive distributed devices, while keeping the training data localized and private. This form of…”
Get full text
Conference Proceeding -
9
PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Training recommendation systems (RecSys) faces several challenges as it requires the "data preprocessing" stage to preprocess an ample amount of raw data and…”
Get full text
Conference Proceeding -
10
NDFT: Accelerating Density Functional Theory Calculations via Hardware/Software Co-Design on Near-Data Computing System
Published: IEEE 22.06.2025Published in 2025 62nd ACM/IEEE Design Automation Conference (DAC) (22.06.2025)“…Linear-response time-dependent Density Functional Theory (LR-TDDFT) is a widely used method for accurately predicting the excited-state properties of physical…”
Get full text
Conference Proceeding -
11
DS-GL: Advancing Graph Learning via Harnessing Nature's Power within Scalable Dynamical Systems
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…With the rapid digitization of the world, an increasing number of real-world applications are turning to non-Euclidean data, modeled as graphs. Due to their…”
Get full text
Conference Proceeding -
12
Invited: Waving the Double-Edged Sword: Building Resilient CAVs with Edge and Cloud Computing
Published: IEEE 09.07.2023Published in 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)“…The rapid advancement of edge and cloud computing platforms, vehicular ad-hoc networks, and machine learning techniques have brought both opportunities and…”
Get full text
Conference Proceeding -
13
AdaGL: Adaptive Learning for Agile Distributed Training of Gigantic GNNs
Published: IEEE 09.07.2023Published in 2023 60th ACM/IEEE Design Automation Conference (DAC) (09.07.2023)“…Distributed GNN training on contemporary massive and densely connected graphs requires information aggregation from all neighboring nodes, which leads to an…”
Get full text
Conference Proceeding -
14
Personalized Heterogeneity-aware Federated Search Towards Better Accuracy and Energy Efficiency
ISSN: 1558-2434Published: ACM 29.10.2022Published in 2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD) (29.10.2022)“…Federated learning (FL), a new distributed technology, allows us to train the global model on the edge and embedded devices without local data sharing…”
Get full text
Conference Proceeding -
15
Clairvoyant Prefetching for Distributed Machine Learning I/O
ISSN: 2167-4337Published: ACM 14.11.2021Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)“…I/O is emerging as a major bottleneck for machine learning training, especially in distributed environments. Indeed, at large scale, I/O takes as much as 85%…”
Get full text
Conference Proceeding -
16
SparCML: High-Performance Sparse Communication for Machine Learning
ISSN: 2167-4337Published: ACM 17.11.2019Published in SC19: International Conference for High Performance Computing, Networking, Storage and Analysis (17.11.2019)“…Applying machine learning techniques to the quickly growing data in science and industry requires highly-scalable algorithms. Large datasets are most commonly…”
Get full text
Conference Proceeding -
17
BLOwing Trees to the Ground: Layout Optimization of Decision Trees on Racetrack Memory
Published: IEEE 05.12.2021Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“…Modern distributed low power systems tend to integrate machine learning algorithms, which are directly executed on the distributed devices (on the edge). In…”
Get full text
Conference Proceeding -
18
MyML: User-Driven Machine Learning
Published: IEEE 05.12.2021Published in 2021 58th ACM/IEEE Design Automation Conference (DAC) (05.12.2021)“…Machine learning (ML) on resource-constrained edge devices is expensive and often requires offloading computation to the cloud, which may compromise the…”
Get full text
Conference Proceeding -
19
Leveraging the Compute Power of two HPC Systems for Higher-Dimensional Grid-Based Simulations with the Widely-Distributed Sparse Grid Combination Technique
ISSN: 2167-4337Published: ACM 11.11.2023Published in International Conference for High Performance Computing, Networking, Storage and Analysis (Online) (11.11.2023)“…Grid-based simulations of hot fusion plasmas are often severely limited by computational and memory resources; the grids live in four- to six-dimensional space…”
Get full text
Conference Proceeding -
20
Efficient Scaling of Dynamic Graph Neural Networks
ISSN: 2167-4337Published: ACM 14.11.2021Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14.11.2021)“…We present distributed algorithms for training dynamic Graph Neural Networks (GNN) on large scale graphs spanning multi-node, multi-GPU systems. To the best of…”
Get full text
Conference Proceeding