Search Results - IEEE International Conference on Algorithms AND Architectures for Parallel Processing
-
1
Proceedings fifth International Conference on algorithms and architectures for parallel processing
ISBN: 0769515126, 9780769515144, 0769515134, 9780769515137, 0769515142, 9780769515120Published: Los Alamitos ; Tokyo IEEE Computer Society 2002Get full text
Book -
2
ICAPP 95 : IEEE First ICA[3]PP : IEEE First International Conference on Algorithms and Architectures for Parallel Processing, Brisbane, Australia, 19-21 April, 1995
ISBN: 9780780320185, 0780320182Published: New York Institute of Electrical and Electronics Engineers 1995Get full text
Book -
3
Proceedings of 1996 IEEE Second International Conference on Algorithms & Architectures for Parallel Processing, ICA[3]PP '96, June 11-13, 1996, Singapore
ISBN: 9780780335295, 0780335295Published: New York IEEE Service Center 1996Get full text
Book -
4
1997 3rd International Conference on Algorithms & Architectures for Parallel Processing, ICA[3]PP '97, Melbourne, Australia Decsmber 10-12 1997
ISBN: 0780342291, 9780780342293Published: Singapore World Scientific 1997Get full text
Book -
5
Algorithms and architectures for parallel processing: 1997 3rd international conference, Melbourne, Australia, December 10-12 1997
ISBN: 0780342291, 9780780342293Published: World Scientific Publishing Co. Pte. Ltd 1997“…This volume of proceedings describes the lower costs and higher degrees of integration of chip architecture which allow parallel processing…”
Get full text
eBook -
6
Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses
ISSN: 2379-190XPublished: IEEE 04.06.2023Published in Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (04.06.2023)“… The proposed model is a cascade of a residual convolutional network and a parallel estimation architecture…”
Get full text
Conference Proceeding -
7
InnerSP: A Memory Efficient Sparse Matrix Multiplication Accelerator with Locality-Aware Inner Product Processing
Published: IEEE 01.09.2021Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)“… To mitigate the memory access overheads, recent accelerator designs advocated the outer product processing which minimizes input accesses but generates intermediate products to be merged to the final output matrix…”
Get full text
Conference Proceeding -
8
Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs
Published: IEEE 01.09.2021Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)“…Graph sampling and random walk operations, capturing the structural properties of graphs, are playing an important role today as we cannot directly adopt computing-intensive algorithms on large-scale graphs…”
Get full text
Conference Proceeding -
9
Splitwise: Efficient Generative LLM Inference Using Phase Splitting
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our…”
Get full text
Conference Proceeding -
10
Accelerating Graph Convolutional Networks Using Crossbar-based Processing-In-Memory Architectures
ISSN: 2378-203XPublished: IEEE 01.04.2022Published in Proceedings - International Symposium on High-Performance Computer Architecture (01.04.2022)“… efficiency.In this paper, we present a new GCN accelerator, RE-FLIP, with three key innovations in terms of architecture design, algorithm mappings, and practical implementations…”
Get full text
Conference Proceeding -
11
A Hybrid Systolic-Dataflow Architecture for Inductive Matrix Algorithms
ISSN: 2378-203XPublished: IEEE 01.02.2020Published in Proceedings - International Symposium on High-Performance Computer Architecture (01.02.2020)“… in the hardware/software interface, then a spatial architecture could efficiently execute parallel code regions…”
Get full text
Conference Proceeding -
12
pSyncPIM: Partially Synchronous Execution of Sparse Matrix Operations for All-Bank PIM Architectures
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“… Sparse matrix processing is another critical computation that can significantly benefit from the PIM architecture, but the current all-bank PIM control cannot support diverging executions due to the random sparsity…”
Get full text
Conference Proceeding -
13
MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems
Published: IEEE 29.06.2024Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)“…Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high…”
Get full text
Conference Proceeding -
14
Parallelizing Maximal Clique Enumeration on GPUs
Published: IEEE 21.10.2023Published in 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21.10.2023)“…We present a GPU solution for exact maximal clique enumeration (MCE) that performs a search tree traversal following the Bron-Kerbosch algorithm…”
Get full text
Conference Proceeding -
15
PolyGraph: Exposing the Value of Flexibility for Graph Processing Accelerators
ISSN: 2575-713XPublished: IEEE 01.06.2021Published in Proceedings - International Symposium on Computer Architecture (01.06.2021)“… First, we identify a taxonomy of key algorithm variants. Then we develop a template architecture (PolyGraph…”
Get full text
Conference Proceeding -
16
MnnFast: A Fast and Scalable System Architecture for Memory-Augmented Neural Networks
ISSN: 2575-713XPublished: ACM 01.06.2019Published in 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (01.06.2019)“… Such large-scale memory networks provide excellent reasoning power; however, the current computer infrastructure cannot achieve scalable performance due to its limited system architecture…”
Get full text
Conference Proceeding -
17
An In-Network Architecture for Accelerating Shared-Memory Multiprocessor Collectives
Published: IEEE 01.05.2020Published in 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA) (01.05.2020)“…The slowdown of single-chip performance scaling combined with the growing demands of computing ever larger problems efficiently has led to a renewed interest in distributed architectures and specialized hardware…”
Get full text
Conference Proceeding -
18
DRRA-based Reconfigurable Architecture for Mixed-Radix FFT
ISSN: 2380-6923Published: IEEE 01.01.2023Published in VLSI design (01.01.2023)“… In this paper, we propose an architecture for the implementation of the FFT that is derived from the Dynamically Reconfigurable Resource Array and has multiple parallel processing cells while also…”
Get full text
Conference Proceeding -
19
Seer: Predictive Runtime Kernel Selection for Irregular Problems
ISSN: 2643-2838Published: IEEE 02.03.2024Published in Proceedings / International Symposium on Code Generation and Optimization (02.03.2024)“…Modern GPUs are designed for regular problems and suffer from load imbalance when processing irregular data…”
Get full text
Conference Proceeding -
20
Accelerating Fourier and Number Theoretic Transforms using Tensor Cores and Warp Shuffles
Published: IEEE 01.09.2021Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)“… To speed things up, fast Fourier transform (FFT) algorithms, which are reduced-complexity formulations for computing the DFT of a sequence, have been proposed and implemented for traditional processors and their corresponding instruction sets…”
Get full text
Conference Proceeding

