Výsledky vyhledávání - IEEE International Conference on Algorithms AND Architectures for Parallel Processing~

1

Načítá se…

ICAPP 95 : IEEE First ICA[3]PP : IEEE First International Conference on Algorithms and Architectures for Parallel Processing, Brisbane, Australia, 19-21 April, 1995 Autor IEEE International Conference on Algorithms and Architectures for Parallel Processing, Narasimhan, V. L.

ISBN: 9780780320185, 0780320182

Vydáno: New York Institute of Electrical and Electronics Engineers 1995

Získat plný text

Kniha

Přidat do oblíbených

Uloženo v:
2

Načítá se…

Proceedings fifth International Conference on algorithms and architectures for parallel processing Autor IEEE International Conference on Algorithms and Architectures for Parallel Processing, Zhou, Wanlei

ISBN: 0769515126, 9780769515144, 0769515134, 9780769515137, 0769515142, 9780769515120

Vydáno: Los Alamitos ; Tokyo IEEE Computer Society 2002

Získat plný text

Kniha

Přidat do oblíbených

Uloženo v:
3

Načítá se…

Proceedings of 1996 IEEE Second International Conference on Algorithms & Architectures for Parallel Processing, ICA[3]PP '96, June 11-13, 1996, Singapore Autor IEEE International Conference on Algorithms and Architectures for Parallel Processing, Institute of Electrical and Electronics Engineers. Singapore Section, IEEE Singapore Section. Computer Chapter, National University of Singapore. Dept. of Information Systems and Computer Science

ISBN: 9780780335295, 0780335295

Vydáno: New York IEEE Service Center 1996

Získat plný text

Kniha

Přidat do oblíbených

Uloženo v:
4

Načítá se…

1997 3rd International Conference on Algorithms & Architectures for Parallel Processing, ICA[3]PP '97, Melbourne, Australia Decsmber 10-12 1997 Autor IEEE International Conference on Algorithms and Architectures for Parallel Processing, Institute of Electrical and Electronics Engineers. Victorian section, Gościński, Andrzej, Hobbs, Michael, Zhou, Wanlei, Deakin University Faculty of Science and Technology, Deakin University IEEE Victorian Section

ISBN: 0780342291, 9780780342293

Vydáno: Singapore World Scientific 1997

Získat plný text

Kniha

Přidat do oblíbených

Uloženo v:
5

Načítá se…

Algorithms and architectures for parallel processing: 1997 3rd international conference, Melbourne, Australia, December 10-12 1997 Autor Goscinski, Andrzej, Zhou, Wanlei, Hobbs, Michael

ISBN: 0780342291, 9780780342293

Vydáno: World Scientific Publishing Co. Pte. Ltd 1997

“…This volume of proceedings describes the lower costs and higher degrees of integration of chip architecture which allow parallel processing…”

Získat plný text

E-kniha

Přidat do oblíbených

Uloženo v:
6

Načítá se…

Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses Autor Ai, Yang, Ling, Zhen-Hua

ISSN: 2379-190X

Vydáno: IEEE 04.06.2023

Vydáno v Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) (04.06.2023)
“… The proposed model is a cascade of a residual convolutional network and a parallel estimation architecture…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
7

Načítá se…

InnerSP: A Memory Efficient Sparse Matrix Multiplication Accelerator with Locality-Aware Inner Product Processing Autor Baek, Daehyeon, Hwang, Soojin, Heo, Taekyung, Kim, Daehoon, Huh, Jaehyuk

Vydáno: IEEE 01.09.2021

Vydáno v 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)
“… To mitigate the memory access overheads, recent accelerator designs advocated the outer product processing which minimizes input accesses but generates intermediate products to be merged to the final output matrix…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
8

Načítá se…

Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs Autor Wang, Pengyu, Li, Chao, Wang, Jing, Wang, Taolei, Zhang, Lu, Leng, Jingwen, Chen, Quan, Guo, Minyi

Vydáno: IEEE 01.09.2021

Vydáno v 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)
“…Graph sampling and random walk operations, capturing the structural properties of graphs, are playing an important role today as we cannot directly adopt computing-intensive algorithms on large-scale graphs…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
9

Načítá se…

Splitwise: Efficient Generative LLM Inference Using Phase Splitting Autor Patel, Pratyush, Choukse, Esha, Zhang, Chaojie, Shah, Aashaka, Goiri, Inigo, Maleki, Saeed, Bianchini, Ricardo

Vydáno: IEEE 29.06.2024

Vydáno v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“…Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
10

Načítá se…

Accelerating Graph Convolutional Networks Using Crossbar-based Processing-In-Memory Architectures Autor Huang, Yu, Zheng, Long, Yao, Pengcheng, Wang, Qinggang, Liao, Xiaofei, Jin, Hai, Xue, Jingling

ISSN: 2378-203X

Vydáno: IEEE 01.04.2022

Vydáno v Proceedings - International Symposium on High-Performance Computer Architecture (01.04.2022)
“… efficiency.In this paper, we present a new GCN accelerator, RE-FLIP, with three key innovations in terms of architecture design, algorithm mappings, and practical implementations…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
11

Načítá se…

A Hybrid Systolic-Dataflow Architecture for Inductive Matrix Algorithms Autor Weng, Jian, Liu, Sihao, Wang, Zhengrong, Dadu, Vidushi, Nowatzki, Tony

ISSN: 2378-203X

Vydáno: IEEE 01.02.2020

Vydáno v Proceedings - International Symposium on High-Performance Computer Architecture (01.02.2020)
“… in the hardware/software interface, then a spatial architecture could efficiently execute parallel code regions…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
12

Načítá se…

pSyncPIM: Partially Synchronous Execution of Sparse Matrix Operations for All-Bank PIM Architectures Autor Baek, Daehyeon, Hwang, Soojin, Huh, Jaehyuk

Vydáno: IEEE 29.06.2024

Vydáno v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“… Sparse matrix processing is another critical computation that can significantly benefit from the PIM architecture, but the current all-bank PIM control cannot support diverging executions due to the random sparsity…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
13

Načítá se…

MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems Autor Hsia, Samuel, Golden, Alicia, Acun, Bilge, Ardalani, Newsha, DeVito, Zachary, Wei, Gu-Yeon, Brooks, David, Wu, Carole-Jean

Vydáno: IEEE 29.06.2024

Vydáno v 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29.06.2024)
“…Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
14

Načítá se…

Parallelizing Maximal Clique Enumeration on GPUs Autor Almasri, Mohammad, Chang, Yen-Hsiang, Hajj, Izzat El, Nagi, Rakesh, Xiong, Jinjun, Hwu, Wen-mei

Vydáno: IEEE 21.10.2023

Vydáno v 2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT) (21.10.2023)
“…We present a GPU solution for exact maximal clique enumeration (MCE) that performs a search tree traversal following the Bron-Kerbosch algorithm…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
15

Načítá se…

PolyGraph: Exposing the Value of Flexibility for Graph Processing Accelerators Autor Dadu, Vidushi, Liu, Sihao, Nowatzki, Tony

ISSN: 2575-713X

Vydáno: IEEE 01.06.2021

Vydáno v Proceedings - International Symposium on Computer Architecture (01.06.2021)
“… First, we identify a taxonomy of key algorithm variants. Then we develop a template architecture (PolyGraph…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
16

Načítá se…

MnnFast: A Fast and Scalable System Architecture for Memory-Augmented Neural Networks Autor Jang, Hanhwi, Kim, Joonsung, Jo, Jae-Eon, Lee, Jaewon, Kim, Jangwoo

ISSN: 2575-713X

Vydáno: ACM 01.06.2019

Vydáno v 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (01.06.2019)
“… Such large-scale memory networks provide excellent reasoning power; however, the current computer infrastructure cannot achieve scalable performance due to its limited system architecture…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
17

Načítá se…

An In-Network Architecture for Accelerating Shared-Memory Multiprocessor Collectives Autor Klenk, Benjamin, Jiang, Nan, Thorson, Greg, Dennison, Larry

Vydáno: IEEE 01.05.2020

Vydáno v 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA) (01.05.2020)
“…The slowdown of single-chip performance scaling combined with the growing demands of computing ever larger problems efficiently has led to a renewed interest in distributed architectures and specialized hardware…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
18

Načítá se…

DRRA-based Reconfigurable Architecture for Mixed-Radix FFT Autor Kallapu, Reeshita, Stathis, Dimitrios, Boppu, Srinivas, Hemani, Ahmed

ISSN: 2380-6923

Vydáno: IEEE 01.01.2023

Vydáno v VLSI design (01.01.2023)
“… In this paper, we propose an architecture for the implementation of the FFT that is derived from the Dynamically Reconfigurable Resource Array and has multiple parallel processing cells while also…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
19

Načítá se…

Seer: Predictive Runtime Kernel Selection for Irregular Problems Autor Swann, Ryan, Osama, Muhammad, Sangaiah, Karthik, Mahmud, Jalal

ISSN: 2643-2838

Vydáno: IEEE 02.03.2024

Vydáno v Proceedings / International Symposium on Code Generation and Optimization (02.03.2024)
“…Modern GPUs are designed for regular problems and suffer from load imbalance when processing irregular data…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:
20

Načítá se…

Accelerating Fourier and Number Theoretic Transforms using Tensor Cores and Warp Shuffles Autor Durrani, Sultan, Chughtai, Muhammad Saad, Hidayetoglu, Mert, Tahir, Rashid, Dakkak, Abdul, Rauchwerger, Lawrence, Zaffar, Fareed, Hwu, Wen-mei

Vydáno: IEEE 01.09.2021

Vydáno v 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)
“… To speed things up, fast Fourier transform (FFT) algorithms, which are reduced-complexity formulations for computing the DFT of a sequence, have been proposed and implemented for traditional processors and their corresponding instruction sets…”

Získat plný text

Konferenční příspěvek

Přidat do oblíbených

Uloženo v:

Výsledky vyhledávání - IEEE International Conference on Algorithms AND Architectures for Parallel Processing~

ICAPP 95 : IEEE First ICA[3]PP : IEEE First International Conference on Algorithms and Architectures for Parallel Processing, Brisbane, Australia, 19-21 April, 1995 Autor IEEE International Conference on Algorithms and Architectures for Parallel Processing, Narasimhan, V. L.

Proceedings fifth International Conference on algorithms and architectures for parallel processing Autor IEEE International Conference on Algorithms and Architectures for Parallel Processing, Zhou, Wanlei

Algorithms and architectures for parallel processing: 1997 3rd international conference, Melbourne, Australia, December 10-12 1997 Autor Goscinski, Andrzej, Zhou, Wanlei, Hobbs, Michael

Neural Speech Phase Prediction Based on Parallel Estimation Architecture and Anti-Wrapping Losses Autor Ai, Yang, Ling, Zhen-Hua

InnerSP: A Memory Efficient Sparse Matrix Multiplication Accelerator with Locality-Aware Inner Product Processing Autor Baek, Daehyeon, Hwang, Soojin, Heo, Taekyung, Kim, Daehoon, Huh, Jaehyuk

Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs Autor Wang, Pengyu, Li, Chao, Wang, Jing, Wang, Taolei, Zhang, Lu, Leng, Jingwen, Chen, Quan, Guo, Minyi

Splitwise: Efficient Generative LLM Inference Using Phase Splitting Autor Patel, Pratyush, Choukse, Esha, Zhang, Chaojie, Shah, Aashaka, Goiri, Inigo, Maleki, Saeed, Bianchini, Ricardo

Accelerating Graph Convolutional Networks Using Crossbar-based Processing-In-Memory Architectures Autor Huang, Yu, Zheng, Long, Yao, Pengcheng, Wang, Qinggang, Liao, Xiaofei, Jin, Hai, Xue, Jingling

A Hybrid Systolic-Dataflow Architecture for Inductive Matrix Algorithms Autor Weng, Jian, Liu, Sihao, Wang, Zhengrong, Dadu, Vidushi, Nowatzki, Tony

pSyncPIM: Partially Synchronous Execution of Sparse Matrix Operations for All-Bank PIM Architectures Autor Baek, Daehyeon, Hwang, Soojin, Huh, Jaehyuk

MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems Autor Hsia, Samuel, Golden, Alicia, Acun, Bilge, Ardalani, Newsha, DeVito, Zachary, Wei, Gu-Yeon, Brooks, David, Wu, Carole-Jean

Parallelizing Maximal Clique Enumeration on GPUs Autor Almasri, Mohammad, Chang, Yen-Hsiang, Hajj, Izzat El, Nagi, Rakesh, Xiong, Jinjun, Hwu, Wen-mei

PolyGraph: Exposing the Value of Flexibility for Graph Processing Accelerators Autor Dadu, Vidushi, Liu, Sihao, Nowatzki, Tony

MnnFast: A Fast and Scalable System Architecture for Memory-Augmented Neural Networks Autor Jang, Hanhwi, Kim, Joonsung, Jo, Jae-Eon, Lee, Jaewon, Kim, Jangwoo

An In-Network Architecture for Accelerating Shared-Memory Multiprocessor Collectives Autor Klenk, Benjamin, Jiang, Nan, Thorson, Greg, Dennison, Larry

DRRA-based Reconfigurable Architecture for Mixed-Radix FFT Autor Kallapu, Reeshita, Stathis, Dimitrios, Boppu, Srinivas, Hemani, Ahmed

Seer: Predictive Runtime Kernel Selection for Irregular Problems Autor Swann, Ryan, Osama, Muhammad, Sangaiah, Karthik, Mahmud, Jalal

Accelerating Fourier and Number Theoretic Transforms using Tensor Cores and Warp Shuffles Autor Durrani, Sultan, Chughtai, Muhammad Saad, Hidayetoglu, Mert, Tahir, Rashid, Dakkak, Abdul, Rauchwerger, Lawrence, Zaffar, Fareed, Hwu, Wen-mei

Vyhledávací nástroje:

Upřesnit hledání

Médium

Předmětová oblast

Téma

Jazyk

Rok vydání