Search results - "Hardware Integrated circuits Reconfigurable logic and FPGAs Hardware accelerators"
-
1
Ambit: in-memory accelerator for bulk bitwise operations using commodity DRAM technology
ISBN: 1450349528, 9781450349529
ISSN: 2379-3155
Publication details: New York, NY, USA: ACM, 14 October 2017
Published in: MICRO-50: the 50th annual IEEE/ACM International Symposium on Microarchitecture: proceedings: October 14-18, 2017, Cambridge, MA (14 October 2017)
“…Many important applications trigger bulk bitwise operations, i.e., bitwise operations on large bit vectors. In fact, recent works design techniques that…”
Conference paper
-
2
Gemmini: Enabling Systematic Deep-Learning Architecture Evaluation via Full-Stack Integration
Publication details: IEEE, 5 December 2021
Published in: 2021 58th ACM/IEEE Design Automation Conference (DAC) (5 December 2021)
“…DNN accelerators are often developed and evaluated in isolation without considering the cross-stack, system-level effects in real-world environments. This…”
Conference paper
-
3
GAMMA: Automating the HW Mapping of DNN Models on Accelerators via Genetic Algorithm
ISSN: 1558-2434
Publication details: Association for Computing Machinery, 2 November 2020
Published in: Digest of technical papers - IEEE/ACM International Conference on Computer-Aided Design (2 November 2020)
“…DNN layers are multi-dimensional loops that can be ordered, tiled, and scheduled in myriad ways across space and time on DNN accelerators. Each of these…”
Conference paper
-
4
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization
Publication details: IEEE, 1 October 2022
Published in: 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO) (1 October 2022)
“…Quantization is a technique to reduce the computation and memory cost of DNN models, which are getting increasingly large. Existing quantization solutions use…”
Conference paper
-
5
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Publication details: IEEE, 1 September 2021
Published in: 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (1 September 2021)
“…Emerging edge computing platforms often contain machine learning (ML) accelerators that can accelerate inference for a wide range of neural network (NN)…”
Conference paper
-
6
CoNDA: Efficient Cache Coherence Support for Near-Data Accelerators
ISSN: 2575-713X
Publication details: ACM, 1 June 2019
Published in: 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (1 June 2019)
“…Specialized on-chip accelerators are widely used to improve the energy efficiency of computing systems. Recent advances in memory technology have enabled…”
Conference paper
-
7
Adaptable Butterfly Accelerator for Attention-based NNs via Hardware and Algorithm Co-design
Publication details: IEEE, 1 October 2022
Published in: 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO) (1 October 2022)
“…Attention-based neural networks have become pervasive in many AI tasks. Despite their excellent algorithmic performance, the use of the attention mechanism and…”
Conference paper
-
8
NAAS: Neural Accelerator Architecture Search
Publication details: IEEE, 5 December 2021
Published in: 2021 58th ACM/IEEE Design Automation Conference (DAC) (5 December 2021)
“…Data-driven, automatic design space exploration of neural accelerator architecture is desirable for specialization and productivity. Previous frameworks focus…”
Conference paper
-
9
Laconic Deep Learning Inference Acceleration
ISSN: 2575-713X
Publication details: ACM, 1 June 2019
Published in: 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (1 June 2019)
“…We present a method for transparently identifying ineffectual computations during inference with Deep Learning models. Specifically, by decomposing…”
Conference paper
-
10
DFX: A Low-latency Multi-FPGA Appliance for Accelerating Transformer-based Text Generation
Publication details: IEEE, 1 October 2022
Published in: 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO) (1 October 2022)
“…Transformer is a deep learning language model widely used for natural language processing (NLP) services in datacenters. Among transformer models, Generative…”
Conference paper
-
11
PolySA: Polyhedral-Based Systolic Array Auto-Compilation
ISSN: 1558-2434
Publication details: ACM, 1 November 2018
Published in: 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (1 November 2018)
“…Automatic systolic array generation has long been an interesting topic due to the need to reduce the lengthy development cycles of manual designs. Existing…”
Conference paper
-
12
CrossLight: A Cross-Layer Optimized Silicon Photonic Neural Network Accelerator
Publication details: IEEE, 5 December 2021
Published in: 2021 58th ACM/IEEE Design Automation Conference (DAC) (5 December 2021)
“…Domain-specific neural network accelerators have seen growing interest in recent years due to their improved energy efficiency and performance compared to CPUs…”
Conference paper
-
13
Caffeine: Towards uniformed representation and acceleration for deep convolutional neural networks
ISSN: 1558-2434
Publication details: ACM, 1 November 2016
Published in: Digest of technical papers - IEEE/ACM International Conference on Computer-Aided Design (1 November 2016)
“…With the recent advancement of multilayer convolutional neural networks (CNN), deep learning has achieved amazing success in many areas, especially in visual…”
Conference paper
-
14
SODA: Stencil with Optimized Dataflow Architecture
ISSN: 1558-2434
Publication details: ACM, 1 November 2018
Published in: 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (1 November 2018)
“…Stencil computation is one of the most important kernels in many application domains such as image processing, solving partial differential equations, and…”
Conference paper
-
15
MnnFast: A Fast and Scalable System Architecture for Memory-Augmented Neural Networks
ISSN: 2575-713X
Publication details: ACM, 1 June 2019
Published in: 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (1 June 2019)
“…Memory-augmented neural networks are getting more attention from many researchers as they can make an inference with the previous history stored in memory…”
Conference paper
-
16
Efficient Hardware Acceleration of CNNs using Logarithmic Data Representation with Arbitrary log-base
ISSN: 1558-2434
Publication details: ACM, 1 November 2018
Published in: 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (1 November 2018)
“…Efficient acceleration of Deep Neural Networks is a manifold task. In order to save memory requirements and reduce energy consumption we propose the use of…”
Conference paper
-
17
LLMCompass: Enabling Efficient Hardware Design for Large Language Model Inference
Publication details: IEEE, 29 June 2024
Published in: 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29 June 2024)
“…The past year has witnessed the increasing popularity of Large Language Models (LLMs). Their unprecedented scale and associated high hardware cost have impeded…”
Conference paper
-
18
TGPA: Tile-Grained Pipeline Architecture for Low Latency CNN Inference
ISSN: 1558-2434
Publication details: ACM, 1 November 2018
Published in: 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (1 November 2018)
“…FPGAs are more and more widely used as reconfigurable hardware accelerators for applications leveraging convolutional neural networks (CNNs) in recent years…”
Conference paper
-
19
Energy-Efficient Video Processing for Virtual Reality
ISSN: 2575-713X
Publication details: ACM, 1 June 2019
Published in: 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (1 June 2019)
“…Virtual reality (VR) has huge potential to enable radically new applications, behind which spherical panoramic video processing is one of the backbone…”
Conference paper
-
20
MECLA: Memory-Compute-Efficient LLM Accelerator with Scaling Sub-matrix Partition
Publication details: IEEE, 29 June 2024
Published in: 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29 June 2024)
“…Large language models (LLMs) have been showing surprising performance in processing language tasks, bringing a new prevalence to deploy LLM from cloud to edge…”
Conference paper

