Výsledky vyhledávání - Computation/Dataflow Optimization
-
1
MnnFast: A Fast and Scalable System Architecture for Memory-Augmented Neural Networks
ISSN: 2575-713XVydáno: ACM 01.06.2019Vydáno v 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA) (01.06.2019)“…Memory-augmented neural networks are getting more attention from many researchers as they can make an inference with the previous history stored in memory…”
Získat plný text
Konferenční příspěvek -
2
NLP-Fast: A Fast, Scalable, and Flexible System to Accelerate Large-Scale Heterogeneous NLP Models
Vydáno: IEEE 01.09.2021Vydáno v 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01.09.2021)“…: three end-to-end optimization techniques to accelerate…”
Získat plný text
Konferenční příspěvek -
3
Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators
ISSN: 2576-6996Vydáno: IEEE 01.10.2020Vydáno v Proceedings - IEEE International Conference on Computer Design (01.10.2020)“… Recognizing the need for efficient processing of complex spatiotemporal data while considering the all-or-none nature of spiking activities, we propose holistic reconfigurable dataflow optimization…”
Získat plný text
Konferenční příspěvek -
4
VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference
ISSN: 2378-203XVydáno: IEEE 01.03.2025Vydáno v Proceedings - International Symposium on High-Performance Computer Architecture (01.03.2025)“… and uncoordinated computation dataflow. Meanwhile, the diversity of VQ algorithms (e.g., different vector sizes and entry counts…”
Získat plný text
Konferenční příspěvek -
5
Techniques for Efficient Performance Analysis and Memory Optimization in Mapping Dataflow Models of Computation Onto Embedded Systems
ISBN: 9798346386964Vydáno: ProQuest Dissertations & Theses 01.01.2024“…The power of modern multi-core and many-core platforms is an excellent fit for meeting the performance needs of embedded software applications. However, there…”
Získat plný text
Dissertation -
6
A high-performance dataflow-centric optimization framework for deep learning inference on the edge
ISSN: 1383-7621, 1873-6165Vydáno: Elsevier B.V 01.07.2024Vydáno v Journal of systems architecture (01.07.2024)“… Targeting the existing drawbacks of operator-centric frameworks, we design Xenos, which can automatically conduct dataflow-centric optimization of the computation graph and accelerate inference in two dimensions…”
Získat plný text
Journal Article -
7
SWG: an architecture for sparse weight gradient computation
ISSN: 1674-733X, 1869-1919Vydáno: Beijing Science China Press 01.02.2024Vydáno v Science China. Information sciences (01.02.2024)“… Nevertheless, exploiting the optimization opportunities would meet three underutilization problems, which are caused by (1…”
Získat plný text
Journal Article -
8
AttentionLib: A Scalable Optimization Framework for Automated Attention Acceleration on FPGA
ISSN: 1558-1101Vydáno: EDAA 31.03.2025Vydáno v Proceedings - Design, Automation, and Test in Europe Conference and Exhibition (31.03.2025)“… AttentionLib automatically performs fusion dataflow optimization for attention computations and generates high-level synthesis code in compliance…”
Získat plný text
Konferenční příspěvek -
9
Efficient N:M Sparse DNN Training Using Algorithm, Architecture, and Dataflow Co-Design
ISSN: 0278-0070, 1937-4151Vydáno: New York IEEE 01.02.2024Vydáno v IEEE transactions on computer-aided design of integrated circuits and systems (01.02.2024)“…Sparse training is one of the promising techniques to reduce the computational cost of DNNs while retaining high accuracy. In particular, N:M fine-grained…”
Získat plný text
Journal Article -
10
Unleashing Network/Accelerator Co-Exploration Potential on FPGAs: A Deeper Joint Search
ISSN: 0278-0070, 1937-4151Vydáno: New York IEEE 01.10.2024Vydáno v IEEE transactions on computer-aided design of integrated circuits and systems (01.10.2024)“…Recently, algorithm-hardware (HW) co-exploration for neural networks (NNs) has become the key to obtaining high-quality solutions. However, previous efforts…”
Získat plný text
Journal Article -
11
Algorithm/Hardware Co-optimization for Sparsity-Aware SpMM Acceleration of GNNs
ISSN: 0278-0070, 1937-4151Vydáno: New York IEEE 01.12.2023Vydáno v IEEE transactions on computer-aided design of integrated circuits and systems (01.12.2023)“… So in this paper, we demonstrate an algorithm/hardware co-optimization chance to enhance SpMM acceleration for GNNs…”
Získat plný text
Journal Article -
12
SCnC: Efficient Unification of Streaming with Dynamic Task Parallelism
ISSN: 0885-7458, 1573-7640Vydáno: New York Springer US 01.04.2016Vydáno v International journal of parallel programming (01.04.2016)“… This work shows that it is possible to exploit streaming as a safe and automatic optimization of a more general dataflow-based model…”
Získat plný text
Journal Article -
13
DiMO-Sparse: Differentiable Modeling and Optimization of Sparse CNN Dataflow and Hardware Architecture
ISSN: 1558-1101Vydáno: EDAA 25.03.2024Vydáno v Proceedings - Design, Automation, and Test in Europe Conference and Exhibition (25.03.2024)“… To the best of our knowledge, this paper presents the first systematic investigation of automatic dataflow and hardware optimization for sparse CNN computation…”
Získat plný text
Konferenční příspěvek -
14
Mechanisms Towards Energy-Efficient Dynamic Hardware Specialization
ISBN: 9781321384222, 132138422XVydáno: ProQuest Dissertations & Theses 01.01.2014“…In the past few decades, Von Neumann superscalar processors have been the prevalent approach for general purpose processing. Hardware specialization, as a…”
Získat plný text
Dissertation -
15
An FPGA-based efficient accelerator for fault interaction of rupture dynamics
ISSN: 1573-0484, 0920-8542, 1573-0484Vydáno: New York Springer Nature B.V 10.09.2025Vydáno v The Journal of supercomputing (10.09.2025)“…Efficiently predicting aftershocks based on rupture dynamics simulation is a crucial task in high-performance computing, traditionally dependent on…”
Získat plný text
Journal Article -
16
SpikeFlow: A hardware–software co-designed systolic array for spiking neural networks
ISSN: 1383-7621Vydáno: Elsevier B.V 01.12.2025Vydáno v Journal of systems architecture (01.12.2025)“…Spiking neural networks (SNNs), often referred to as third-generation neural networks, offer substantial advantages in efficiency and power consumption, which…”
Získat plný text
Journal Article -
17
CHAUS: Scalable VM-Based Channels for Unbounded Streaming
ISSN: 1000-9000, 1860-4749Vydáno: New York Springer US 01.11.2017Vydáno v Journal of computer science and technology (01.11.2017)“…Stream processing is a special form of the dataflow execution model that offers extensive opportunities for optimization and automatic parallelism…”
Získat plný text
Journal Article -
18
SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous Driving
ISSN: 2378-203XVydáno: IEEE 02.03.2024Vydáno v Proceedings - International Symposium on High-Performance Computer Architecture (02.03.2024)“…3D object detection using point cloud (PC) data is essential for perception pipelines of autonomous driving, where efficient encoding is key to meeting…”
Získat plný text
Konferenční příspěvek -
19
Dataflow computing models, languages, and machines for intelligence computations
ISSN: 0098-5589, 1939-3520Vydáno: New York, NY IEEE 01.12.1988Vydáno v IEEE Transactions on Software Engineering (01.12.1988)“…The authors compare dataflow computing models, languages, and dataflow computing machines for numerical and nonnumerical computations. The…”
Získat plný text
Journal Article -
20
PolyJuice: Detecting Mis-compilation Bugs in Tensor Compilers with Equality Saturation Based Rewriting
ISSN: 2475-1421, 2475-1421Vydáno: New York, NY, USA ACM 08.10.2024Vydáno v Proceedings of ACM on programming languages (08.10.2024)“… The main challenge is to construct equivalent graphs capable of efficiently exploring the diverse optimization logic during compilation…”
Získat plný text
Journal Article

