Search Results - Hardware / Integrated circuits / Reconfigurable logic and FPGAs
1. DSPlacer: DSP Placement for FPGA-based CNN Accelerator
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…Deploying convolutional neural networks (CNNs) on hardware platforms like Field Programmable Gate Arrays (FPGAs…”
2. Caffeine: Towards uniformed representation and acceleration for deep convolutional neural networks
Conference Proceeding. ISSN: 1558-2434. Published: ACM, 01.11.2016, in Digest of Technical Papers - IEEE/ACM International Conference on Computer-Aided Design.
“…In this paper we design and implement Caffeine, a hardware/software co-designed library to efficiently accelerate the entire CNN on FPGAs…”
3. Gemmini: Enabling Systematic Deep-Learning Architecture Evaluation via Full-Stack Integration
Conference Proceeding. Published: IEEE, 05.12.2021, in 2021 58th ACM/IEEE Design Automation Conference (DAC).
“…DNN accelerators are often developed and evaluated in isolation without considering the cross-stack, system-level effects in real-world environments. This…”
4. SGX-FPGA: Trusted Execution Environment for CPU-FPGA Heterogeneous Architecture
Conference Proceeding. Published: IEEE, 05.12.2021, in 2021 58th ACM/IEEE Design Automation Conference (DAC).
“…To fill the gap, we present SGX-FPGA, a trusted hardware isolation path enabling the first FPGA TEE by bridging SGX enclaves and FPGAs in the heterogeneous CPU-FPGA architecture…”
5. FPGA-TrustZone: Security Extension of TrustZone to FPGA for SoC-FPGA Heterogeneous Architecture
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…Experiments on real SoC-FPGA hardware development boards show that FPGA-TrustZone provides high security with low performance overhead…”
6. BlockGNN: Towards Efficient GNN Acceleration Using Block-Circulant Weight Matrices
Conference Proceeding. Published: IEEE, 05.12.2021, in 2021 58th ACM/IEEE Design Automation Conference (DAC).
“…platforms. To tackle this challenge, we propose BlockGNN, a software-hardware co-design approach to realize efficient GNN acceleration…”
7. High-Performance FPGA-based Accelerator for Bayesian Neural Networks
Conference Proceeding. Published: IEEE, 05.12.2021, in 2021 58th ACM/IEEE Design Automation Conference (DAC).
“…This work proposes a novel FPGA-based hardware architecture to accelerate BNNs inferred through Monte Carlo Dropout…”
8. An Enhanced Data Packing Method for General Matrix Multiplication in Brakerski/Fan-Vercauteren Scheme
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…Furthermore, we design specialized hardware…”
9. Configurable DSP-Based CAM Architecture for Data-Intensive Applications on FPGAs
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…They have been used in many domains, such as networking, databases, and graph processing. Field-programmable gate arrays (FPGAs…”
10. XShift: FPGA-efficient Binarized LLM with Joint Quantization and Sparsification
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…a specialized inference framework. In response, we introduce XShift, an algorithm-hardware co-design framework optimized for efficient binarized LLM inference on FPGAs…”
11. WSQ-AdderNet: Efficient Weight Standardization based Quantized AdderNet FPGA Accelerator Design with High-Density INT8 DSP-LUT Co-Packing Optimization
Conference Proceeding. ISSN: 1558-2434. Published: ACM, 29.10.2022, in 2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD).
“…Recent proposals on hardware-optimal neural network architectures suggest that AdderNet with a lightweight ℓ…”
12. DuoQ: A DSP Utilization-aware and Outlier-free Quantization for FPGA-based LLMs Acceleration
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…To address this problem, we introduce DuoQ, an FPGA-oriented algorithm-hardware co-design framework…”
13. April: Accuracy-Improved Floating-Point Approximation For Neural Network Accelerators
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…Floating-point approximation, such as Mitchell's logarithm, enables floating-point multiplication using simpler integer additions, thereby improving hardware efficiency…”
14. An Efficient Bit-level Sparse MAC-accelerated Architecture with SW/HW Co-design on FPGA
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…The reconfigurable platform offers possibilities for identifying the bit-level unstructured redundancy during inference with different DNN models…”
15. FLAG: An FPGA-Based System for Low-Latency GNN Inference Service Using Vector Quantization
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…In this paper, we propose FLAG, an FPGA-based GNN inference serving system using vector quantization…”
16. An Algorithm-Hardware Co-design Based on Revised Microscaling Format Quantization for Accelerating Large Language Models
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…However, deploying such a new format into existing hardware systems is still challenging, and the dominant solution for LLM inference at low precision is still low-bit quantization…”
17. Classifying Computations on Multi-Tenant FPGAs
Conference Proceeding. Published: IEEE, 05.12.2021, in 2021 58th ACM/IEEE Design Automation Conference (DAC).
“…Modern data centers leverage large FPGAs to provide low latency, high throughput, and low energy computation…”
18. KLiNQ: Knowledge Distillation-Assisted Lightweight Neural Network for Qubit Readout on FPGA
Conference Proceeding. Published: IEEE, 22.06.2025, in 2025 62nd ACM/IEEE Design Automation Conference (DAC).
“…While current methods, including deep neural networks, enhance readout accuracy, they typically lack support for mid-circuit measurements essential for quantum error correction, and they usually rely…”
19. CoSPARSE: A Software and Hardware Reconfigurable SpMV Framework for Graph Analytics
Conference Proceeding. Published: IEEE, 05.12.2021, in 2021 58th ACM/IEEE Design Automation Conference (DAC).
“…reconfiguration as a synergistic solution to accelerate SpMV-based graph analytics algorithms. Building on previously proposed general-purpose reconfigurable hardware…”
20. ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization
Conference Proceeding. Published: IEEE, 01.10.2022, in 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO).
“…Even though this line of work brings algorithmic benefits, it also introduces significant hardware overheads due to variable-length encoding…”