Invited Paper: Software/Hardware Co-design for LLM and Its Application for Design Verification

The widespread adoption of Large Language Models (LLMs) is impeded by their demanding compute and memory resources. The first task of this paper is to explore optimization strategies to expedite LLMs, including quantization, pruning, and operation-level optimizations. One unique direction is to opti...

Full description

Saved in:
Bibliographic Details
Published in:Proceedings of the ASP-DAC ... Asia and South Pacific Design Automation Conference pp. 435 - 441
Main Authors: Wan, Lily Jiaxin, Huang, Yingbing, Li, Yuhong, Ye, Hanchen, Wang, Jinghua, Zhang, Xiaofan, Chen, Deming
Format: Conference Proceeding
Language:English
Published: IEEE 22.01.2024
Subjects:
ISSN:2153-697X
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first