Invited Paper: Software/Hardware Co-design for LLM and Its Application for Design Verification

The widespread adoption of Large Language Models (LLMs) is impeded by their heavy compute and memory requirements. The first task of this paper is to explore optimization strategies for accelerating LLMs, including quantization, pruning, and operation-level optimizations. One unique direction is to opti...

Bibliographic details
Published in: Proceedings of the ASP-DAC ... Asia and South Pacific Design Automation Conference, pp. 435-441
Main authors: Wan, Lily Jiaxin; Huang, Yingbing; Li, Yuhong; Ye, Hanchen; Wang, Jinghua; Zhang, Xiaofan; Chen, Deming
Format: Conference proceeding
Language: English
Published: IEEE, 22 January 2024
ISSN: 2153-697X