Invited Paper: Software/Hardware Co-design for LLM and Its Application for Design Verification
The widespread adoption of Large Language Models (LLMs) is impeded by their demanding compute and memory resources. The first task of this paper is to explore optimization strategies to expedite LLMs, including quantization, pruning, and operation-level optimizations. One unique direction is to opti...
Saved in:
| Published in: | Proceedings of the ASP-DAC ... Asia and South Pacific Design Automation Conference pp. 435 - 441 |
|---|---|
| Main Authors: | , , , , , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
IEEE
22.01.2024
|
| Subjects: | |
| ISSN: | 2153-697X |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!