Edge-LLM: A Collaborative Framework for Large Language Model Serving in Edge Computing
The rapid advancement and widespread adoption of Large Language Models (LLMs) are milestones in artificial intelligence. Although Parameter-Efficient Transfer Learning (PETL) methods, also known as adapters, have lowered the barrier to fine-tuning and inference on LLMs, it becomes a chall...
| Published in: | Proceedings (IEEE International Conference on Web Services. Online) pp. 799 - 809 |
|---|---|
| Main Authors: | , , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: | IEEE, 07.07.2024 |
| Subjects: | |
| ISSN: | 2836-3868 |