Edge-LLM: A Collaborative Framework for Large Language Model Serving in Edge Computing

The rapid advancement and extensive implementation of Large Language Models (LLMs) are milestones in the realm of artificial intelligence. Although Parameter-Efficient Transfer Learning (PETL), a.k.a. Adapter, methods have reduced the barrier for fine-tuning and inference on LLMs, it becomes a chall...

Full description

Saved in:
Bibliographic Details
Published in:Proceedings (IEEE International Conference on Web Services. Online) pp. 799 - 809
Main Authors: Cai, Fenglong, Yuan, Dong, Yang, Zhe, Cui, Lizhen
Format: Conference Proceeding
Language:English
Published: IEEE 07.07.2024
Subjects:
ISSN:2836-3868
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first