AToM: Adaptive Token Merging for Efficient Acceleration of Vision Transformer

Recently, Vision Transformers (ViTs) have set a new standard in computer vision (CV), showing unparalleled image processing performance. However, their substantial computational requirements hinder practical deployment, especially on resource-limited devices common in CV applications. Token merging...

Bibliographic Details
Published in:	IEEE Transactions on Computers, Vol. 74, no. 5, pp. 1620-1633
Main Authors:	Shin, Jaekang, Kang, Myeonggu, Han, Yunki, Park, Junyoung, Kim, Lee-Sup
Format:	Journal Article
Language:	English
Published:	IEEE 01.05.2025
Subjects:	algorithm-architecture co-design; Computational efficiency; Computational modeling; Computer architecture; Computer vision; Computers; DNN accelerator; Graphics processing units; Hardware; Heuristic algorithms; Merging; token merge; transformer-based computer vision; Transformers
ISSN:	0018-9340, 1557-9956