AToM: Adaptive Token Merging for Efficient Acceleration of Vision Transformer

Recently, Vision Transformers (ViTs) have set a new standard in computer vision (CV), showing unparalleled image processing performance. However, their substantial computational requirements hinder practical deployment, especially on resource-limited devices common in CV applications. Token merging...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on computers Vol. 74; no. 5; pp. 1620 - 1633
Main Authors: Shin, Jaekang, Kang, Myeonggu, Han, Yunki, Park, Junyoung, Kim, Lee-Sup
Format: Journal Article
Language:English
Published: IEEE 01.05.2025
Subjects:
ISSN:0018-9340, 1557-9956
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first