Rep ViT: Revisiting Mobile CNN From ViT Perspective

Recently, lightweight Vision Transformers (ViTs) demon-strate superior performance and lower latency, compared with lightweight Convolutional Neural Networks (CNNs), on resource-constrained mobile devices. Researchers have discovered many structural connections be-tween lightweight ViTs and lightwei...

Full description

Saved in:

Bibliographic Details
Published in:	Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 15909 - 15920
Main Authors:	Wang, Ao, Chen, Hui, Lin, Zijia, Han, Jungong, Ding, Guiguang
Format:	Conference Proceeding
Language:	English
Published:	IEEE 16.06.2024
Subjects:	Accuracy CNN Codes Computational modeling Computer vision Mobile handsets Performance evaluation Transformers ViT
ISSN:	1063-6919
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Be the first to leave a comment!