BoxeR: Box-Attention for 2D and 3D Transformers

In this paper, we propose a simple attention mechanism that we call Box-Attention. It enables spatial interaction between grid features, as sampled from boxes of interest, and improves the learning capability of transformers for several vision tasks. Specifically, we present BoxeR, short for Box Transfo...

Bibliographic details
Published in:	Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online), pp. 4763-4772
Main authors:	Nguyen, Duy-Kien; Ju, Jihong; Booij, Olaf; Oswald, Martin R.; Snoek, Cees G. M.
Format:	Conference paper
Language:	English
Published:	IEEE, 1 June 2022
Subjects:	Codes; Computer vision; Deep learning architectures and techniques; Object detection; Pattern recognition; Recognition: detection, categorization, retrieval; Segmentation, grouping and shape analysis; Task analysis; Three-dimensional displays; Transformers
ISSN:	1063-6919