EAPT: Efficient Attention Pyramid Transformer for Image Processing

Recent transformer-based models, especially patch-based methods, have shown great potential in vision tasks. However, splitting the input features into fixed-size patches ignores the fact that visual elements often vary, and thus may destroy the semantic i...

Bibliographic details
Published in:	IEEE Transactions on Multimedia, Vol. 25, pp. 50-61
Main authors:	Lin, Xiao; Sun, Shuzhou; Huang, Wei; Sheng, Bin; Li, Ping; Feng, David Dagan
Format:	Journal Article
Language:	English
Published:	Piscataway: IEEE (The Institute of Electrical and Electronics Engineers, Inc.), 2023
Subjects:	Ablation; attention mechanism; classification; Communication; Convolutional neural networks; Costs; Encoding; Feature extraction; Formability; Image classification; Image processing; Image segmentation; Modules; object detection; Object recognition; Patches (structures); pyramid; Semantic segmentation; Semantics; Task analysis; Transformer; Transformers
ISSN:	1520-9210, 1941-0077