Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing

Recently, vision transformer (ViT) based multimodal learning methods have been proposed to improve the robustness of face anti-spoofing (FAS) systems. However, there are still no works to explore the fundamental natures (e.g., modality-aware inputs, suitable multimodal pre-training, and efficient fi...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	International journal of computer vision Ročník 132; číslo 11; s. 5217 - 5238
Hlavní autoři:	Yu, Zitong, Cai, Rizhao, Cui, Yawen, Liu, Xin, Hu, Yongjian, Kot, Alex C.
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	New York Springer US 01.11.2024 Springer Springer Nature B.V
Témata:	Artificial Intelligence Classification Computer Imaging Computer Science Computer vision Computers Electric transformers Freezing Image Processing and Computer Vision Investigations Machine learning Pattern Recognition Pattern Recognition and Graphics Special Issue on Biometrics Security and Privacy Spoofing Vision Adaptive multimodal adapter Multimodal Masked autoencoder Face anti-spoofing
ISSN:	0920-5691, 1573-1405
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Buďte první, kdo okomentuje tento záznam!