Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing

Recently, vision transformer (ViT) based multimodal learning methods have been proposed to improve the robustness of face anti-spoofing (FAS) systems. However, there are still no works to explore the fundamental natures (e.g., modality-aware inputs, suitable multimodal pre-training, and efficient fi...

Celý popis

Uložené v:

Podrobná bibliografia
Vydané v:	International journal of computer vision Ročník 132; číslo 11; s. 5217 - 5238
Hlavní autori:	Yu, Zitong, Cai, Rizhao, Cui, Yawen, Liu, Xin, Hu, Yongjian, Kot, Alex C.
Médium:	Journal Article
Jazyk:	English
Vydavateľské údaje:	New York Springer US 01.11.2024 Springer Springer Nature B.V
Predmet:	Artificial Intelligence Classification Computer Imaging Computer Science Computer vision Computers Electric transformers Freezing Image Processing and Computer Vision Investigations Machine learning Pattern Recognition Pattern Recognition and Graphics Special Issue on Biometrics Security and Privacy Spoofing Vision Adaptive multimodal adapter Multimodal Masked autoencoder Face anti-spoofing
ISSN:	0920-5691, 1573-1405
On-line prístup:	Získať plný text
Tagy:	Pridať tag Žiadne tagy, Buďte prvý, kto otaguje tento záznam!

Buďte prvý, kto okomentuje tento záznam!