Search Results - Proceedings of the IEEE/CVF Conference on Computer Vision AND Pattern Recognition

Refine Results
  1. 1

    Subspace-PnP: A Geometric Constraint Loss for Mutual Assistance of Depth and Optical Flow Estimation by Chi, Cheng, Hao, Tianyu, Wang, Qingjie, Guo, Peng, Yang, Xin

    ISSN: 0920-5691, 1573-1405
    Published: New York Springer US 01.12.2022
    Published in International journal of computer vision (01.12.2022)
    “…, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4654–4665, 2020; Ranjan et al., in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12240…”
    Get full text
    Journal Article
  2. 2

    Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio- Visual Event Perception by Gao, Junyu, Chen, Mengyuan, Xu, Changsheng

    ISSN: 1063-6919
    Published: IEEE 01.06.2023
    “…With only video-level event labels, this paper targets at the task of weakly-supervised audio-visual event perception (WS-AVEP), which aims to temporally…”
    Get full text
    Conference Proceeding
  3. 3

    Bottleneck Transformers for Visual Recognition by Srinivas, Aravind, Lin, Tsung-Yi, Parmar, Niki, Shlens, Jonathon, Abbeel, Pieter, Vaswani, Ashish

    ISSN: 1063-6919
    Published: IEEE 01.06.2021
    “… By just replacing the spatial convolutions with global self-attention in the final three bottleneck blocks of a ResNet and no other changes, our approach improves upon the baselines significantly on…”
    Get full text
    Conference Proceeding
  4. 4

    Masked-attention Mask Transformer for Universal Image Segmentation by Cheng, Bowen, Misra, Ishan, Schwing, Alexander G., Kirillov, Alexander, Girdhar, Rohit

    ISSN: 1063-6919
    Published: IEEE 01.06.2022
    “… Each choice of semantics defines a task. While only the semantics of each task differ, current research focuses on designing spe-cialized architectures for each task…”
    Get full text
    Conference Proceeding
  5. 5

    GLIGEN: Open-Set Grounded Text-to-Image Generation by Li, Yuheng, Liu, Haotian, Wu, Qingyang, Mu, Fangzhou, Yang, Jianwei, Gao, Jianfeng, Li, Chunyuan, Lee, Yong Jae

    ISSN: 1063-6919
    Published: IEEE 01.06.2023
    “… enabling them to also be conditioned on grounding inputs. To preserve the vast concept knowledge of the pre-trained model, we freeze all of its weights and inject the grounding information into new trainable layers via a gated mechanism…”
    Get full text
    Conference Proceeding
  6. 6

    Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection by Haliassos, Alexandros, Vougioukas, Konstantinos, Petridis, Stavros, Pantic, Maja

    ISSN: 1063-6919
    Published: IEEE 01.06.2021
    “…), thus learning rich internal representations related to natural mouth motion. A temporal network is subsequently finetuned on fixed mouth embeddings of real and…”
    Get full text
    Conference Proceeding
  7. 7

    Deformable ConvNets V2: More Deformable, Better Results by Zhu, Xizhou, Hu, Han, Lin, Stephen, Dai, Jifeng

    ISSN: 1063-6919
    Published: IEEE 01.06.2019
    “…The superior performance of Deformable Convolutional Networks arises from its ability to adapt to the geometric variations of objects…”
    Get full text
    Conference Proceeding
  8. 8

    Google Landmarks Dataset v2 - A Large-Scale Benchmark for Instance-Level Recognition and Retrieval by Weyand, Tobias, Araujo, Andre, Cao, Bingyi, Sim, Jack

    ISSN: 1063-6919
    Published: IEEE 01.06.2020
    “…), a new benchmark for large-scale, fine-grained instance recognition and image retrieval in the domain of human-made and natural landmarks…”
    Get full text
    Conference Proceeding
  9. 9

    Dense Contrastive Learning for Self-Supervised Visual Pre-Training by Wang, Xinlong, Zhang, Rufeng, Shen, Chunhua, Kong, Tao, Li, Lei

    ISSN: 1063-6919
    Published: IEEE 01.06.2021
    “… To fill this gap, we aim to design an effective, dense self-supervised learning method that directly works at the level of pixels (or local features…”
    Get full text
    Conference Proceeding
  10. 10

    Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks by Dong, Yinpeng, Pang, Tianyu, Su, Hang, Zhu, Jun

    ISSN: 1063-6919
    Published: IEEE 01.06.2019
    “… Due to the threat of adversarial attacks, many methods have been proposed to improve the robustness…”
    Get full text
    Conference Proceeding
  11. 11

    Deformable Siamese Attention Networks for Visual Object Tracking by Yu, Yuechen, Xiong, Yilei, Huang, Weilin, Scott, Matthew R.

    ISSN: 1063-6919
    Published: IEEE 01.06.2020
    “… However, the target template is not updated online, and the features of target template and search image are computed independently in a Siamese architecture…”
    Get full text
    Conference Proceeding
  12. 12

    MobileOne: An Improved One millisecond Mobile Backbone by Vasu, Pavan Kumar Anasosalu, Gabriel, James, Zhu, Jeff, Tuzel, Oncel, Ranjan, Anurag

    ISSN: 1063-6919
    Published: IEEE 01.06.2023
    “… However, these metrics may not correlate well with latency of the network when deployed on a mobile device…”
    Get full text
    Conference Proceeding
  13. 13

    Multi-class Token Transformer for Weakly Supervised Semantic Segmentation by Xu, Lian, Ouyang, Wanli, Bennamoun, Mohammed, Boussaid, Farid, Xu, Dan

    ISSN: 1063-6919
    Published: IEEE 01.06.2022
    “…). Inspired by the fact that the attended regions of the one-class token in the standard vision transformer can be leveraged to form a class-agnostic localization map, we investigate if the transformer…”
    Get full text
    Conference Proceeding
  14. 14

    TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation by Zhang, Wenqiang, Huang, Zilong, Luo, Guozhong, Chen, Tao, Wang, Xinggang, Liu, Wenyu, Yu, Gang, Shen, Chunhua

    ISSN: 1063-6919
    Published: IEEE 01.06.2022
    “…Although vision transformers (ViTs) have achieved great success in computer vision, the heavy computational cost hampers their applications to dense prediction tasks such as semantic segmentation on mobile devices…”
    Get full text
    Conference Proceeding
  15. 15

    Causality Inspired Representation Learning for Domain Generalization by Lv, Fangrui, Liang, Jian, Li, Shuang, Zang, Bin, Liu, Chi Harold, Wang, Ziteng, Liu, Di

    ISSN: 1063-6919
    Published: IEEE 01.06.2022
    “…Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain…”
    Get full text
    Conference Proceeding
  16. 16

    Deep Snake for Real-Time Instance Segmentation by Peng, Sida, Jiang, Wen, Pi, Huaijin, Li, Xiuli, Bao, Hujun, Zhou, Xiaowei

    ISSN: 1063-6919
    Published: IEEE 01.06.2020
    “… For structured feature learning on the contour, we propose to use circular convolution in deep snake, which better exploits the cycle-graph structure of a contour compared against generic graph convolution…”
    Get full text
    Conference Proceeding
  17. 17

    Token Contrast for Weakly-Supervised Semantic Segmentation by Ru, Lixiang, Zheng, Heliang, Zhan, Yibing, Du, Bo

    ISSN: 1063-6919
    Published: IEEE 01.06.2023
    “…) to generate the pseudo labels. Limited by the local structure perception of CNN, CAM usually cannot identify the integral object regions…”
    Get full text
    Conference Proceeding
  18. 18

    Finding Task-Relevant Features for Few-Shot Learning by Category Traversal by Li, Hongyang, Eigen, David, Dodge, Samuel, Zeiler, Matthew, Wang, Xiaogang

    ISSN: 1063-6919
    Published: IEEE 01.06.2019
    “… Because of this, they are constrained to use a single set of features for all possible test-time tasks, which hinders the ability to distinguish the most relevant dimensions for the task at hand…”
    Get full text
    Conference Proceeding
  19. 19

    Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction by Ma, Tiezheng, Nie, Yongwei, Long, Chengjiang, Zhang, Qing, Li, Guiqing

    ISSN: 1063-6919
    Published: IEEE 01.06.2022
    “… Our method is based on the observation that a good "initial guess" of the future poses is very helpful in improving the forecasting accuracy…”
    Get full text
    Conference Proceeding
  20. 20

    Wavelet Integrated CNNs for Noise-Robust Image Classification by Li, Qiufu, Shen, Linlin, Guo, Sheng, Lai, Zhihui

    ISSN: 1063-6919
    Published: IEEE 01.06.2020
    “… The high-frequency components, containing most of the data noise, are dropped during inference to improve the noise-robustness of the WaveCNets…”
    Get full text
    Conference Proceeding