Search Results - Proceedings of the IEEE/CVF Conference on Computer Vision AND Pattern Recognition
-
1
Subspace-PnP: A Geometric Constraint Loss for Mutual Assistance of Depth and Optical Flow Estimation
ISSN: 0920-5691, 1573-1405Published: New York Springer US 01.12.2022Published in International journal of computer vision (01.12.2022)“…, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4654–4665, 2020; Ranjan et al., in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12240…”
Get full text
Journal Article -
2
Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio- Visual Event Perception
ISSN: 1063-6919Published: IEEE 01.06.2023Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2023)“…With only video-level event labels, this paper targets at the task of weakly-supervised audio-visual event perception (WS-AVEP), which aims to temporally…”
Get full text
Conference Proceeding -
3
Bottleneck Transformers for Visual Recognition
ISSN: 1063-6919Published: IEEE 01.06.2021Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2021)“… By just replacing the spatial convolutions with global self-attention in the final three bottleneck blocks of a ResNet and no other changes, our approach improves upon the baselines significantly on…”
Get full text
Conference Proceeding -
4
Masked-attention Mask Transformer for Universal Image Segmentation
ISSN: 1063-6919Published: IEEE 01.06.2022Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)“… Each choice of semantics defines a task. While only the semantics of each task differ, current research focuses on designing spe-cialized architectures for each task…”
Get full text
Conference Proceeding -
5
GLIGEN: Open-Set Grounded Text-to-Image Generation
ISSN: 1063-6919Published: IEEE 01.06.2023Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2023)“… enabling them to also be conditioned on grounding inputs. To preserve the vast concept knowledge of the pre-trained model, we freeze all of its weights and inject the grounding information into new trainable layers via a gated mechanism…”
Get full text
Conference Proceeding -
6
Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection
ISSN: 1063-6919Published: IEEE 01.06.2021Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2021)“…), thus learning rich internal representations related to natural mouth motion. A temporal network is subsequently finetuned on fixed mouth embeddings of real and…”
Get full text
Conference Proceeding -
7
Deformable ConvNets V2: More Deformable, Better Results
ISSN: 1063-6919Published: IEEE 01.06.2019Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2019)“…The superior performance of Deformable Convolutional Networks arises from its ability to adapt to the geometric variations of objects…”
Get full text
Conference Proceeding -
8
Google Landmarks Dataset v2 - A Large-Scale Benchmark for Instance-Level Recognition and Retrieval
ISSN: 1063-6919Published: IEEE 01.06.2020Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2020)“…), a new benchmark for large-scale, fine-grained instance recognition and image retrieval in the domain of human-made and natural landmarks…”
Get full text
Conference Proceeding -
9
Dense Contrastive Learning for Self-Supervised Visual Pre-Training
ISSN: 1063-6919Published: IEEE 01.06.2021Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2021)“… To fill this gap, we aim to design an effective, dense self-supervised learning method that directly works at the level of pixels (or local features…”
Get full text
Conference Proceeding -
10
Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks
ISSN: 1063-6919Published: IEEE 01.06.2019Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2019)“… Due to the threat of adversarial attacks, many methods have been proposed to improve the robustness…”
Get full text
Conference Proceeding -
11
Deformable Siamese Attention Networks for Visual Object Tracking
ISSN: 1063-6919Published: IEEE 01.06.2020Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2020)“… However, the target template is not updated online, and the features of target template and search image are computed independently in a Siamese architecture…”
Get full text
Conference Proceeding -
12
MobileOne: An Improved One millisecond Mobile Backbone
ISSN: 1063-6919Published: IEEE 01.06.2023Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2023)“… However, these metrics may not correlate well with latency of the network when deployed on a mobile device…”
Get full text
Conference Proceeding -
13
Multi-class Token Transformer for Weakly Supervised Semantic Segmentation
ISSN: 1063-6919Published: IEEE 01.06.2022Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)“…). Inspired by the fact that the attended regions of the one-class token in the standard vision transformer can be leveraged to form a class-agnostic localization map, we investigate if the transformer…”
Get full text
Conference Proceeding -
14
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
ISSN: 1063-6919Published: IEEE 01.06.2022Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)“…Although vision transformers (ViTs) have achieved great success in computer vision, the heavy computational cost hampers their applications to dense prediction tasks such as semantic segmentation on mobile devices…”
Get full text
Conference Proceeding -
15
Causality Inspired Representation Learning for Domain Generalization
ISSN: 1063-6919Published: IEEE 01.06.2022Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)“…Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain…”
Get full text
Conference Proceeding -
16
Deep Snake for Real-Time Instance Segmentation
ISSN: 1063-6919Published: IEEE 01.06.2020Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2020)“… For structured feature learning on the contour, we propose to use circular convolution in deep snake, which better exploits the cycle-graph structure of a contour compared against generic graph convolution…”
Get full text
Conference Proceeding -
17
Token Contrast for Weakly-Supervised Semantic Segmentation
ISSN: 1063-6919Published: IEEE 01.06.2023Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2023)“…) to generate the pseudo labels. Limited by the local structure perception of CNN, CAM usually cannot identify the integral object regions…”
Get full text
Conference Proceeding -
18
Finding Task-Relevant Features for Few-Shot Learning by Category Traversal
ISSN: 1063-6919Published: IEEE 01.06.2019Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2019)“… Because of this, they are constrained to use a single set of features for all possible test-time tasks, which hinders the ability to distinguish the most relevant dimensions for the task at hand…”
Get full text
Conference Proceeding -
19
Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction
ISSN: 1063-6919Published: IEEE 01.06.2022Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)“… Our method is based on the observation that a good "initial guess" of the future poses is very helpful in improving the forecasting accuracy…”
Get full text
Conference Proceeding -
20
Wavelet Integrated CNNs for Noise-Robust Image Classification
ISSN: 1063-6919Published: IEEE 01.06.2020Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2020)“… The high-frequency components, containing most of the data noise, are dropped during inference to improve the noise-robustness of the WaveCNets…”
Get full text
Conference Proceeding