Search Results - Proceedings of the IEEE/CVF Conference on Computer Vision AND Pattern Recognition*
-
1
Subspace-PnP: A Geometric Constraint Loss for Mutual Assistance of Depth and Optical Flow Estimation
ISSN: 0920-5691, 1573-1405
Published: New York Springer US 01.12.2022
Published in International journal of computer vision (01.12.2022)
“…, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4654–4665, 2020; Ranjan et al., in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12240…”
Journal Article -
2
Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
ISSN: 1063-6919
Published: IEEE 01.06.2023
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2023)
“…With only video-level event labels, this paper targets at the task of weakly-supervised audio-visual event perception (WS-AVEP), which aims to temporally…”
Conference Proceeding -
3
Two-stream lightweight sign language transformer
ISSN: 0932-8092, 1432-1769
Published: Berlin/Heidelberg Springer Berlin Heidelberg 01.09.2022
Published in Machine vision and applications (01.09.2022)
“…Despite the recent progress of continuous sign language translation-based video, a variety of deep learning models are difficult to apply to the real-time…”
Journal Article -
4
MTPose: Human Pose Estimation with High-Resolution Multi-scale Transformers
ISSN: 1370-4621, 1573-773X
Published: New York Springer US 01.10.2022
Published in Neural processing letters (01.10.2022)
“…HRNet (High-Resolution Networks) as reported by Sun et al. (in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), 2019…”
Journal Article -
5
Staged encoder training for cross-camera person re-identification
ISSN: 1863-1703, 1863-1711
Published: London Springer London 01.04.2024
Published in Signal, image and video processing (01.04.2024)
“…As a cross-camera retrieval problem, person re-identification (ReID) suffers from image style variations caused by camera parameters, lighting and other reasons, which will seriously affect the model recognition accuracy…”
Journal Article -
6
Multi-cue SORT: integrating weak cues with appearance and motion for multi-object tracking
ISSN: 1573-0484, 0920-8542
Published: New York Springer Nature B.V 18.08.2025
Published in The Journal of supercomputing (18.08.2025)
“…The objective of multi-object tracking (MOT) is to accurately detect and track all objects within a continuous sequence while maintaining unique identifiers…”
Journal Article -
7
VizWiz Grand Challenge: Answering Visual Questions from Blind People
ISSN: 1063-6919
Published: IEEE 01.06.2018
Published in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (01.06.2018)
“…The study of algorithms to automatically answer visual questions currently is motivated by visual question answering (VQA…”
Conference Proceeding -
8
EV-Gait: Event-Based Robust Gait Recognition Using Dynamic Vision Sensors
ISSN: 1063-6919
Published: IEEE 01.06.2019
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2019)
“…In this paper, we introduce a new type of sensing modality, the Dynamic Vision Sensors (Event Cameras…”
Conference Proceeding -
9
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
ISSN: 1063-6919
Published: IEEE 01.06.2022
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…Built on top of self-attention mechanisms, vision transformers have demonstrated remarkable performance on a variety of tasks recently…”
Conference Proceeding -
10
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision
ISSN: 1063-6919
Published: IEEE 01.06.2023
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2023)
“…We present a novel bird's-eye-view (BEV) detector with perspective supervision, which converges faster and better suits modern image backbones…”
Conference Proceeding -
11
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
ISSN: 1063-6919
Published: IEEE 01.06.2022
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…While today's video recognition systems parse snapshots or short clips accurately, they cannot connect the dots and reason across a longer range of time yet…”
Conference Proceeding -
12
Exploring Self-Attention for Image Recognition
ISSN: 1063-6919
Published: IEEE 01.01.2020
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.01.2020)
“…Recent work has shown that self-attention can serve as a basic building block for image recognition models…”
Conference Proceeding -
13
X3D: Expanding Architectures for Efficient Video Recognition
ISSN: 1063-6919
Published: IEEE 01.01.2020
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.01.2020)
“…This paper presents X3D, a family of efficient video networks that progressively expand a tiny 2D image classification architecture along multiple network…”
Conference Proceeding -
14
Revisiting Skeleton-based Action Recognition
ISSN: 1063-6919
Published: IEEE 01.06.2022
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…Many skeleton-based action recognition methods adopt GCNs to extract features on top of human skeletons…”
Conference Proceeding -
15
Skeleton-Based Action Recognition With Shift Graph Convolutional Network
ISSN: 1063-6919
Published: IEEE 01.06.2020
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2020)
“…Action recognition with skeleton data is attracting more attention in computer vision…”
Conference Proceeding -
16
From Recognition to Cognition: Visual Commonsense Reasoning
ISSN: 1063-6919
Published: IEEE 01.06.2019
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2019)
“…Visual understanding goes well beyond object recognition. With one glance at an image, we can effortlessly imagine the world beyond the pixels…”
Conference Proceeding -
17
MagFace: A Universal Representation for Face Recognition and Quality Assessment
ISSN: 1063-6919
Published: IEEE 01.06.2021
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2021)
“…The performance of face recognition system degrades when the variability of the acquired faces increases…”
Conference Proceeding -
18
AdaFace: Quality Adaptive Margin for Face Recognition
ISSN: 1063-6919
Published: IEEE 01.06.2022
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…Recognition in low quality face datasets is challenging because facial attributes are obscured and degraded…”
Conference Proceeding -
19
Improving Calibration for Long-Tailed Recognition
ISSN: 1063-6919
Published: IEEE 01.06.2021
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2021)
“…Deep neural networks may perform poorly when training datasets are heavily class-imbalanced. Recently, two-stage methods decouple representation learning and…”
Conference Proceeding -
20
BBN: Bilateral-Branch Network With Cumulative Learning for Long-Tailed Visual Recognition
ISSN: 1063-6919
Published: IEEE 01.06.2020
Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2020)
“…Our work focuses on tackling the challenging but natural visual recognition task of long-tailed data distribution (i.e…”
Conference Proceeding