Search Results - Proceedings of the IEEE/CVF Conference on Computer Vision AND Pattern Recognition*

1

Subspace-PnP: A Geometric Constraint Loss for Mutual Assistance of Depth and Optical Flow Estimation by Chi, Cheng, Hao, Tianyu, Wang, Qingjie, Guo, Peng, Yang, Xin

ISSN: 0920-5691, 1573-1405

Published: New York Springer US 01.12.2022

Published in International journal of computer vision (01.12.2022)
“…, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4654–4665, 2020; Ranjan et al., in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12240…”

Journal Article

2

Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception by Gao, Junyu, Chen, Mengyuan, Xu, Changsheng

ISSN: 1063-6919

Published: IEEE 01.06.2023

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2023)
“…With only video-level event labels, this paper targets at the task of weakly-supervised audio-visual event perception (WS-AVEP), which aims to temporally…”

Conference Proceeding

3

Two-stream lightweight sign language transformer by Chen, Yuming, Mei, Xue, Qin, Xuan

ISSN: 0932-8092, 1432-1769

Published: Berlin/Heidelberg Springer Berlin Heidelberg 01.09.2022

Published in Machine vision and applications (01.09.2022)
“…Despite the recent progress of continuous sign language translation-based video, a variety of deep learning models are difficult to apply to the real-time…”

Journal Article

4

MTPose: Human Pose Estimation with High-Resolution Multi-scale Transformers by Wang, Rui, Geng, Fudi, Wang, Xiangyang

ISSN: 1370-4621, 1573-773X

Published: New York Springer US 01.10.2022

Published in Neural processing letters (01.10.2022)
“…HRNet (High-Resolution Networks) as reported by Sun et al. (in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), 2019…”

Journal Article

5

Staged encoder training for cross-camera person re-identification by Xu, Zhi, Yang, Jiawei, Liu, Yuxuan, Zhao, Longyang, Liu, Jiajia

ISSN: 1863-1703, 1863-1711

Published: London Springer London 01.04.2024

Published in Signal, image and video processing (01.04.2024)
“…As a cross-camera retrieval problem, person re-identification (ReID) suffers from image style variations caused by camera parameters, lighting and other reasons, which will seriously affect the model recognition accuracy…”

Journal Article

6

Multi-cue SORT: integrating weak cues with appearance and motion for multi-object tracking by Liang, Hong, Xu, Mingchen, Zhang, Qian, Shao, Mingwen

ISSN: 0920-8542, 1573-0484

Published: New York Springer Nature B.V 18.08.2025

Published in The Journal of supercomputing (18.08.2025)
“…The objective of multi-object tracking (MOT) is to accurately detect and track all objects within a continuous sequence while maintaining unique identifiers…”

Journal Article

7

VizWiz Grand Challenge: Answering Visual Questions from Blind People by Gurari, Danna, Li, Qing, Stangl, Abigale J., Guo, Anhong, Lin, Chi, Grauman, Kristen, Luo, Jiebo, Bigham, Jeffrey P.

ISSN: 1063-6919

Published: IEEE 01.06.2018

Published in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (01.06.2018)
“…The study of algorithms to automatically answer visual questions currently is motivated by visual question answering (VQA…”

Conference Proceeding

8

EV-Gait: Event-Based Robust Gait Recognition Using Dynamic Vision Sensors by Wang, Yanxiang, Du, Bowen, Shen, Yiran, Wu, Kai, Zhao, Guangrong, Sun, Jianguo, Wen, Hongkai

ISSN: 1063-6919

Published: IEEE 01.06.2019

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2019)
“…In this paper, we introduce a new type of sensing modality, the Dynamic Vision Sensors (Event Cameras…”

Conference Proceeding

9

AdaViT: Adaptive Vision Transformers for Efficient Image Recognition by Meng, Lingchen, Li, Hengduo, Chen, Bor-Chun, Lan, Shiyi, Wu, Zuxuan, Jiang, Yu-Gang, Lim, Ser-Nam

ISSN: 1063-6919

Published: IEEE 01.06.2022

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…Built on top of self-attention mechanisms, vision transformers have demonstrated remarkable performance on a variety of tasks recently…”

Conference Proceeding

10

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision by Yang, Chenyu, Chen, Yuntao, Tian, Hao, Tao, Chenxin, Zhu, Xizhou, Zhang, Zhaoxiang, Huang, Gao, Li, Hongyang, Qiao, Yu, Lu, Lewei, Zhou, Jie, Dai, Jifeng

ISSN: 1063-6919

Published: IEEE 01.06.2023

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2023)
“…We present a novel bird's-eye-view (BEV) detector with perspective supervision, which converges faster and better suits modern image backbones…”

Conference Proceeding

11

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition by Wu, Chao-Yuan, Li, Yanghao, Mangalam, Karttikeya, Fan, Haoqi, Xiong, Bo, Malik, Jitendra, Feichtenhofer, Christoph

ISSN: 1063-6919

Published: IEEE 01.06.2022

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…While today's video recognition systems parse snapshots or short clips accurately, they cannot connect the dots and reason across a longer range of time yet…”

Conference Proceeding

12

Exploring Self-Attention for Image Recognition by Zhao, Hengshuang, Jia, Jiaya, Koltun, Vladlen

ISSN: 1063-6919

Published: IEEE 01.01.2020

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.01.2020)
“…Recent work has shown that self-attention can serve as a basic building block for image recognition models…”

Conference Proceeding

13

X3D: Expanding Architectures for Efficient Video Recognition by Feichtenhofer, Christoph

ISSN: 1063-6919

Published: IEEE 01.01.2020

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.01.2020)
“…This paper presents X3D, a family of efficient video networks that progressively expand a tiny 2D image classification architecture along multiple network…”

Conference Proceeding

14

Revisiting Skeleton-based Action Recognition by Duan, Haodong, Zhao, Yue, Chen, Kai, Lin, Dahua, Dai, Bo

ISSN: 1063-6919

Published: IEEE 01.06.2022

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…Many skeleton-based action recognition methods adopt GCNs to extract features on top of human skeletons…”

Conference Proceeding

15

Skeleton-Based Action Recognition With Shift Graph Convolutional Network by Cheng, Ke, Zhang, Yifan, He, Xiangyu, Chen, Weihan, Cheng, Jian, Lu, Hanqing

ISSN: 1063-6919

Published: IEEE 01.06.2020

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2020)
“…Action recognition with skeleton data is attracting more attention in computer vision…”

Conference Proceeding

16

From Recognition to Cognition: Visual Commonsense Reasoning by Zellers, Rowan, Bisk, Yonatan, Farhadi, Ali, Choi, Yejin

ISSN: 1063-6919

Published: IEEE 01.06.2019

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2019)
“…Visual understanding goes well beyond object recognition. With one glance at an image, we can effortlessly imagine the world beyond the pixels…”

Conference Proceeding

17

MagFace: A Universal Representation for Face Recognition and Quality Assessment by Meng, Qiang, Zhao, Shichao, Huang, Zhida, Zhou, Feng

ISSN: 1063-6919

Published: IEEE 01.06.2021

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2021)
“…The performance of face recognition system degrades when the variability of the acquired faces increases…”

Conference Proceeding

18

AdaFace: Quality Adaptive Margin for Face Recognition by Kim, Minchul, Jain, Anil K., Liu, Xiaoming

ISSN: 1063-6919

Published: IEEE 01.06.2022

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2022)
“…Recognition in low quality face datasets is challenging because facial attributes are obscured and degraded…”

Conference Proceeding

19

Improving Calibration for Long-Tailed Recognition by Zhong, Zhisheng, Cui, Jiequan, Liu, Shu, Jia, Jiaya

ISSN: 1063-6919

Published: IEEE 01.06.2021

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2021)
“…Deep neural networks may perform poorly when training datasets are heavily class-imbalanced. Recently, two-stage methods decouple representation learning and…”

Conference Proceeding

20

BBN: Bilateral-Branch Network With Cumulative Learning for Long-Tailed Visual Recognition by Zhou, Boyan, Cui, Quan, Wei, Xiu-Shen, Chen, Zhao-Min

ISSN: 1063-6919

Published: IEEE 01.06.2020

Published in Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) (01.06.2020)
“…Our work focuses on tackling the challenging but natural visual recognition task of long-tailed data distribution (i.e…”

Conference Proceeding
