Výsledky vyhledávání - "IEEE Transactions on Multimedia"

1

Načítá se…

StrongSORT: Make DeepSORT Great Again Autor Du, Yunhao, Zhao, Zhicheng, Song, Yang, Zhao, Yanyun, Su, Fei, Gong, Tao, Meng, Hongying

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 01.01.2023

Vydáno v IEEE transactions on multimedia (01.01.2023)
“…Recently, Multi-Object Tracking (MOT) has attracted rising attention, and accordingly, remarkable progresses have been achieved. However, the existing methods…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
2

Načítá se…

EAPT: Efficient Attention Pyramid Transformer for Image Processing Autor Lin, Xiao, Sun, Shuzhou, Huang, Wei, Sheng, Bin, Li, Ping, Feng, David Dagan

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2023

Vydáno v IEEE transactions on multimedia (2023)
“…Recent transformer-based models, especially patch-based methods, have shown huge potentiality in vision tasks. However, the split fixed-size patches divide the…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
3

Načítá se…

Deep Learning for Single Image Super-Resolution: A Brief Review Autor Yang, Wenming, Zhang, Xuechen, Tian, Yapeng, Wang, Wei, Xue, Jing-Hao, Liao, Qingmin

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 01.12.2019

Vydáno v IEEE transactions on multimedia (01.12.2019)
“…Single image super-resolution (SISR) is a notoriously challenging ill-posed problem that aims to obtain a high-resolution output from one of its low-resolution…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
4

Načítá se…

A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification Autor Luo, Hao, Jiang, Wei, Gu, Youzhi, Liu, Fuxu, Liao, Xingyu, Lai, Shenqi, Gu, Jianyang

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 01.10.2020

Vydáno v IEEE transactions on multimedia (01.10.2020)
“…This study proposes a simple but strong baseline for deep person re-identification (ReID). Deep person ReID has achieved great progress and high performance in…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
5

Načítá se…

DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition Autor Jiao, Jiayu, Tang, Yu-Ming, Lin, Kun-Yu, Gao, Yipeng, Ma, Jinhua, Wang, Yaowei, Zheng, Wei-Shi

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 01.01.2023

Vydáno v IEEE transactions on multimedia (01.01.2023)
“…As a de facto solution, the vanilla Vision Transformers (ViTs) are encouraged to model long-range dependencies between arbitrary image patches while the global…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
6

Načítá se…

Extended Feature Pyramid Network for Small Object Detection Autor Deng, Chunfang, Wang, Mengmeng, Liu, Liang, Liu, Yong, Jiang, Yunliang

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2022

Vydáno v IEEE transactions on multimedia (2022)
“…Small object detection remains an unsolved challenge because it is hard to extract the information of small objects with only a few pixels. While scale-level…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
7

Načítá se…

Arbitrary-Oriented Scene Text Detection via Rotation Proposals Autor Ma, Jianqi, Shao, Weiyuan, Ye, Hao, Wang, Li, Wang, Hong, Zheng, Yingbin, Xue, Xiangyang

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 01.11.2018

Vydáno v IEEE transactions on multimedia (01.11.2018)
“…This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
8

Načítá se…

YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer Autor Tang, Wei, He, Fazhi, Liu, Yu

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2023

Vydáno v IEEE transactions on multimedia (2023)
“…Infrared and visible image fusion is aims to generate a composite image that can simultaneously describe the salient target in the infrared image and texture…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
9

Načítá se…

Multi-View Multi-Label Learning With Sparse Feature Selection for Image Annotation Autor Zhang, Yongshan, Wu, Jia, Cai, Zhihua, Yu, Philip S.

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 01.11.2020

Vydáno v IEEE transactions on multimedia (01.11.2020)
“…In image analysis, image samples are always represented by multiple view features and associated with multiple class labels for better interpretation. However,…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
10

Načítá se…

Image-to-Image Translation: Methods and Applications Autor Pang, Yingxue, Lin, Jianxin, Qin, Tao, Chen, Zhibo

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2022

Vydáno v IEEE transactions on multimedia (2022)
“…Image-to-image translation (I2I) aims to transfer images from a source domain to a target domain while preserving the content representations. I2I has drawn…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
11

Načítá se…

Geometric Back-Projection Network for Point Cloud Classification Autor Qiu, Shi, Anwar, Saeed, Barnes, Nick

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2022

Vydáno v IEEE transactions on multimedia (2022)
“…As the basic task of point cloud analysis, classification is fundamental but always challenging. To address some unsolved problems of existing methods, we…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
12

Načítá se…

Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation Autor Li, Wenhao, Liu, Hong, Ding, Runwei, Liu, Mengyuan, Wang, Pichao, Yang, Wenming

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2023

Vydáno v IEEE transactions on multimedia (2023)
“…Despite the great progress in 3D human pose estimation from videos, it is still an open problem to take full advantage of a redundant 2D pose sequence to learn…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
13

Načítá se…

Consensus Graph Learning for Multi-View Clustering Autor Li, Zhenglai, Tang, Chang, Liu, Xinwang, Zheng, Xiao, Zhang, Wei, Zhu, En

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2022

Vydáno v IEEE transactions on multimedia (2022)
“…Multi-view clustering, which exploits the multi-view information to partition data into their clusters, has attracted intense attention. However, most existing…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
14

Načítá se…

AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks Autor Li, Jing, Huo, Hongtao, Li, Chang, Wang, Renhua, Feng, Qi

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2021

Vydáno v IEEE transactions on multimedia (2021)
“…Infrared and visible image fusion aims to describe the same scene from different aspects by combining complementary information of multi-modality images. The…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
15

Načítá se…

Low-Light Image Enhancement With Semi-Decoupled Decomposition Autor Hao, Shijie, Han, Xu, Guo, Yanrong, Xu, Xin, Wang, Meng

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 01.12.2020

Vydáno v IEEE transactions on multimedia (01.12.2020)
“…Low-light image enhancement is important for high-quality image display and other visual applications. However, it is a challenging task as the enhancement is…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
16

Načítá se…

MFDNet: Collaborative Poses Perception and Matrix Fisher Distribution for Head Pose Estimation Autor Liu, Hai, Fang, Shuai, Zhang, Zhaoli, Li, Duantengchuan, Lin, Ke, Wang, Jiazhang

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2022

Vydáno v IEEE transactions on multimedia (2022)
“…Head pose estimation suffers from several problems, including low pose tolerance under different disturbances and ambiguity arising from common head pose…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
17

Načítá se…

Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification Autor Jia, Mengxi, Cheng, Xinhua, Lu, Shijian, Zhang, Jian

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2023

Vydáno v IEEE transactions on multimedia (2023)
“…Person re-IDentification (re-ID) under various occlusions has been a long-standing challenge as person images with different types of occlusions often suffer…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
18

Načítá se…

DualGNN: Dual Graph Neural Network for Multimedia Recommendation Autor Wang, Qifan, Wei, Yinwei, Yin, Jianhua, Wu, Jianlong, Song, Xuemeng, Nie, Liqiang

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 2023

Vydáno v IEEE transactions on multimedia (2023)
“…One of the important factors affecting micro-video recommender systems is to model the multi-modal user preference on the micro-video. Despite the remarkable…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
19

Načítá se…

Scale-Aware Fast R-CNN for Pedestrian Detection Autor Li, Jianan, Liang, Xiaodan, Shen, Shengmei, Xu, Tingfa, Feng, Jiashi, Yan, Shuicheng

ISSN: 1520-9210, 1941-0077

Vydáno: IEEE 01.04.2018

Vydáno v IEEE transactions on multimedia (01.04.2018)
“…In this paper, we consider the problem of pedestrian detection in natural scenes. Intuitively, instances of pedestrians with different spatial scales may…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:
20

Načítá se…

STAT: Spatial-Temporal Attention Mechanism for Video Captioning Autor Yan, Chenggang, Tu, Yunbin, Wang, Xingzheng, Zhang, Yongbing, Hao, Xinhong, Zhang, Yongdong, Dai, Qionghai

ISSN: 1520-9210, 1941-0077

Vydáno: Piscataway IEEE 01.01.2020

Vydáno v IEEE transactions on multimedia (01.01.2020)
“…Video captioning refers to automatic generate natural language sentences, which summarize the video contents. Inspired by the visual attention mechanism of…”

Získat plný text

Journal Article

Přidat do oblíbených

Uloženo v:

Výsledky vyhledávání - "IEEE Transactions on Multimedia"

StrongSORT: Make DeepSORT Great Again Autor Du, Yunhao, Zhao, Zhicheng, Song, Yang, Zhao, Yanyun, Su, Fei, Gong, Tao, Meng, Hongying

EAPT: Efficient Attention Pyramid Transformer for Image Processing Autor Lin, Xiao, Sun, Shuzhou, Huang, Wei, Sheng, Bin, Li, Ping, Feng, David Dagan

Deep Learning for Single Image Super-Resolution: A Brief Review Autor Yang, Wenming, Zhang, Xuechen, Tian, Yapeng, Wang, Wei, Xue, Jing-Hao, Liao, Qingmin

A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification Autor Luo, Hao, Jiang, Wei, Gu, Youzhi, Liu, Fuxu, Liao, Xingyu, Lai, Shenqi, Gu, Jianyang

DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition Autor Jiao, Jiayu, Tang, Yu-Ming, Lin, Kun-Yu, Gao, Yipeng, Ma, Jinhua, Wang, Yaowei, Zheng, Wei-Shi

Extended Feature Pyramid Network for Small Object Detection Autor Deng, Chunfang, Wang, Mengmeng, Liu, Liang, Liu, Yong, Jiang, Yunliang

Arbitrary-Oriented Scene Text Detection via Rotation Proposals Autor Ma, Jianqi, Shao, Weiyuan, Ye, Hao, Wang, Li, Wang, Hong, Zheng, Yingbin, Xue, Xiangyang

YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer Autor Tang, Wei, He, Fazhi, Liu, Yu

Multi-View Multi-Label Learning With Sparse Feature Selection for Image Annotation Autor Zhang, Yongshan, Wu, Jia, Cai, Zhihua, Yu, Philip S.

Image-to-Image Translation: Methods and Applications Autor Pang, Yingxue, Lin, Jianxin, Qin, Tao, Chen, Zhibo

Geometric Back-Projection Network for Point Cloud Classification Autor Qiu, Shi, Anwar, Saeed, Barnes, Nick

Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation Autor Li, Wenhao, Liu, Hong, Ding, Runwei, Liu, Mengyuan, Wang, Pichao, Yang, Wenming

Consensus Graph Learning for Multi-View Clustering Autor Li, Zhenglai, Tang, Chang, Liu, Xinwang, Zheng, Xiao, Zhang, Wei, Zhu, En

AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks Autor Li, Jing, Huo, Hongtao, Li, Chang, Wang, Renhua, Feng, Qi

Low-Light Image Enhancement With Semi-Decoupled Decomposition Autor Hao, Shijie, Han, Xu, Guo, Yanrong, Xu, Xin, Wang, Meng

MFDNet: Collaborative Poses Perception and Matrix Fisher Distribution for Head Pose Estimation Autor Liu, Hai, Fang, Shuai, Zhang, Zhaoli, Li, Duantengchuan, Lin, Ke, Wang, Jiazhang

Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification Autor Jia, Mengxi, Cheng, Xinhua, Lu, Shijian, Zhang, Jian

DualGNN: Dual Graph Neural Network for Multimedia Recommendation Autor Wang, Qifan, Wei, Yinwei, Yin, Jianhua, Wu, Jianlong, Song, Xuemeng, Nie, Liqiang

Scale-Aware Fast R-CNN for Pedestrian Detection Autor Li, Jianan, Liang, Xiaodan, Shen, Shengmei, Xu, Tingfa, Feng, Jiashi, Yan, Shuicheng

STAT: Spatial-Temporal Attention Mechanism for Video Captioning Autor Yan, Chenggang, Tu, Yunbin, Wang, Xingzheng, Zhang, Yongbing, Hao, Xinhong, Zhang, Yongdong, Dai, Qionghai

Vyhledávací nástroje:

Upřesnit hledání

Médium

Předmětová oblast

Téma

Jazyk

Rok vydání