Výsledky vyhledávání - "IEEE Transactions on Multimedia"

Upřesnit hledání
  1. 1

    StrongSORT: Make DeepSORT Great Again Autor Du, Yunhao, Zhao, Zhicheng, Song, Yang, Zhao, Yanyun, Su, Fei, Gong, Tao, Meng, Hongying

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 01.01.2023
    Vydáno v IEEE transactions on multimedia (01.01.2023)
    “…Recently, Multi-Object Tracking (MOT) has attracted rising attention, and accordingly, remarkable progresses have been achieved. However, the existing methods…”
    Získat plný text
    Journal Article
  2. 2

    EAPT: Efficient Attention Pyramid Transformer for Image Processing Autor Lin, Xiao, Sun, Shuzhou, Huang, Wei, Sheng, Bin, Li, Ping, Feng, David Dagan

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2023
    “…Recent transformer-based models, especially patch-based methods, have shown huge potentiality in vision tasks. However, the split fixed-size patches divide the…”
    Získat plný text
    Journal Article
  3. 3

    Deep Learning for Single Image Super-Resolution: A Brief Review Autor Yang, Wenming, Zhang, Xuechen, Tian, Yapeng, Wang, Wei, Xue, Jing-Hao, Liao, Qingmin

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 01.12.2019
    Vydáno v IEEE transactions on multimedia (01.12.2019)
    “…Single image super-resolution (SISR) is a notoriously challenging ill-posed problem that aims to obtain a high-resolution output from one of its low-resolution…”
    Získat plný text
    Journal Article
  4. 4

    A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification Autor Luo, Hao, Jiang, Wei, Gu, Youzhi, Liu, Fuxu, Liao, Xingyu, Lai, Shenqi, Gu, Jianyang

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 01.10.2020
    Vydáno v IEEE transactions on multimedia (01.10.2020)
    “…This study proposes a simple but strong baseline for deep person re-identification (ReID). Deep person ReID has achieved great progress and high performance in…”
    Získat plný text
    Journal Article
  5. 5

    DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition Autor Jiao, Jiayu, Tang, Yu-Ming, Lin, Kun-Yu, Gao, Yipeng, Ma, Jinhua, Wang, Yaowei, Zheng, Wei-Shi

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 01.01.2023
    Vydáno v IEEE transactions on multimedia (01.01.2023)
    “…As a de facto solution, the vanilla Vision Transformers (ViTs) are encouraged to model long-range dependencies between arbitrary image patches while the global…”
    Získat plný text
    Journal Article
  6. 6

    Extended Feature Pyramid Network for Small Object Detection Autor Deng, Chunfang, Wang, Mengmeng, Liu, Liang, Liu, Yong, Jiang, Yunliang

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2022
    “…Small object detection remains an unsolved challenge because it is hard to extract the information of small objects with only a few pixels. While scale-level…”
    Získat plný text
    Journal Article
  7. 7

    Arbitrary-Oriented Scene Text Detection via Rotation Proposals Autor Ma, Jianqi, Shao, Weiyuan, Ye, Hao, Wang, Li, Wang, Hong, Zheng, Yingbin, Xue, Xiangyang

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 01.11.2018
    Vydáno v IEEE transactions on multimedia (01.11.2018)
    “…This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal…”
    Získat plný text
    Journal Article
  8. 8

    YDTR: Infrared and Visible Image Fusion via Y-Shape Dynamic Transformer Autor Tang, Wei, He, Fazhi, Liu, Yu

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2023
    “…Infrared and visible image fusion is aims to generate a composite image that can simultaneously describe the salient target in the infrared image and texture…”
    Získat plný text
    Journal Article
  9. 9

    Multi-View Multi-Label Learning With Sparse Feature Selection for Image Annotation Autor Zhang, Yongshan, Wu, Jia, Cai, Zhihua, Yu, Philip S.

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 01.11.2020
    Vydáno v IEEE transactions on multimedia (01.11.2020)
    “…In image analysis, image samples are always represented by multiple view features and associated with multiple class labels for better interpretation. However,…”
    Získat plný text
    Journal Article
  10. 10

    Image-to-Image Translation: Methods and Applications Autor Pang, Yingxue, Lin, Jianxin, Qin, Tao, Chen, Zhibo

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2022
    “…Image-to-image translation (I2I) aims to transfer images from a source domain to a target domain while preserving the content representations. I2I has drawn…”
    Získat plný text
    Journal Article
  11. 11

    Geometric Back-Projection Network for Point Cloud Classification Autor Qiu, Shi, Anwar, Saeed, Barnes, Nick

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2022
    “…As the basic task of point cloud analysis, classification is fundamental but always challenging. To address some unsolved problems of existing methods, we…”
    Získat plný text
    Journal Article
  12. 12

    Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation Autor Li, Wenhao, Liu, Hong, Ding, Runwei, Liu, Mengyuan, Wang, Pichao, Yang, Wenming

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2023
    “…Despite the great progress in 3D human pose estimation from videos, it is still an open problem to take full advantage of a redundant 2D pose sequence to learn…”
    Získat plný text
    Journal Article
  13. 13

    Consensus Graph Learning for Multi-View Clustering Autor Li, Zhenglai, Tang, Chang, Liu, Xinwang, Zheng, Xiao, Zhang, Wei, Zhu, En

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2022
    “…Multi-view clustering, which exploits the multi-view information to partition data into their clusters, has attracted intense attention. However, most existing…”
    Získat plný text
    Journal Article
  14. 14

    AttentionFGAN: Infrared and Visible Image Fusion Using Attention-Based Generative Adversarial Networks Autor Li, Jing, Huo, Hongtao, Li, Chang, Wang, Renhua, Feng, Qi

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2021
    “…Infrared and visible image fusion aims to describe the same scene from different aspects by combining complementary information of multi-modality images. The…”
    Získat plný text
    Journal Article
  15. 15

    Low-Light Image Enhancement With Semi-Decoupled Decomposition Autor Hao, Shijie, Han, Xu, Guo, Yanrong, Xu, Xin, Wang, Meng

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 01.12.2020
    Vydáno v IEEE transactions on multimedia (01.12.2020)
    “…Low-light image enhancement is important for high-quality image display and other visual applications. However, it is a challenging task as the enhancement is…”
    Získat plný text
    Journal Article
  16. 16

    MFDNet: Collaborative Poses Perception and Matrix Fisher Distribution for Head Pose Estimation Autor Liu, Hai, Fang, Shuai, Zhang, Zhaoli, Li, Duantengchuan, Lin, Ke, Wang, Jiazhang

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2022
    “…Head pose estimation suffers from several problems, including low pose tolerance under different disturbances and ambiguity arising from common head pose…”
    Získat plný text
    Journal Article
  17. 17

    Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification Autor Jia, Mengxi, Cheng, Xinhua, Lu, Shijian, Zhang, Jian

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2023
    “…Person re-IDentification (re-ID) under various occlusions has been a long-standing challenge as person images with different types of occlusions often suffer…”
    Získat plný text
    Journal Article
  18. 18

    DualGNN: Dual Graph Neural Network for Multimedia Recommendation Autor Wang, Qifan, Wei, Yinwei, Yin, Jianhua, Wu, Jianlong, Song, Xuemeng, Nie, Liqiang

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 2023
    “…One of the important factors affecting micro-video recommender systems is to model the multi-modal user preference on the micro-video. Despite the remarkable…”
    Získat plný text
    Journal Article
  19. 19

    Scale-Aware Fast R-CNN for Pedestrian Detection Autor Li, Jianan, Liang, Xiaodan, Shen, Shengmei, Xu, Tingfa, Feng, Jiashi, Yan, Shuicheng

    ISSN: 1520-9210, 1941-0077
    Vydáno: IEEE 01.04.2018
    Vydáno v IEEE transactions on multimedia (01.04.2018)
    “…In this paper, we consider the problem of pedestrian detection in natural scenes. Intuitively, instances of pedestrians with different spatial scales may…”
    Získat plný text
    Journal Article
  20. 20

    STAT: Spatial-Temporal Attention Mechanism for Video Captioning Autor Yan, Chenggang, Tu, Yunbin, Wang, Xingzheng, Zhang, Yongbing, Hao, Xinhong, Zhang, Yongdong, Dai, Qionghai

    ISSN: 1520-9210, 1941-0077
    Vydáno: Piscataway IEEE 01.01.2020
    Vydáno v IEEE transactions on multimedia (01.01.2020)
    “…Video captioning refers to automatic generate natural language sentences, which summarize the video contents. Inspired by the visual attention mechanism of…”
    Získat plný text
    Journal Article