SMART: Supervised multi-class image retargeting generative model based on a long-range sampling strategy

Bibliographic details
Published in: Digital Signal Processing, vol. 154, p. 104659
Main authors: Cui, Jia; Jiang, Hao; Qi, Meng; Gu, Zhenyu; Lu, Hongju
Format: Journal Article
Language: English
Published: Elsevier Inc, 01.11.2024
ISSN: 1051-2004
Online access: Full text
Abstract Content-aware image retargeting (CAIR) techniques are crucial in multimedia processing for displaying images on various devices while preserving visually salient content and desirable visual effects. Existing algorithms are either discrete or continuous. Discrete algorithms produce artefacts when the foreground proportion is larger than the retargeting ratio; continuous algorithms tend to squeeze the salient regions. In this paper, we reformulate retargeting as sampling the salient signal and reconstructing the image under aesthetic supervision, and we propose the supervised multi-class image retargeting reconstruction (SMART) framework. In the encoder phase, the target image is split into two complementary parts, masked and unmasked, according to the influence of saliency. A long-range sampling algorithm calculates similarities along an 8-connected planar path while considering both spatial distance and feature correlation. The sampled embeddings in latent space are then used to reconstruct the retargeted image under supervised signals for aesthetic quality. A semantic loss, L_sem, derived from the pretrained CLIP model, maintains consistency of both content and semantics. A supervised loss, L_ir, ensures that the quality of the retargeted results stays close to the preferred labels. We also release a new retargeting dataset comprising seven image classes (animal, building, car, flower, indoor, landscape and people), with supervised labels collected from designers, to support further study of aesthetic retargeting. Ablation studies confirm the effectiveness of the new dataset, and comparative experiments with state-of-the-art baselines demonstrate the advantages of the proposed method.
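The abstract describes the long-range sampling step only in outline, so the following is a minimal sketch of one possible reading, not the authors' implementation. It assumes that the similarity between a source cell and every other cell of a feature grid is the exponential of the negative cost of the cheapest 8-connected path, and that each step costs a weighted sum of its spatial length and the Euclidean distance between the two neighbouring feature vectors. The function names, the weights `alpha` and `beta`, and the use of Dijkstra's algorithm are all illustrative assumptions.

```python
import heapq
import math

# Offsets of the 8 neighbours of a cell on a planar grid.
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def feature_distance(a, b):
    """Euclidean distance between two feature vectors (assumed dissimilarity)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def long_range_similarity(features, source, alpha=1.0, beta=1.0):
    """Similarity of every grid cell to `source` along the cheapest 8-connected path.

    features: H x W grid (list of rows) of feature vectors.
    source:   (row, col) of the reference cell.
    alpha, beta: hypothetical weights for the spatial and feature terms.
    """
    h, w = len(features), len(features[0])
    cost = [[math.inf] * w for _ in range(h)]
    sr, sc = source
    cost[sr][sc] = 0.0
    heap = [(0.0, sr, sc)]
    while heap:  # Dijkstra's algorithm over the 8-connected grid
        d, r, c = heapq.heappop(heap)
        if d > cost[r][c]:
            continue  # stale heap entry, a cheaper path was already found
        for dr, dc in NEIGHBOURS:
            nr, nc = r + dr, c + dc
            if 0 <= nr < h and 0 <= nc < w:
                step = (alpha * math.hypot(dr, dc)
                        + beta * feature_distance(features[r][c], features[nr][nc]))
                nd = d + step
                if nd < cost[nr][nc]:
                    cost[nr][nc] = nd
                    heapq.heappush(heap, (nd, nr, nc))
    # Map accumulated path cost to a similarity in (0, 1].
    return [[math.exp(-cost[r][c]) for c in range(w)] for r in range(h)]
```

Under these assumptions, cells that are spatially distant from the source can still score highly if a path of similar features connects them, which is one way to interpret "long-range". A sampler could then keep the cells with the highest similarity.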
ArticleNumber 104659
Author Gu, Zhenyu
Lu, Hongju
Jiang, Hao
Cui, Jia
Qi, Meng
Author_xml – sequence: 1
  givenname: Jia
  orcidid: 0000-0002-1631-0535
  surname: Cui
  fullname: Cui, Jia
  email: cuijia1247@scut.edu.cn
  organization: the State Key Laboratory of Subtropical Building Science, Guangzhou, 510006, PR China
– sequence: 2
  givenname: Hao
  surname: Jiang
  fullname: Jiang, Hao
  organization: H.Cruiser Informationsgesellschaft mbH, Munich, 80807, Germany
– sequence: 3
  givenname: Meng
  surname: Qi
  fullname: Qi, Meng
  organization: School of Information Science and Engineering, Shandong Normal University, Jinan, 250001, PR China
– sequence: 4
  givenname: Zhenyu
  surname: Gu
  fullname: Gu, Zhenyu
  organization: School of Design, Shanghai Jiaotong University, Shanghai, 205530, PR China
– sequence: 5
  givenname: Hongju
  surname: Lu
  fullname: Lu, Hongju
  email: luhj@gcu.edu.cn
  organization: School of Management, Guangzhou City University of Technology, Guangzhou, 510800, PR China
ContentType Journal Article
Copyright 2024
DOI 10.1016/j.dsp.2024.104659
Discipline Engineering
IsPeerReviewed true
IsScholarly true
Keywords Multi-class image reconstruction
Supervised signals
Long-range sampling
Encoder-decoder structure
Image retargeting
Language English
PublicationTitle Digital signal processing
PublicationYear 2024
Publisher Elsevier Inc