SMART: Supervised multi-class image retargeting generative model based on a long-range sampling strategy
Content-aware image retargeting (CAIR) techniques are crucial in multimedia processing for displaying images on various devices while preserving visually salient contents with desirable visual effects. There are discrete and continuous algorithms. For the former, the artefacts happen when the foregr...
Gespeichert in:
| Veröffentlicht in: | Digital signal processing Jg. 154; S. 104659 |
|---|---|
| Hauptverfasser: | , , , , |
| Format: | Journal Article |
| Sprache: | Englisch |
| Veröffentlicht: |
Elsevier Inc
01.11.2024
|
| Schlagworte: | |
| ISSN: | 1051-2004 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Abstract | Content-aware image retargeting (CAIR) techniques are crucial in multimedia processing for displaying images on various devices while preserving visually salient contents with desirable visual effects. There are discrete and continuous algorithms. For the former, the artefacts happen when the foreground proportion is larger than the retargeting ratio; for the latter, the salient regions are prone to be squeezed. In this paper, we reformulate the retargeting process into sampling the salient signal and reconstruction under aesthetic supervision, the supervised multi-class image retargeting reconstruction (SMART) framework. The target images can be represented into complementary parts, the masked and unmasked ones, according to the saliency influences in the encoder phrase. The long-range sampling algorithm is proposed to calculate similarities through an 8-connected planar path while considering spatial distance and feature correlation. The sampled embeddings in latent space reconstruct the retargeted images under supervised signals for aesthetic quality. The semantic loss Lsem from the pretrained CLIP model can maintain consistency for both content and semantics. The supervised loss, Lir, is introduced to ensure the retargeted qualities are close to the preferred labels. Then, we release a new retargeting dataset comprising seven image classes (animal, building, car, flower, indoor, landscape and people) with supervised labels collected from designers for further aesthetic retargeting study. The ablation studies are conducted to confirm the effectiveness of the new dataset, and comparative experiments with state-of-the-art baselines demonstrate the advantages of the proposed method. |
|---|---|
| AbstractList | Content-aware image retargeting (CAIR) techniques are crucial in multimedia processing for displaying images on various devices while preserving visually salient contents with desirable visual effects. There are discrete and continuous algorithms. For the former, the artefacts happen when the foreground proportion is larger than the retargeting ratio; for the latter, the salient regions are prone to be squeezed. In this paper, we reformulate the retargeting process into sampling the salient signal and reconstruction under aesthetic supervision, the supervised multi-class image retargeting reconstruction (SMART) framework. The target images can be represented into complementary parts, the masked and unmasked ones, according to the saliency influences in the encoder phrase. The long-range sampling algorithm is proposed to calculate similarities through an 8-connected planar path while considering spatial distance and feature correlation. The sampled embeddings in latent space reconstruct the retargeted images under supervised signals for aesthetic quality. The semantic loss Lsem from the pretrained CLIP model can maintain consistency for both content and semantics. The supervised loss, Lir, is introduced to ensure the retargeted qualities are close to the preferred labels. Then, we release a new retargeting dataset comprising seven image classes (animal, building, car, flower, indoor, landscape and people) with supervised labels collected from designers for further aesthetic retargeting study. The ablation studies are conducted to confirm the effectiveness of the new dataset, and comparative experiments with state-of-the-art baselines demonstrate the advantages of the proposed method. |
| ArticleNumber | 104659 |
| Author | Gu, Zhenyu Lu, Hongju Jiang, Hao Cui, Jia Qi, Meng |
| Author_xml | – sequence: 1 givenname: Jia orcidid: 0000-0002-1631-0535 surname: Cui fullname: Cui, Jia email: cuijia1247@scut.edu.cn organization: the State Key Laboratory of Subtropical Building Science, Guangzhou, 510006, PR China – sequence: 2 givenname: Hao surname: Jiang fullname: Jiang, Hao organization: H.Cruiser Informationsgesellschaft mbH, Munich, 80807, Germany – sequence: 3 givenname: Meng surname: Qi fullname: Qi, Meng organization: School of Information Science and Engineering, Shandong Normal University, Jinan, 250001, PR China – sequence: 4 givenname: Zhenyu surname: Gu fullname: Gu, Zhenyu organization: School of Design, Shanghai Jiaotong University, Shanghai, 205530, PR China – sequence: 5 givenname: Hongju surname: Lu fullname: Lu, Hongju email: luhj@gcu.edu.cn organization: School of Management, Guangzhou City University of Technology, Guangzhou, 510800, PR China |
| BookMark | eNp9kMtOwzAQRb0oEm3hA9j5B1Jsx84DVlUFFKkIiZa15diT4CpxItut1L8nUVmzGo10z9XMWaCZ6x0g9EDJihKaPR5XJgwrRhgfd56JcobmlAiaMEL4LVqEcCSE5Jxlc_Sz_1h_HZ7w_jSAP9sABnenNtpEtyoEbDvVAPYQlW8gWtfgBhx4Fe0ZcNcbaHGlJqh3WOG2d03ilRuRoLqhnfIhjmloLnfoplZtgPu_uUTfry-HzTbZfb69b9a7RDNexiQtaKHzShDQueGcpqKEgjElqpoTQwtqCCvyAsqS0pRlmdZVKlStgKVCsKpMl4hee7XvQ_BQy8GPX_iLpEROeuRRjnrkpEde9YzM85WB8bCzBS-DtuA0GOtBR2l6-w_9C7ZRceg |
| Cites_doi | 10.1007/s11042-022-12003-1 10.1109/JETCAS.2014.2298919 10.1109/TIP.2016.2585884 10.1016/j.sigpro.2019.107242 10.1109/LSP.2012.2227726 10.1007/s00500-015-1795-1 10.1109/TMM.2015.2500727 10.1109/TIP.2017.2761556 10.1007/s11265-015-1084-3 10.1109/TCSVT.2014.2329374 10.1109/TSMC.2016.2557225 10.1016/j.eswa.2021.115852 10.1007/s00371-012-0744-6 10.1109/TMM.2012.2228475 10.1109/TMM.2019.2959925 10.1109/ACCESS.2018.2885347 10.1016/j.jvcir.2016.09.002 10.1186/s13640-016-0130-9 10.1109/TCSVT.2020.2977943 10.1109/TPAMI.2014.2353642 10.1016/j.sigpro.2018.09.037 10.1109/TIP.2012.2214050 |
| ContentType | Journal Article |
| Copyright | 2024 |
| Copyright_xml | – notice: 2024 |
| DBID | AAYXX CITATION |
| DOI | 10.1016/j.dsp.2024.104659 |
| DatabaseName | CrossRef |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering |
| ExternalDocumentID | 10_1016_j_dsp_2024_104659 S1051200424002847 |
| GroupedDBID | --K --M .DC .~1 0R~ 1B1 1~. 1~5 29G 4.4 457 4G. 5GY 5VS 7-5 71M 8P~ 9JN AACTN AAEDT AAEDW AAIKJ AAKOC AALRI AAOAW AAQFI AAQXK AAXKI AAXUO AAYFN ABBOA ABFNM ABJNI ABMAC ABXDB ACDAQ ACGFS ACNNM ACRLP ACZNC ADBBV ADEZE ADFGL ADJOM ADMUD ADTZH AEBSH AECPX AEKER AENEX AFKWA AFTJW AGHFR AGUBO AGYEJ AHHHB AHJVU AHZHX AIALX AIEXJ AIKHN AITUG AJOXV AKRWK ALMA_UNASSIGNED_HOLDINGS AMFUW AMRAJ AOUOD ASPBG AVWKF AXJTR AZFZN BJAXD BKOJK BLXMC CAG COF CS3 DM4 DU5 EBS EFBJH EJD EO8 EO9 EP2 EP3 F0J F5P FDB FEDTE FGOYB FIRID FNPLU FYGXN G-2 G-Q G8K GBLVA GBOLZ HLZ HVGLF HZ~ IHE J1W JJJVA KOM LG5 LG9 LY7 M41 MO0 N9A O-L O9- OAUVE OZT P-8 P-9 P2P PC. Q38 R2- RIG ROL RPZ SBC SDF SDG SDP SES SET SEW SPC SPCBC SST SSV SSZ T5K WUQ XPP ZMT ZU3 ~G- 9DU AATTM AAYWO AAYXX ABDPE ABWVN ACLOT ACRPL ACVFH ADCNI ADNMO AEIPS AEUPX AFJKZ AFPUW AGQPQ AIGII AIIUN AKBMS AKYEP ANKPU APXCP CITATION EFKBS EFLBG ~HD |
| ID | FETCH-LOGICAL-c249t-3818c7b50ec7d441359e822a5bf40d181d02878e99113266ccb35afae23552b93 |
| ISICitedReferencesCount | 0 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001288597600001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1051-2004 |
| IngestDate | Sat Nov 29 05:58:10 EST 2025 Sat Sep 14 18:13:24 EDT 2024 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Keywords | Multi-class image reconstruction Supervised signals Long-range sampling Encoder-decoder structure Image retargeting |
| Language | English |
| LinkModel | OpenURL |
| MergedId | FETCHMERGED-LOGICAL-c249t-3818c7b50ec7d441359e822a5bf40d181d02878e99113266ccb35afae23552b93 |
| ORCID | 0000-0002-1631-0535 |
| ParticipantIDs | crossref_primary_10_1016_j_dsp_2024_104659 elsevier_sciencedirect_doi_10_1016_j_dsp_2024_104659 |
| PublicationCentury | 2000 |
| PublicationDate | November 2024 2024-11-00 |
| PublicationDateYYYYMMDD | 2024-11-01 |
| PublicationDate_xml | – month: 11 year: 2024 text: November 2024 |
| PublicationDecade | 2020 |
| PublicationTitle | Digital signal processing |
| PublicationYear | 2024 |
| Publisher | Elsevier Inc |
| Publisher_xml | – name: Elsevier Inc |
| References | Bao (bib0031) 2021 Lin, Zhou, Chen (bib0014) 2019 Cui (bib0005) 2020; 166 Mittal, Soundararajan, Bovik (bib0046) 2012; 20 Cho (bib0004) 2017 Ho, Jain, Abbeel (bib0027) 2020; 33 Hashemzadeh, Asheghi, Farajzadeh (bib0009) 2019; 155 Zhang (bib0050) 2016; 25 Krause J., Deng J., Stark M., Fei-Fei L. Collecting a large-scale dataset of fine-grained cars.(2013). URL Zhang (bib0024) 2013; 29 Tang, Lv, Tang (bib0011) 2022; 104 2019.p.1730–1743. Zhou (bib0008) 2016; 41 Avidan, Shamir (bib0001) 2007 Garg, Nayyar, Singh (bib0006) 2022; 81 Celona, Ciocca, Napoletano (bib0012) 2021; 186 Zheng (bib0035) 2017 Fang (bib0041) 2014; 4 Kwon, Ye (bib0028) 2022 Zeqi, Chenda, Jia (bib0052) 2018 2013;16632981. Fang (bib0015) 2017; 47 Song, Lee, Lee (bib0002) 2018; 7 Wang (bib0016) 2020 Venkatanath (bib0048) 2015 Panozzo, Weber, Sorkine (bib0025) 2012 Mittal, Moorthy, Bovik (bib0047) 2012; 21 Shafieyan (bib0020) 2017; 50 Liu, Yuen, Torralba (bib0049) 2016 Yan (bib0010) 2014; 25 Radford (bib0034) 2021 Choi, Kim (bib0007) 2016; 85 Lu (bib0021) 2018 Song (bib0036) 2020; 33 Zhang (bib0051) 2018; 27 Shocher (bib0039) 2019 Philbin (bib0044) 2007 Tan, W., et al., Cycle-IR: deep cyclic image retargeting. arXiv preprint Zhang (bib0013) 2017; 21 Ito (bib0023) 2016; 2016 Zhou, Chen, Li (bib0017) 2020; 31 Yang (bib0038) 2015; 37 Shafieyan (bib0019) 2014 Caron (bib0032) 2021 Oord, Li, Vinyals (bib0040) 2018 Dosovitskiy (bib0030) 2020 Yue (bib0022) 2018; 30 Xie (bib0029) 2023 Lin (bib0045) 2012; 15 Lin (bib0042) 2014 Tan (bib0026) 2015; 18 Zhou, Wei, Wang, Shen, Xie, Yuille, Kong (bib0033) 2021 Song (bib0037) 2019; 32 Rubinstein (bib0018) 2010 Zhou (10.1016/j.dsp.2024.104659_bib0008) 2016; 41 Lin (10.1016/j.dsp.2024.104659_bib0014) 2019 Lin (10.1016/j.dsp.2024.104659_bib0045) 2012; 15 Mittal (10.1016/j.dsp.2024.104659_bib0047) 2012; 21 Lu (10.1016/j.dsp.2024.104659_bib0021) 2018 Fang (10.1016/j.dsp.2024.104659_bib0041) 2014; 4 Panozzo (10.1016/j.dsp.2024.104659_bib0025) 2012 Choi (10.1016/j.dsp.2024.104659_bib0007) 2016; 85 10.1016/j.dsp.2024.104659_bib0043 Radford (10.1016/j.dsp.2024.104659_bib0034) 2021 10.1016/j.dsp.2024.104659_bib0003 Kwon (10.1016/j.dsp.2024.104659_bib0028) 2022 Hashemzadeh (10.1016/j.dsp.2024.104659_bib0009) 2019; 155 Yan (10.1016/j.dsp.2024.104659_bib0010) 2014; 25 Zhou (10.1016/j.dsp.2024.104659_bib0017) 2020; 31 Yue (10.1016/j.dsp.2024.104659_bib0022) 2018; 30 Zeqi (10.1016/j.dsp.2024.104659_bib0052) 2018 Garg (10.1016/j.dsp.2024.104659_bib0006) 2022; 81 Zhang (10.1016/j.dsp.2024.104659_bib0050) 2016; 25 Tang (10.1016/j.dsp.2024.104659_bib0011) 2022; 104 Shafieyan (10.1016/j.dsp.2024.104659_bib0020) 2017; 50 Yang (10.1016/j.dsp.2024.104659_bib0038) 2015; 37 Lin (10.1016/j.dsp.2024.104659_bib0042) 2014 Shocher (10.1016/j.dsp.2024.104659_bib0039) 2019 Avidan (10.1016/j.dsp.2024.104659_bib0001) 2007 Venkatanath (10.1016/j.dsp.2024.104659_bib0048) 2015 Song (10.1016/j.dsp.2024.104659_bib0002) 2018; 7 Fang (10.1016/j.dsp.2024.104659_bib0015) 2017; 47 Song (10.1016/j.dsp.2024.104659_bib0037) 2019; 32 Bao (10.1016/j.dsp.2024.104659_bib0031) 2021 Wang (10.1016/j.dsp.2024.104659_bib0016) 2020 Cho (10.1016/j.dsp.2024.104659_bib0004) 2017 Xie (10.1016/j.dsp.2024.104659_bib0029) 2023 Rubinstein (10.1016/j.dsp.2024.104659_bib0018) 2010 Oord (10.1016/j.dsp.2024.104659_bib0040) 2018 Ito (10.1016/j.dsp.2024.104659_bib0023) 2016; 2016 Celona (10.1016/j.dsp.2024.104659_bib0012) 2021; 186 Zheng (10.1016/j.dsp.2024.104659_bib0035) 2017 Philbin (10.1016/j.dsp.2024.104659_bib0044) 2007 Dosovitskiy (10.1016/j.dsp.2024.104659_bib0030) 2020 Zhang (10.1016/j.dsp.2024.104659_bib0051) 2018; 27 Caron (10.1016/j.dsp.2024.104659_bib0032) 2021 Tan (10.1016/j.dsp.2024.104659_bib0026) 2015; 18 Cui (10.1016/j.dsp.2024.104659_bib0005) 2020; 166 Zhou (10.1016/j.dsp.2024.104659_bib0033) 2021 Liu (10.1016/j.dsp.2024.104659_bib0049) 2016 Shafieyan (10.1016/j.dsp.2024.104659_bib0019) 2014 Song (10.1016/j.dsp.2024.104659_bib0036) 2020; 33 Mittal (10.1016/j.dsp.2024.104659_bib0046) 2012; 20 Zhang (10.1016/j.dsp.2024.104659_bib0024) 2013; 29 Zhang (10.1016/j.dsp.2024.104659_bib0013) 2017; 21 Ho (10.1016/j.dsp.2024.104659_bib0027) 2020; 33 |
| References_xml | – reference: . 2013;16632981. – start-page: 609 year: 2007 end-page: 617 ident: bib0001 article-title: Seam carving for content-aware image resizing publication-title: ACM Transactions on Graphics (TOG) – reference: Tan, W., et al., Cycle-IR: deep cyclic image retargeting. arXiv preprint – start-page: 1 year: 2015 end-page: 6 ident: bib0048 article-title: Blind image quality evaluation using perception based features publication-title: 2015 twenty first national conference on communications (NCC) – start-page: 73 year: 2018 end-page: 77 ident: bib0052 article-title: The multi-modality content-aware retargeting algorithm and variable scale similarity measurement for image retargeting publication-title: Proceedings of the 2nd International Conference on Advances in Image Processing – volume: 30 start-page: 415 year: 2018 end-page: 523 ident: bib0022 article-title: Image retargeting using blur based depth saliency descriptor publication-title: J. Comput.-Aid. Des. Comput. Graph. – start-page: 1 year: 2007 end-page: 8 ident: bib0044 article-title: Object retrieval with large vocabularies and fast spatial matching publication-title: 2007 IEEE conference on computer vision and pattern recognition – volume: 31 start-page: 126 year: 2020 end-page: 139 ident: bib0017 article-title: Weakly supervised reinforced multi-operator image retargeting publication-title: IEEE Transact. Circuit. Syst. Video Technol. – start-page: 493 year: 2018 end-page: 500 ident: bib0021 article-title: Contour sensitive saliency and depth application in image retargeting publication-title: Ninth International Conference on Graphic and Image Processing (ICGIP 2017) – year: 2022 ident: bib0028 article-title: arXiv preprint – volume: 4 start-page: 95 year: 2014 end-page: 105 ident: bib0041 article-title: Objective quality assessment for image retargeting based on structural similarity publication-title: IEEE J. Emerg. Select. Top. Circuit. Syst. – start-page: 22428 year: 2023 end-page: 22437 ident: bib0029 article-title: Smartbrush: text and shape guided object inpainting with diffusion model publication-title: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition – start-page: 11929 year: 2020 ident: bib0030 article-title: arXiv preprint – volume: 25 start-page: 15 year: 2014 end-page: 23 ident: bib0010 article-title: Seam searching-based pixel fusion for image retargeting publication-title: IEEE Transact. Circuit. Syst. Video Technol. – volume: 155 start-page: 233 year: 2019 end-page: 246 ident: bib0009 article-title: Content-aware image resizing: an improved and shadow-preserving seam carving method publication-title: Signal Process. – year: 2021 ident: bib0033 article-title: arXiv preprint – volume: 85 start-page: 275 year: 2016 end-page: 283 ident: bib0007 article-title: Sparse seam-carving for structure preserving image retargeting publication-title: J. Signal Process. Syst. – volume: 166 year: 2020 ident: bib0005 article-title: Distortion-aware image retargeting based on continuous seam carving model publication-title: Signal Process. – volume: 27 start-page: 451 year: 2018 end-page: 463 ident: bib0051 article-title: Multiple-level feature-based measure for retargeted image quality publication-title: IEEE Transact. Image Process. – volume: 47 start-page: 2956 year: 2017 end-page: 2966 ident: bib0015 article-title: Optimized multioperator image retargeting based on perceptual similarity measure publication-title: IEEE Transact. Syst. Man. Cybernet.: Syst. – volume: 2016 start-page: 27 year: 2016 ident: bib0023 article-title: Gradient-based global features for seam carving publication-title: EURASIP J. Image Video Process. – volume: 37 start-page: 834 year: 2015 end-page: 846 ident: bib0038 article-title: Stereo matching using tree filtering publication-title: IEEE Transact. Patt. Analy. Mach. Intell. – volume: 20 start-page: 209 year: 2012 end-page: 212 ident: bib0046 article-title: Making a “completely blind” image quality analyzer publication-title: IEEE Signal Process. Lett. – volume: 29 start-page: 407 year: 2013 end-page: 420 ident: bib0024 article-title: Image retargeting with multifocus fisheye transformation publication-title: Vis. Comput. – volume: 33 start-page: 6840 year: 2020 end-page: 6851 ident: bib0027 article-title: Denoising diffusion probabilistic models publication-title: Adv. Neural. Inf. Process. Syst. – start-page: 54 year: 2019 end-page: 59 ident: bib0014 article-title: DeepIR: a deep semantics driven framework for image retargeting publication-title: 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) – start-page: 740 year: 2014 end-page: 755 ident: bib0042 article-title: Microsoft coco: common objects in context publication-title: European conference on computer vision – volume: 25 start-page: 4286 year: 2016 end-page: 4297 ident: bib0050 article-title: Backward registration-based aspect ratio similarity for image retargeting quality assessment publication-title: IEEE Transact. Image Process. – volume: 32 year: 2019 ident: bib0037 article-title: Learnable tree filter for structure-preserving feature transform publication-title: Adv. Neural Inf. Process. Syst. – volume: 7 start-page: 284 year: 2018 end-page: 292 ident: bib0002 article-title: CarvingNet: content-guided seam carving using deep convolution neural network publication-title: IEEE Access – volume: 41 start-page: 21 year: 2016 end-page: 30 ident: bib0008 article-title: Optimal bi-directional seam carving for compressibility-aware image retargeting publication-title: J. Vis. Commun. Image Represent. – volume: 186 year: 2021 ident: bib0012 article-title: A grid anchor based cropping approach exploiting image aesthetics, geometric composition, and semantics publication-title: Expert Syst. Appl. – start-page: 1 year: 2010 end-page: 10 ident: bib0018 article-title: A comparative study of image retargeting publication-title: ACM Transactions On Graphics (TOG) – volume: 21 start-page: 4695 year: 2012 end-page: 4708 ident: bib0047 article-title: No-reference image quality assessment in the spatial domain publication-title: IEEE Transact. Image Process. – volume: 21 start-page: 447 year: 2017 end-page: 457 ident: bib0013 article-title: Seam warping: a new approach for image retargeting for small displays publication-title: Soft Comput. – reference: Krause J., Deng J., Stark M., Fei-Fei L. Collecting a large-scale dataset of fine-grained cars.(2013). URL – start-page: 9650 year: 2021 end-page: 9660 ident: bib0032 article-title: Emerging properties in self-supervised vision transformers publication-title: Proceedings of the IEEE/CVF international conference on computer vision – year: 2018 ident: bib0040 article-title: arXiv preprint – volume: 33 start-page: 3991 year: 2020 end-page: 4002 ident: bib0036 article-title: Rethinking learnable tree filter for generic feature transform publication-title: Adv. Neural. Inf. Process Syst. – start-page: 229 year: 2012 end-page: 236 ident: bib0025 article-title: Robust Image Retargeting Via Axis-Aligned deformation. in Computer Graphics Forum – reference: , 2019.p.1730–1743. – volume: 104 year: 2022 ident: bib0011 article-title: Adaptive cropping with interframe relative displacement constraint for video retargeting publication-title: Signal Process.: Image Commun. – volume: 18 start-page: 128 year: 2015 end-page: 137 ident: bib0026 article-title: Image retargeting for preserving robust local feature: application to mobile visual search publication-title: IEEE Trans. Multimed. – year: 2017 ident: bib0035 article-title: Learning multi-attention convolutional neural network for fine-grained image recognition publication-title: Proceedings of the IEEE international conference on computer vision – start-page: 1609 year: 2020 end-page: 1614 ident: bib0016 article-title: Multi-operator video retargeting method based on improved seam carving publication-title: 2020 IEEE 5th Information Technology and Mechatronics Engineering Conference (ITOEC) – start-page: 8748 year: 2021 end-page: 8763 ident: bib0034 article-title: Learning transferable visual models from natural language supervision publication-title: International Conference on Machine Learning – volume: 50 start-page: 34 year: 2017 end-page: 43 ident: bib0020 article-title: Image retargeting using depth assisted saliency map publication-title: Signal Process.: Image Commun. – start-page: 4558 year: 2017 end-page: 4567 ident: bib0004 article-title: Weakly-and self-supervised learning for content-aware deep image retargeting publication-title: Proceedings of the IEEE International Conference on Computer Vision – start-page: 15 year: 2016 end-page: 49 ident: bib0049 article-title: Sift flow: Dense correspondence Across Scenes and Its applications, in Dense Image Correspondences for Computer Vision – start-page: 4492 year: 2019 end-page: 4501 ident: bib0039 article-title: Ingan: capturing and retargeting the" dna" of a natural image publication-title: Proceedings of the IEEE/CVF International Conference on Computer Vision – start-page: 1155 year: 2014 end-page: 1159 ident: bib0019 article-title: Image seam carving using depth assisted saliency map publication-title: Image Processing (ICIP), 2014 IEEE International Conference on – volume: 15 start-page: 359 year: 2012 end-page: 368 ident: bib0045 article-title: Patch-based image warping for content-aware retargeting publication-title: IEEE Trans. Multimed. – volume: 81 start-page: 12883 year: 2022 end-page: 12924 ident: bib0006 article-title: Improved seam carving for structure preservation using efficient energy function publication-title: Multimed. Tool. Appl. – year: 2021 ident: bib0031 article-title: arXiv preprint – volume: 81 start-page: 12883 issue: 9 year: 2022 ident: 10.1016/j.dsp.2024.104659_bib0006 article-title: Improved seam carving for structure preservation using efficient energy function publication-title: Multimed. Tool. Appl. doi: 10.1007/s11042-022-12003-1 – year: 2021 ident: 10.1016/j.dsp.2024.104659_bib0033 – volume: 4 start-page: 95 issue: 1 year: 2014 ident: 10.1016/j.dsp.2024.104659_bib0041 article-title: Objective quality assessment for image retargeting based on structural similarity publication-title: IEEE J. Emerg. Select. Top. Circuit. Syst. doi: 10.1109/JETCAS.2014.2298919 – year: 2022 ident: 10.1016/j.dsp.2024.104659_bib0028 – volume: 25 start-page: 4286 issue: 9 year: 2016 ident: 10.1016/j.dsp.2024.104659_bib0050 article-title: Backward registration-based aspect ratio similarity for image retargeting quality assessment publication-title: IEEE Transact. Image Process. doi: 10.1109/TIP.2016.2585884 – start-page: 54 year: 2019 ident: 10.1016/j.dsp.2024.104659_bib0014 article-title: DeepIR: a deep semantics driven framework for image retargeting – start-page: 1 year: 2007 ident: 10.1016/j.dsp.2024.104659_bib0044 article-title: Object retrieval with large vocabularies and fast spatial matching – start-page: 1155 year: 2014 ident: 10.1016/j.dsp.2024.104659_bib0019 article-title: Image seam carving using depth assisted saliency map – year: 2018 ident: 10.1016/j.dsp.2024.104659_bib0040 – start-page: 1 year: 2010 ident: 10.1016/j.dsp.2024.104659_bib0018 article-title: A comparative study of image retargeting – start-page: 740 year: 2014 ident: 10.1016/j.dsp.2024.104659_bib0042 article-title: Microsoft coco: common objects in context – start-page: 4558 year: 2017 ident: 10.1016/j.dsp.2024.104659_bib0004 article-title: Weakly-and self-supervised learning for content-aware deep image retargeting – start-page: 22428 year: 2023 ident: 10.1016/j.dsp.2024.104659_bib0029 article-title: Smartbrush: text and shape guided object inpainting with diffusion model – start-page: 4492 year: 2019 ident: 10.1016/j.dsp.2024.104659_bib0039 article-title: Ingan: capturing and retargeting the" dna" of a natural image – volume: 166 year: 2020 ident: 10.1016/j.dsp.2024.104659_bib0005 article-title: Distortion-aware image retargeting based on continuous seam carving model publication-title: Signal Process. doi: 10.1016/j.sigpro.2019.107242 – volume: 20 start-page: 209 issue: 3 year: 2012 ident: 10.1016/j.dsp.2024.104659_bib0046 article-title: Making a “completely blind” image quality analyzer publication-title: IEEE Signal Process. Lett. doi: 10.1109/LSP.2012.2227726 – start-page: 1609 year: 2020 ident: 10.1016/j.dsp.2024.104659_bib0016 article-title: Multi-operator video retargeting method based on improved seam carving – start-page: 229 year: 2012 ident: 10.1016/j.dsp.2024.104659_bib0025 – volume: 21 start-page: 447 issue: 2 year: 2017 ident: 10.1016/j.dsp.2024.104659_bib0013 article-title: Seam warping: a new approach for image retargeting for small displays publication-title: Soft Comput. doi: 10.1007/s00500-015-1795-1 – volume: 18 start-page: 128 issue: 1 year: 2015 ident: 10.1016/j.dsp.2024.104659_bib0026 article-title: Image retargeting for preserving robust local feature: application to mobile visual search publication-title: IEEE Trans. Multimed. doi: 10.1109/TMM.2015.2500727 – volume: 27 start-page: 451 issue: 1 year: 2018 ident: 10.1016/j.dsp.2024.104659_bib0051 article-title: Multiple-level feature-based measure for retargeted image quality publication-title: IEEE Transact. Image Process. doi: 10.1109/TIP.2017.2761556 – start-page: 1 year: 2015 ident: 10.1016/j.dsp.2024.104659_bib0048 article-title: Blind image quality evaluation using perception based features – volume: 85 start-page: 275 issue: 2 year: 2016 ident: 10.1016/j.dsp.2024.104659_bib0007 article-title: Sparse seam-carving for structure preserving image retargeting publication-title: J. Signal Process. Syst. doi: 10.1007/s11265-015-1084-3 – ident: 10.1016/j.dsp.2024.104659_bib0043 – volume: 32 year: 2019 ident: 10.1016/j.dsp.2024.104659_bib0037 article-title: Learnable tree filter for structure-preserving feature transform publication-title: Adv. Neural Inf. Process. Syst. – start-page: 9650 year: 2021 ident: 10.1016/j.dsp.2024.104659_bib0032 article-title: Emerging properties in self-supervised vision transformers – volume: 33 start-page: 3991 year: 2020 ident: 10.1016/j.dsp.2024.104659_bib0036 article-title: Rethinking learnable tree filter for generic feature transform publication-title: Adv. Neural. Inf. Process Syst. – volume: 25 start-page: 15 issue: 1 year: 2014 ident: 10.1016/j.dsp.2024.104659_bib0010 article-title: Seam searching-based pixel fusion for image retargeting publication-title: IEEE Transact. Circuit. Syst. Video Technol. doi: 10.1109/TCSVT.2014.2329374 – volume: 33 start-page: 6840 year: 2020 ident: 10.1016/j.dsp.2024.104659_bib0027 article-title: Denoising diffusion probabilistic models publication-title: Adv. Neural. Inf. Process. Syst. – volume: 50 start-page: 34 year: 2017 ident: 10.1016/j.dsp.2024.104659_bib0020 article-title: Image retargeting using depth assisted saliency map publication-title: Signal Process.: Image Commun. – volume: 30 start-page: 415 year: 2018 ident: 10.1016/j.dsp.2024.104659_bib0022 article-title: Image retargeting using blur based depth saliency descriptor publication-title: J. Comput.-Aid. Des. Comput. Graph. – year: 2021 ident: 10.1016/j.dsp.2024.104659_bib0031 – start-page: 609 year: 2007 ident: 10.1016/j.dsp.2024.104659_bib0001 article-title: Seam carving for content-aware image resizing – volume: 47 start-page: 2956 issue: 11 year: 2017 ident: 10.1016/j.dsp.2024.104659_bib0015 article-title: Optimized multioperator image retargeting based on perceptual similarity measure publication-title: IEEE Transact. Syst. Man. Cybernet.: Syst. doi: 10.1109/TSMC.2016.2557225 – start-page: 15 year: 2016 ident: 10.1016/j.dsp.2024.104659_bib0049 – volume: 186 year: 2021 ident: 10.1016/j.dsp.2024.104659_bib0012 article-title: A grid anchor based cropping approach exploiting image aesthetics, geometric composition, and semantics publication-title: Expert Syst. Appl. doi: 10.1016/j.eswa.2021.115852 – start-page: 8748 year: 2021 ident: 10.1016/j.dsp.2024.104659_bib0034 article-title: Learning transferable visual models from natural language supervision – volume: 29 start-page: 407 year: 2013 ident: 10.1016/j.dsp.2024.104659_bib0024 article-title: Image retargeting with multifocus fisheye transformation publication-title: Vis. Comput. doi: 10.1007/s00371-012-0744-6 – volume: 15 start-page: 359 issue: 2 year: 2012 ident: 10.1016/j.dsp.2024.104659_bib0045 article-title: Patch-based image warping for content-aware retargeting publication-title: IEEE Trans. Multimed. doi: 10.1109/TMM.2012.2228475 – ident: 10.1016/j.dsp.2024.104659_bib0003 doi: 10.1109/TMM.2019.2959925 – volume: 7 start-page: 284 year: 2018 ident: 10.1016/j.dsp.2024.104659_bib0002 article-title: CarvingNet: content-guided seam carving using deep convolution neural network publication-title: IEEE Access doi: 10.1109/ACCESS.2018.2885347 – volume: 41 start-page: 21 year: 2016 ident: 10.1016/j.dsp.2024.104659_bib0008 article-title: Optimal bi-directional seam carving for compressibility-aware image retargeting publication-title: J. Vis. Commun. Image Represent. doi: 10.1016/j.jvcir.2016.09.002 – volume: 2016 start-page: 27 issue: 1 year: 2016 ident: 10.1016/j.dsp.2024.104659_bib0023 article-title: Gradient-based global features for seam carving publication-title: EURASIP J. Image Video Process. doi: 10.1186/s13640-016-0130-9 – start-page: 11929 year: 2020 ident: 10.1016/j.dsp.2024.104659_bib0030 – year: 2017 ident: 10.1016/j.dsp.2024.104659_bib0035 article-title: Learning multi-attention convolutional neural network for fine-grained image recognition – volume: 31 start-page: 126 issue: 1 year: 2020 ident: 10.1016/j.dsp.2024.104659_bib0017 article-title: Weakly supervised reinforced multi-operator image retargeting publication-title: IEEE Transact. Circuit. Syst. Video Technol. doi: 10.1109/TCSVT.2020.2977943 – volume: 104 year: 2022 ident: 10.1016/j.dsp.2024.104659_bib0011 article-title: Adaptive cropping with interframe relative displacement constraint for video retargeting publication-title: Signal Process.: Image Commun. – start-page: 493 year: 2018 ident: 10.1016/j.dsp.2024.104659_bib0021 article-title: Contour sensitive saliency and depth application in image retargeting – start-page: 73 year: 2018 ident: 10.1016/j.dsp.2024.104659_bib0052 article-title: The multi-modality content-aware retargeting algorithm and variable scale similarity measurement for image retargeting – volume: 37 start-page: 834 issue: 04 year: 2015 ident: 10.1016/j.dsp.2024.104659_bib0038 article-title: Stereo matching using tree filtering publication-title: IEEE Transact. Patt. Analy. Mach. Intell. doi: 10.1109/TPAMI.2014.2353642 – volume: 155 start-page: 233 year: 2019 ident: 10.1016/j.dsp.2024.104659_bib0009 article-title: Content-aware image resizing: an improved and shadow-preserving seam carving method publication-title: Signal Process. doi: 10.1016/j.sigpro.2018.09.037 – volume: 21 start-page: 4695 issue: 12 year: 2012 ident: 10.1016/j.dsp.2024.104659_bib0047 article-title: No-reference image quality assessment in the spatial domain publication-title: IEEE Transact. Image Process. doi: 10.1109/TIP.2012.2214050 |
| SSID | ssj0007426 |
| Score | 2.3782175 |
| Snippet | Content-aware image retargeting (CAIR) techniques are crucial in multimedia processing for displaying images on various devices while preserving visually... |
| SourceID | crossref elsevier |
| SourceType | Index Database Publisher |
| StartPage | 104659 |
| SubjectTerms | Encoder-decoder structure Image retargeting Long-range sampling Multi-class image reconstruction Supervised signals |
| Title | SMART: Supervised multi-class image retargeting generative model based on a long-range sampling strategy |
| URI | https://dx.doi.org/10.1016/j.dsp.2024.104659 |
| Volume | 154 |
| WOSCitedRecordID | wos001288597600001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVESC databaseName: ScienceDirect database issn: 1051-2004 databaseCode: AIEXJ dateStart: 19950101 customDbUrl: isFulltext: true dateEnd: 99991231 titleUrlDefault: https://www.sciencedirect.com omitProxy: false ssIdentifier: ssj0007426 providerName: Elsevier |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1La9wwEBZN0kN6KH0kNH2hQ081Kl5bXtm9hTYlDW0IbAp7M5Isbzak3mW9W9J_35mR7fWmLbSBXIwxfjHzIX2a-WbE2JuhNS5TYShsOSyFlNKJtIiMCA12I3dal4a6639Rp6fpeJydNSUENW0noKoqvb7O5nfqargGzsbS2f9wd_dSuADn4HQ4gtvh-E-OH30FjooL_dFqjgNBDZSSZIPCIlMOpt9RpoMyQwyGY6RgQq2nSUNEG-MEOLUVmEbQwdWsmogFViAEtUb1OYUfqL_ERkL443SC248EqAfB4i5ff9DOi6TBJt3AybSbB-DUDzTHetbFX-kmlNp2wqAV5U8uXPVz1Q9RRLKp1eviZm3tzIa0E4jdwDuuPxb7jtK_jes-xHD5rqixx2gkKTXdtBLfbJc9wvdGlNHF9aRUW2wnUkkGI97O4eej8Uk3TytJm_F1_9HmvEn9d-NDf2YtPSZy_og9bJYQ_NC7_jG756on7EGvseRTdkEgeM_XEOA9CHCCAO9BgK8hwAkCnCDAZxXXfA0B3kKAtxDYY98-HZ1_OBbNnhrCwkJ7KZCgWWWS0FlVABWOk8wBR9SJKWVYAN0rwGgqdbBsGACzH1pr4kSX2kVATCOTxftsu5pV7hnjLitsbEsbq0EpTZilbhAa6YYGKX1p0gP2trVZPvetU_JWU3iZg4FzNHDuDXzAZGvVvOF-ntPlAIG_P_b8do-9YLtrnL5k28vFyr1i9-2P5bRevG6A8gtukX7G |
| linkProvider | Elsevier |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=SMART%3A+Supervised+multi-class+image+retargeting+generative+model+based+on+a+long-range+sampling+strategy&rft.jtitle=Digital+signal+processing&rft.au=Cui%2C+Jia&rft.au=Jiang%2C+Hao&rft.au=Qi%2C+Meng&rft.au=Gu%2C+Zhenyu&rft.date=2024-11-01&rft.pub=Elsevier+Inc&rft.issn=1051-2004&rft.volume=154&rft_id=info:doi/10.1016%2Fj.dsp.2024.104659&rft.externalDocID=S1051200424002847 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1051-2004&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1051-2004&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1051-2004&client=summon |