Landmarking for Navigational Streaming of Stored High-Dimensional Media
Modern media data such as 360° videos and light field (LF) images are typically captured in much higher dimensions than the observers' visual displays. To efficiently browse high-dimensional media, a navigational streaming model is considered: a client navigates the media space by dictating a n...
| Published in: | IEEE Transactions on Circuits and Systems for Video Technology, Vol. 32, No. 8, pp. 5663-5679 |
|---|---|
| Main authors: | Yuan Yuan, Gene Cheung, Pascal Frossard, H. Vicky Zhao, Jiwu Huang |
| Format: | Journal Article |
| Language: | English |
| Published: | New York: IEEE (The Institute of Electrical and Electronics Engineers, Inc.), 1 August 2022 |
| ISSN: | 1051-8215, 1558-2205 |
| Online access: | Full text |
| Abstract | Modern media data such as 360° videos and light field (LF) images are typically captured in much higher dimensions than observers' visual displays. To browse high-dimensional media efficiently, a navigational streaming model is considered: a client navigates the media space by dictating a navigation path to a server, which in response transmits the corresponding pre-encoded media data units (MDUs) to the client one by one in sequence. Assuming that the MDU quality is pre-chosen and fixed, the problem is to select and store redundant representations of MDUs at the server so as to best trade off storage and transmission costs while still giving the user adequate random access. We address this problem with a landmark-based MDU optimization framework. The media space is divided into neighborhoods, each containing one landmark (a chosen MDU). MDUs in a neighborhood use the associated landmark as a predictor for inter-coding. Thus, for any MDU transition within the same neighborhood, only one inter-coded MDU transmission is required when the landmark resides in the decoder buffer. This lowers the transmission cost and enables navigational random access. To optimize an MDU structure, we employ a tree-structured vector quantizer (TSVQ) to first optimize landmark locations, then iteratively add P-MDUs as refinements using a fast branch-and-bound technique. Taking interactive LF images and viewport-adaptive 360° images as illustrative applications, and using I-frames, P-frames, and previously proposed merge frames to intra- and inter-code MDUs, we show experimentally that landmarked MDU structures can noticeably reduce the expected transmission cost compared with MDU structures without landmarks. |
|---|---|
| Author | Yuan, Yuan; Cheung, Gene; Frossard, Pascal; Zhao, H. Vicky; Huang, Jiwu |
| Author details | 1. Yuan Yuan (ORCID 0000-0003-3352-0662; yuanyustc@hotmail.com), School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou, China. 2. Gene Cheung (ORCID 0000-0002-5571-4137; genec@yorku.ca), Department of EECS, York University, Toronto, ON, Canada. 3. Pascal Frossard (ORCID 0000-0002-4010-714X; pascal.frossard@epfl.ch), Signal Processing Laboratory (LTS4), École Polytechnique Fédérale de Lausanne (EPFL), Switzerland. 4. H. Vicky Zhao (ORCID 0000-0002-3690-9924; vzhao@tsinghua.edu.cn), Department of Automation, Tsinghua University, Beijing, China. 5. Jiwu Huang (ORCID 0000-0002-7625-5689; jwhuang@szu.edu.cn), Guangdong Key Laboratory of Intelligent Information Processing and Shenzhen Key Laboratory of Media Security, Shenzhen University, Shenzhen, China. |
| CODEN | ITCTEM |
| ContentType | Journal Article |
| Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2022 |
| DOI | 10.1109/TCSVT.2022.3150780 |
| Discipline | Engineering |
| Genre | orig-research |
| Grant information | National Science Foundations of China (grants U19B2022, U1636202, 61701310; funder ID 10.13039/501100001809); NSERC (grants RGPIN-2019-06271, RGPAS-2019-00110; funder ID 10.13039/501100000038); Guangdong Natural Science Foundation (grant 2020A1515110781; funder ID 10.13039/501100003453) |
| IsDoiOpenAccess | false |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
| OpenAccessLink | http://infoscience.epfl.ch/record/295939 |
| PageCount | 17 |
| PublicationTitleAbbrev | TCSVT |
| SubjectTerms | Branch and bound methods; Costs; Distributed source coding; Encoding; Media; Media compression; Navigation; Navigational streaming; Optimization; Random access; Servers; Storage; Streaming media; Videos; Visual observation |
| URI | https://ieeexplore.ieee.org/document/9709785 https://www.proquest.com/docview/2697573235 |
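
The landmark idea described in the abstract can be illustrated with a small toy model. The sketch below is not the authors' optimization framework: the cost values `I_COST` and `P_COST`, the `landmark_of` mapping, and the baseline in which every MDU is sent intra-coded are all hypothetical assumptions chosen only to show why a landmark held in the decoder buffer reduces the cost of transitions inside a neighborhood.

```python
from typing import Dict, List, Set

# Hypothetical sizes, not taken from the paper.
I_COST = 10.0  # assumed cost of sending an intra-coded MDU (e.g. a landmark)
P_COST = 2.0   # assumed cost of sending an MDU inter-coded from its landmark

# Hypothetical media space: MDUs 0..5 in two neighborhoods,
# {0, 1, 2} with landmark 1 and {3, 4, 5} with landmark 4.
landmark_of: Dict[int, int] = {0: 1, 1: 1, 2: 1, 3: 4, 4: 4, 5: 4}


def path_cost(path: List[int], landmarks: Dict[int, int]) -> float:
    """Transmission cost of a navigation path in the toy landmark model."""
    decoder_buffer: Set[int] = set()  # landmarks already held by the client
    total = 0.0
    for mdu in path:
        lm = landmarks[mdu]
        if lm not in decoder_buffer:
            total += I_COST           # landmark is sent intra-coded once
            decoder_buffer.add(lm)
        if mdu != lm:
            total += P_COST           # target MDU is inter-coded from the landmark
    return total


def baseline_cost(path: List[int]) -> float:
    """Assumed baseline without landmarks: every MDU is sent intra-coded."""
    return I_COST * len(path)


example_path = [0, 2, 1, 3, 5]
print(path_cost(example_path, landmark_of))  # 28.0
print(baseline_cost(example_path))           # 50.0
```

In this toy setting, the two transitions that stay inside the first neighborhood cost at most one inter-coded MDU each, which mirrors the abstract's claim for transitions within a neighborhood when the landmark is already buffered. Choosing where the landmarks go and which extra P-MDUs to store is the optimization problem the paper addresses with TSVQ and branch-and-bound, and it is not modeled here.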