Channel-Wise Autoregressive Entropy Models for Learned Image Compression
In learning-based approaches to image compression, codecs are developed by optimizing a computational model to minimize a rate-distortion objective. Currently, the most effective learned image codecs take the form of an entropy-constrained autoencoder with an entropy model that uses both forward and...
Uloženo v:
| Vydáno v: | Proceedings - International Conference on Image Processing s. 3339 - 3343 |
|---|---|
| Hlavní autoři: | , |
| Médium: | Konferenční příspěvek |
| Jazyk: | angličtina |
| Vydáno: |
IEEE
01.10.2020
|
| Témata: | |
| ISSN: | 2381-8549 |
| On-line přístup: | Získat plný text |
| Tagy: |
Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
| Abstract | In learning-based approaches to image compression, codecs are developed by optimizing a computational model to minimize a rate-distortion objective. Currently, the most effective learned image codecs take the form of an entropy-constrained autoencoder with an entropy model that uses both forward and backward adaptation. Forward adaptation makes use of side information and can be efficiently integrated into a deep neural network. In contrast, backward adaptation typically makes predictions based on the causal context of each symbol, which requires serial processing that prevents efficient GPU / TPU utilization. We introduce two enhancements, channel-conditioning and latent residual prediction, that lead to network architectures with better rate-distortion performance than existing context-adaptive models while minimizing serial processing. Empirically, we see an average rate savings of 6.7% on the Kodak image set and 11.4% on the Tecnick image set compared to a context-adaptive baseline model. At low bit rates, where the improvements are most effective, our model saves up to 18% over the baseline and outperforms hand-engineered codecs like BPG by up to 25%. |
|---|---|
| AbstractList | In learning-based approaches to image compression, codecs are developed by optimizing a computational model to minimize a rate-distortion objective. Currently, the most effective learned image codecs take the form of an entropy-constrained autoencoder with an entropy model that uses both forward and backward adaptation. Forward adaptation makes use of side information and can be efficiently integrated into a deep neural network. In contrast, backward adaptation typically makes predictions based on the causal context of each symbol, which requires serial processing that prevents efficient GPU / TPU utilization. We introduce two enhancements, channel-conditioning and latent residual prediction, that lead to network architectures with better rate-distortion performance than existing context-adaptive models while minimizing serial processing. Empirically, we see an average rate savings of 6.7% on the Kodak image set and 11.4% on the Tecnick image set compared to a context-adaptive baseline model. At low bit rates, where the improvements are most effective, our model saves up to 18% over the baseline and outperforms hand-engineered codecs like BPG by up to 25%. |
| Author | Minnen, David Singh, Saurabh |
| Author_xml | – sequence: 1 givenname: David surname: Minnen fullname: Minnen, David organization: Google Research,Mountain View,CA,USA,94043 – sequence: 2 givenname: Saurabh surname: Singh fullname: Singh, Saurabh organization: Google Research,Mountain View,CA,USA,94043 |
| BookMark | eNotj9FKwzAUQKMouM19gSD5gdbcm7ZJHkeZW2GiD4qPI11uZ6VNRjKF_b2iezovhwNnyq588MTYPYgcQJiHpm5eCqGUzlGgyA0YYWR5weZGaVCooZKmrC7ZBKWGTJeFuWHTlD7Frw0SJmxdf1jvacje-0R88XUMkfaRUuq_iS_9MYbDiT8FR0PiXYh8QzZ6crwZ7Z54HcbDnxz8Lbvu7JBofuaMvT0uX-t1tnleNfVik_Uo5DHDXWkAnXakWqM7g85Z4dq2cmQLQYhOYAdAqgCwViPuCpKqVa51nWutkTN299_tiWh7iP1o42l7Hpc_A3FQhA |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IH CBEJK RIE RIO |
| DOI | 10.1109/ICIP40778.2020.9190935 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Applied Sciences |
| EISBN | 9781728163956 1728163951 |
| EISSN | 2381-8549 |
| EndPage | 3343 |
| ExternalDocumentID | 9190935 |
| Genre | orig-research |
| GroupedDBID | 29O 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IPLJI M43 OCL RIE RIL RIO RNS |
| ID | FETCH-LOGICAL-i203t-2c5912d8de7b98f92dda0dbb6dea40e22d02f11e7411aa822c4e37b7dbdfdba93 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 301 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000646178503090&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Wed Aug 27 02:34:00 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i203t-2c5912d8de7b98f92dda0dbb6dea40e22d02f11e7411aa822c4e37b7dbdfdba93 |
| PageCount | 5 |
| ParticipantIDs | ieee_primary_9190935 |
| PublicationCentury | 2000 |
| PublicationDate | 2020-Oct. |
| PublicationDateYYYYMMDD | 2020-10-01 |
| PublicationDate_xml | – month: 10 year: 2020 text: 2020-Oct. |
| PublicationDecade | 2020 |
| PublicationTitle | Proceedings - International Conference on Image Processing |
| PublicationTitleAbbrev | ICIP |
| PublicationYear | 2020 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0020131 |
| Score | 2.6345522 |
| Snippet | In learning-based approaches to image compression, codecs are developed by optimizing a computational model to minimize a rate-distortion objective. Currently,... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 3339 |
| SubjectTerms | Adaptation models Adaptive Entropy Modeling Bit rate Codecs Entropy Image coding Image Compression Neural Networks Predictive models Training |
| Title | Channel-Wise Autoregressive Entropy Models for Learned Image Compression |
| URI | https://ieeexplore.ieee.org/document/9190935 |
| WOSCitedRecordID | wos000646178503090&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09a8MwED2S0KFT2ial32joWCWWbFnWWEJDsoQMLc0WLOkMgdQJcVLov68km5RCl25GYAnO6N7d-b07gEdZJKlmQlMXLBuaqJhRnRtJVWa1w6NQ6Q_DJuRsli0Wat6Cp6MWBhED-QwH_jH8y7cbc_ClsqFy6KVi0Ya2lGmt1TomV75vTKMAZpEaTkfTuctVpGdv8WjQvPlrhEpAkHH3f2efQf9HikfmR5A5hxaWF9BtYkfS3MyqBxMvEyhxTd9XFZJn35kAQyrtvBl58XT07Rfxg8_WFXFxKgl9Vd0O0w_nUIj3CjUhtuzD2_jldTShzZQEuuJRvKfcCMW4zSxKrbJCcWvzyGqdWsyTCDm3ES8YQxc6sDx39jcJxlJLq21hda7iS-iUmxKvgERWZCzRiEbIREiZp0oXNk4EKofyQlxDzxtmua0bYSwbm9z8vXwLp972NfPtDjr73QHv4cR87lfV7iF8vW_MgJvz |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3Pa8IwFA7ODbaT23Ts93LYcdEkbUxzHKJY5sSDY96kaV5BcFWsDvbfL0mLY7DLbqGQFvLI-957_b73EHqUWdjVTGhig-WUhCpgRCepJCoy2uKRr_T7YRNyPI5mMzWpoae9FgYAPPkM2m7p_-WbVbpzpbKOsuilAnGADoXFUVqqtfbplescU2mAGVWduBdPbLYiHX-L03a199cQFY8hg8b_vn6KWj9iPDzZw8wZqkF-jhpV9Iiru1k00dAJBXJYkvdFAfjZ9SYAn0xbf4b7jpC-_sJu9NmywDZSxb6zqn1D_GFdCnZ-oaTE5i30NuhPe0NSzUkgC06DLeGpUIybyIDUKsoUNyahRuuugSSkwLmhPGMMbPDAksRaIA0hkFoabTKjExVcoHq-yuESYWpExEINkAoZCimTrtKZCUIByuK8EFeo6Q5mvi5bYcyrM7n--_EDOh5OX0fzUTx-uUEnzg4lD-4W1bebHdyho_Rzuyg2996S3yLwn0A |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+-+International+Conference+on+Image+Processing&rft.atitle=Channel-Wise+Autoregressive+Entropy+Models+for+Learned+Image+Compression&rft.au=Minnen%2C+David&rft.au=Singh%2C+Saurabh&rft.date=2020-10-01&rft.pub=IEEE&rft.eissn=2381-8549&rft.spage=3339&rft.epage=3343&rft_id=info:doi/10.1109%2FICIP40778.2020.9190935&rft.externalDocID=9190935 |