Lossless Coding of Multimodal Image Pairs Based on Image-To-Image Translation
Multimodal image coding often uses standard encoding algorithms, which do not exploit multimodality characteristics. This paper proposes a new cross-modality prediction approach for lossless coding of multimodal images, based on a Generative Adversarial Network (GAN). The GAN is added to the predict...
| Published in: | European Workshop on Visual Information Processing, pp. 1-6 |
|---|---|
| Main authors: | Joao O. Parracho, Lucas A. Thomaz, Luis M. N. Tavora, Pedro A. A. Assuncao, Sergio M. M. Faria |
| Format: | Conference paper |
| Language: | English |
| Published: | IEEE, 11 September 2022 |
| Subjects: | |
| ISSN: | 2471-8963 |
| Abstract | Multimodal image coding often uses standard encoding algorithms, which do not exploit multimodality characteristics. This paper proposes a new cross-modality prediction approach for lossless coding of multimodal images, based on a Generative Adversarial Network (GAN). The GAN is added to the prediction loop of the Versatile Video Coding (VVC) lossless encoder to translate an image into its counterpart modality. The synthesized image is then used as a reference for inter prediction, followed by further optimization that includes rescaling and brightness adjustment. A publicly available dataset of Positron Emission Tomography (PET) and Computed Tomography (CT) image pairs is used to assess the performance of the proposed multimodal lossless image coding framework. Compared with single-modality coding using the VVC standard, an average coding gain of 6.83% is achieved for the inter-coded PET images. |
|---|---|
| Author | Parracho, Joao O.; Assuncao, Pedro A. A.; Thomaz, Lucas A.; Faria, Sergio M. M.; Tavora, Luis M. N. |
| Author_xml | – sequence: 1 givenname: Joao O. surname: Parracho fullname: Parracho, Joao O. email: jparracho@co.it.pt organization: Instituto de Telecomunicações,Leiria,Portugal – sequence: 2 givenname: Lucas A. surname: Thomaz fullname: Thomaz, Lucas A. email: lucas.thomaz@co.it.pt organization: Instituto de Telecomunicações,Leiria,Portugal – sequence: 3 givenname: Luis M. N. surname: Tavora fullname: Tavora, Luis M. N. email: luis.tavora@co.it.pt organization: Instituto de Telecomunicações,Leiria,Portugal – sequence: 4 givenname: Pedro A. A. surname: Assuncao fullname: Assuncao, Pedro A. A. email: amado@co.it.pt organization: Instituto de Telecomunicações,Leiria,Portugal – sequence: 5 givenname: Sergio M. M. surname: Faria fullname: Faria, Sergio M. M. email: sergio.faria@co.it.pt organization: Instituto de Telecomunicações,Leiria,Portugal |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/EUVIP53989.2022.9922726 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Applied Sciences |
| EISBN | 9781665466233 1665466235 |
| EISSN | 2471-8963 |
| EndPage | 6 |
| ExternalDocumentID | 9922726 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IF 6IH 6IK 6IL 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI OCL RIE RIL RNS |
| ID | FETCH-LOGICAL-i203t-bf8fb83c5119954cd1f2c57b4c1b13ffedf150afe0c2d5d8797905e8fa5e7e4f3 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 1 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000886233300015&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Wed Aug 27 02:27:07 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | true |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i203t-bf8fb83c5119954cd1f2c57b4c1b13ffedf150afe0c2d5d8797905e8fa5e7e4f3 |
| PageCount | 6 |
| ParticipantIDs | ieee_primary_9922726 |
| PublicationCentury | 2000 |
| PublicationDate | 2022-Sept.-11 |
| PublicationDateYYYYMMDD | 2022-09-11 |
| PublicationDate_xml | – month: 09 year: 2022 text: 2022-Sept.-11 day: 11 |
| PublicationDecade | 2020 |
| PublicationTitle | European Workshop on Visual Information Processing |
| PublicationTitleAbbrev | EUVIP |
| PublicationYear | 2022 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| SSID | ssj0001611604 |
| Score | 2.2008965 |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | Computed tomography; Generative adversarial networks; Generative predictive coding; Image coding; Information processing; Learning based prediction; Lossless image coding; Multimodal image coding; Prediction algorithms; Versatile Video Coding; Video coding; Visualization |
| Title | Lossless Coding of Multimodal Image Pairs Based on Image-To-Image Translation |
| URI | https://ieeexplore.ieee.org/document/9922726 |
| WOSCitedRecordID | wos000886233300015&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=European+Workshop+on+Visual+Information+Processing&rft.atitle=Lossless+Coding+of+Multimodal+Image+Pairs+Based+on+Image-To-Image+Translation&rft.au=Parracho%2C+Joao+O.&rft.au=Thomaz%2C+Lucas+A.&rft.au=Tavora%2C+Luis+M.+N.&rft.au=Assuncao%2C+Pedro+A.+A.&rft.date=2022-09-11&rft.pub=IEEE&rft.eissn=2471-8963&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FEUVIP53989.2022.9922726&rft.externalDocID=9922726 |
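
The abstract mentions that the synthesized image is refined by rescaling and brightness adjustment before it serves as an inter-prediction reference. The record does not say how the authors implement that step, so the following is only a minimal sketch under stated assumptions: it fits a single gain and offset by least squares (a stand-in technique, not the paper's method), treats images as flat lists of integer pixels, and uses hypothetical function names. It also shows why the scheme stays lossless: the integer residual plus the adjusted reference reproduces the target exactly.

```python
from typing import List, Tuple


def fit_gain_offset(synth: List[int], target: List[int]) -> Tuple[float, float]:
    """Least-squares fit of target ~ gain * synth + offset (assumed model)."""
    n = len(synth)
    mean_s = sum(synth) / n
    mean_t = sum(target) / n
    var_s = sum((s - mean_s) ** 2 for s in synth)
    if var_s == 0:
        # Flat reference: no meaningful gain, only shift the brightness.
        return 1.0, mean_t - mean_s
    cov = sum((s - mean_s) * (t - mean_t) for s, t in zip(synth, target))
    gain = cov / var_s
    return gain, mean_t - gain * mean_s


def adjust_reference(synth: List[int], gain: float, offset: float,
                     max_value: int = 255) -> List[int]:
    """Apply gain and offset, then round and clip to the valid pixel range."""
    return [min(max_value, max(0, round(gain * s + offset))) for s in synth]


def residual(target: List[int], reference: List[int]) -> List[int]:
    """Integer prediction residual that a lossless coder would entropy-code."""
    return [t - r for t, r in zip(target, reference)]


def reconstruct(reference: List[int], res: List[int]) -> List[int]:
    """Decoder side: reference plus residual restores the target exactly."""
    return [r + d for r, d in zip(reference, res)]


# Toy data: the target is an exact gain/offset transform of the synthesized image.
synth = [10, 20, 30, 40]      # hypothetical GAN output (e.g. CT translated to PET)
target = [25, 45, 65, 85]     # hypothetical true PET pixels
gain, offset = fit_gain_offset(synth, target)
ref = adjust_reference(synth, gain, offset)
res = residual(target, ref)
assert reconstruct(ref, res) == target
```

In this toy case the adjusted reference matches the target, so the residual is all zeros. With real image pairs the residual would be nonzero, and the coding gain would depend on how much the adjusted reference reduces its energy.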