Cascaded Sliding-Window-Based Relativistic GAN Fusion for Perceptual and Consistent Video Super-Resolution

Uloženo v:
Podrobná bibliografie
Název: Cascaded Sliding-Window-Based Relativistic GAN Fusion for Perceptual and Consistent Video Super-Resolution
Autoři: Li, Dingyi
Přispěvatelé: School of Computer Science and Engineering Nanjing, Nanjing University of Science and Technology (NJUST), Zhongzhi Shi, Michael Witbrock, Qi Tian, TC 12
Zdroj: IFIP Advances in Information and Communication Technology ; 6th International Conference on Intelligence Science (ICIS) ; https://inria.hal.science/hal-05142880 ; 6th International Conference on Intelligence Science (ICIS), Oct 2024, Nanjing, China. pp.232-247, ⟨10.1007/978-3-031-71253-1_17⟩
Informace o vydavateli: CCSD
Springer Nature Switzerland
Rok vydání: 2024
Témata: Video Super-Resolution, Perceptual Quality, Temporal Consistency, Information Science, Intelligent Information Processing, [INFO]Computer Science [cs]
Geografické téma: Nanjing, China
Popis: Part 5: Perceptual Intelligence ; International audience ; Perceptual video super-resolution aims at converting low-resolution videos to visually appealing high-resolution ones. It may lead to temporal inconsistency due to the drastically changing outputs. In this paper, we propose cascaded sliding-window-based relativistic GAN (Generative Adversarial Network) fusion for perceptual and consistent video super-resolution (PC-VSR). Firstly, cascaded sliding-window-based relativistic GAN is designed to extract more useful information. It enlarges the temporal receptive field of sliding-window-based model in each step. It is able to enhance perceptual quality and compensate temporal consistency progressively and sufficiently. The trained separate refinement generator networks are fused into a final refinement generator. The final refinement generator can be calculated recursively at the testing stage. With our generator fusion, the parameter number is reduced and good quality is maintained. Extensive experimental results demonstrate that our approach outperforms state-of-the-art super-resolution methods in terms of perceptual quality. Our method also achieves good temporal consistency and per-pixel accuracy, compared with other perceptual approaches.
Druh dokumentu: conference object
Jazyk: English
DOI: 10.1007/978-3-031-71253-1_17
Dostupnost: https://inria.hal.science/hal-05142880
https://inria.hal.science/hal-05142880v1/document
https://inria.hal.science/hal-05142880v1/file/633143_1_En_17_Chapter.pdf
https://doi.org/10.1007/978-3-031-71253-1_17
Rights: http://creativecommons.org/licenses/by/
Přístupové číslo: edsbas.7833912C
Databáze: BASE
Popis
Abstrakt:Part 5: Perceptual Intelligence ; International audience ; Perceptual video super-resolution aims at converting low-resolution videos to visually appealing high-resolution ones. It may lead to temporal inconsistency due to the drastically changing outputs. In this paper, we propose cascaded sliding-window-based relativistic GAN (Generative Adversarial Network) fusion for perceptual and consistent video super-resolution (PC-VSR). Firstly, cascaded sliding-window-based relativistic GAN is designed to extract more useful information. It enlarges the temporal receptive field of sliding-window-based model in each step. It is able to enhance perceptual quality and compensate temporal consistency progressively and sufficiently. The trained separate refinement generator networks are fused into a final refinement generator. The final refinement generator can be calculated recursively at the testing stage. With our generator fusion, the parameter number is reduced and good quality is maintained. Extensive experimental results demonstrate that our approach outperforms state-of-the-art super-resolution methods in terms of perceptual quality. Our method also achieves good temporal consistency and per-pixel accuracy, compared with other perceptual approaches.
DOI:10.1007/978-3-031-71253-1_17