H.265/HEVC Decoding via Iterative Recovery From Incomplete Quantized Measurements

This letter is dedicated to improving the quality of video sequences compressed by the H.265/HEVC standard. We propose to consider this problem as a signal recovery from incomplete measurements taken in the HEVC transform domain. The recovery could be obtained via <inline-formula><tex-math...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE signal processing letters Jg. 32; S. 4149 - 4153
Hauptverfasser: Mahfod, Karam, Belyaev, Evgeny
Format: Journal Article
Sprache:Englisch
Veröffentlicht: New York IEEE 2025
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Schlagworte:
ISSN:1070-9908, 1558-2361
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This letter is dedicated to improving the quality of video sequences compressed by the H.265/HEVC standard. We propose to consider this problem as a signal recovery from incomplete measurements taken in the HEVC transform domain. The recovery could be obtained via <inline-formula><tex-math notation="LaTeX">l_{1}</tex-math></inline-formula>-minimization using the Iterative Shrinkage-Thresholding Algorithm (ISTA) well known in compressive sensing (CS) framework. However, in case of HEVC the ISTA updating step cannot be performed directly via matrix multiplications, since the sensing is performed using high-complex hybrid intra- and motion-compensated prediction in pixel domain, and the frame sensing matrix depends on the current and corresponding reference frames along with the encoder compression profile. In order to overcome these limitations, in this letter, we first propose to modify the HEVC decoder so that it also obtains the prediction values for each pixel taking into account all the coding modes within the input bitstream. Second, we propose to modify the ISTA updating step by introducing encoding and decoding operators applied instead of the matrix multiplication by sensing matrix and its transpose, respectively. These operators use the obtained prediction values, as well as prediction mode, motion vectors, quantization step, and transform type extracted for each coding unit from the input bitstream in order to replicate the encoding and the decoding process except the entropy coding. Herewith, the ISTA thresholding stage is performed by an image or video enhancement neural network. Experimental results show that the proposed approach provides up to 1 dB improvement in Peak Signal-to-Noise Ratio (PSNR) compared to the state-of-the-art approaches such as Recursive Fusion and Deformable Spatiotemporal Attention (RFDA), Spatio-Temporal Detail Information Retrieval (STDR) and Coding Priors-Guided Aggregation (CPGA).
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1070-9908
1558-2361
DOI:10.1109/LSP.2025.3624678