Perceptual color transformations for wide color gamut video coding

Bibliographic Details
Title: Perceptual color transformations for wide color gamut video coding
Patent Number: 10,257,526
Publication Date: April 09, 2019
Appl. No: 14/862,027
Application Filed: September 22, 2015
Abstract: A video coding pipeline is provided that can accommodate high dynamic range (HDR) and wide color gamut (WCG) content at a fixed bitrate. The video coding pipeline relies on separate chromaticity and luminance-specific transforms in order to process image content. Image content may be converted into a nearly perceptually uniform color space for coding in constant luminance. Moreover, chromaticity transforms are utilized which reduce coding errors in the chroma components (at the fixed bitrate) by enlarging the distribution of code words for compression.
Inventors: Disney Enterprises, Inc. (Burbank, CA, US); Eidgenoessische Technische Hochschule Zurich (ETH Zurich) (Zurich, CH)
Assignees: Disney Enterprises, Inc. (Burbank, CA, US)
Claim: 1. A computer-implemented method, comprising: converting an additive color model signal into a uniform color space signal having chromaticity components and a luminance component; transforming the chromaticity components by applying a chromaticity-specific transform to the chromaticity components, wherein the chromaticity-specific transform maps a main triangular representation of the chromaticity components in the uniform color space to a four-sided polygonal representation of the chromaticity components in the uniform color space; transforming the luminance component by applying a luminance-specific transform to the luminance component; quantizing the transformed chromaticity components and luminance component; and encoding the quantized chromaticity components and luminance component.
Claim: 2. The computer-implemented method of claim 1, wherein the additive color model signal comprises a red-green-blue (RGB) color model signal.
Claim: 3. The computer-implemented method of claim 1, wherein the uniform color space signal comprises a YUV color space signal.
Claim: 4. The computer-implemented method of claim 3, wherein mapping a main triangular representation of the chromaticity components in the uniform color space comprises subdividing the main triangular representation of the chromaticity components in the YUV color space into a mesh comprising a plurality of secondary triangles.
Claim: 5. The computer-implemented method of claim 4, wherein each of the vertices of the main triangular representation comprises one of the primaries of the additive color model.
Claim: 6. The computer-implemented method of claim 5, further comprising defining a midway vertex between two of the primaries of the additive color model.
Claim: 7. The computer-implemented method of claim 6, further comprising mapping each of the vertices of the main triangular representation and the defined midway vertex to a rectangular representation of the chromaticity components in the YUV color space.
Claim: 8. The computer-implemented method of claim 7, wherein the rectangular representation comprises a unit square.
Claim: 9. The computer-implemented method of claim 7, wherein respective vertices of each of the plurality of secondary triangles is mapped to the rectangular representation of the chromaticity components in the YUV color space using mean value coordinates.
Claim: 10. The computer-implemented method of claim 9, wherein values of each of the chromaticity components in the YUV color space are mapped to the rectangular representation thereof using barycentric coordinates corresponding to the values of each of the chromaticity components.
Claim: 11. The computer-implemented method of claim 6, further comprising mapping each of the vertices of the main triangular representation and the defined midway vertex to a four-sided polygonal representation of the chromaticity components in the YUV color space.
Claim: 12. The computer-implemented method of claim 11, further comprising applying at least one of a plurality of constraints affecting the mapping of each of the vertices of the main triangular representation, each of the vertices of each of the plurality of secondary triangles, and the defined midway vertex to the four-sided polygonal representation.
Claim: 13. The computer-implemented method of claim 12, further comprising adjusting characteristics of at least one of the plurality of constraints to modify the chromaticity-specific transform.
Claim: 14. The computer-implemented method of claim 1, wherein the luminance-specific transform comprises an opto-electrical transfer function.
Claim: 15. A non-transitory computer readable medium having computer executable program code embodied thereon, the computer executable program code configured to cause a processor to: decode quantized color and brightness components of an input bitstream carrying an image signal represented by a first color space model having brightness and color components; dequantize the quantized color and brightness components; apply a first inverse transform to the dequantized color component, wherein the first inverse transform maps a four-sided polygonal representation of the color component in the first color space model to a triangular representation of the color components in the first color space model; apply a second inverse transform to the dequantized brightness component separately from applying the first inverse transform to the color component; and convert the image signal represented by a first color space model into that represented by a second color space model, the second color space model being an additive color space model.
Claim: 16. The non-transitory computer readable medium of claim 15, wherein the first inverse transform comprises one of an inverse direct transform or an inverse least squares transform, wherein the color and brightness components have been previously transformed to increase the number of code words available to a video codec for compressing the image signal.
Claim: 17. The non-transitory computer readable medium of claim 16, wherein the first color space model comprises a YUV color space model, the second color space model comprises a red-green-blue (RGB) color space model, and the video codec comprises a high efficiency video coding (HEVC) codec.
Claim: 18. The non-transitory computer readable medium of claim 15, wherein the image signal comprises one of a standard dynamic range (SDR) image signal, a high dynamic range (HDR) image signal, a standard color gamut (SCG) image signal, or a wide color gamut (WCG) image signal.
Claim: 19. A system, comprising: a processor comprising: a converter, a first transformer, an opto-electrical transfer function, and an encoder; a non-transitory computer-readable medium coupled to the converter, the first transformer, the opto-electrical transfer function, and the encoder, the non-transitory computer-readable medium having computer executable program code embodied thereon, the computer executable program code configured to cause a processor to: convert an image signal represented by a first color space model into that represented by a second color space model having brightness and color components using the converter; apply a first transform to the color component using the first transformer, wherein the first transformer maps a triangular representation of the color components in the second color space model to a four-sided polygonal representation of the color components in the second color space model; apply a second transform to the brightness component using the encoder separately from applying the first transform to the color component; quantize the transformed color and brightness components using the quantizer; and encode the quantized color and brightness components and output a bitstream using the encoder.
Claim: 20. The system of claim 19, wherein the first transform is invertible.
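Claims 4 through 10 describe splitting the chromaticity triangle into secondary triangles, adding a midway vertex between two primaries, and carrying chromaticity values over to a unit square with barycentric coordinates. The Python sketch below illustrates that idea under simplifying assumptions: the mesh has only two secondary triangles, the mean value coordinates of claim 9 are skipped because there are no interior mesh vertices, and the primary coordinates, the choice of edge for the midway vertex, the corner assignment, and all function names are hypothetical. It is not the patented implementation.

```python
from typing import Sequence, Tuple

Point = Tuple[float, float]
Triangle = Tuple[Point, Point, Point]


def barycentric(p: Point, tri: Triangle) -> Tuple[float, float, float]:
    """Return the barycentric weights of p with respect to triangle tri."""
    (ax, ay), (bx, by), (cx, cy) = tri
    det = (by - cy) * (ax - cx) + (cx - bx) * (ay - cy)
    wa = ((by - cy) * (p[0] - cx) + (cx - bx) * (p[1] - cy)) / det
    wb = ((cy - ay) * (p[0] - cx) + (ax - cx) * (p[1] - cy)) / det
    return wa, wb, 1.0 - wa - wb


def map_point(p: Point, src: Sequence[Triangle], dst: Sequence[Triangle],
              eps: float = 1e-9) -> Point:
    """Find the source triangle containing p and reuse its weights in dst."""
    for s, d in zip(src, dst):
        w = barycentric(p, s)
        if min(w) >= -eps:  # p lies inside or on the edge of this triangle
            x = sum(wi * v[0] for wi, v in zip(w, d))
            y = sum(wi * v[1] for wi, v in zip(w, d))
            return (x, y)
    raise ValueError("point lies outside the mesh")


# Illustrative chromaticity coordinates for the primaries (assumed values).
R, G, B = (0.45, 0.52), (0.12, 0.56), (0.17, 0.16)
# Midway vertex between two primaries; the B-R edge is an arbitrary choice.
M = ((B[0] + R[0]) / 2, (B[1] + R[1]) / 2)

# Two secondary triangles covering the main triangle, and their images,
# which together tile the unit square (R, G, B, M go to the four corners).
TRI_MESH = [(R, G, M), (G, B, M)]
SQUARE_MESH = [((0.0, 0.0), (1.0, 0.0), (0.0, 1.0)),
               ((1.0, 0.0), (1.0, 1.0), (0.0, 1.0))]

sample = (0.295, 0.46)                               # a point inside the gamut
forward = map_point(sample, TRI_MESH, SQUARE_MESH)   # approximately (0.3, 0.4)
restored = map_point(forward, SQUARE_MESH, TRI_MESH)
```

Swapping the two mesh arguments gives the inverse direction, which corresponds to the square-to-triangle mapping on the decoder side in claim 15. A piecewise-linear map of this kind is invertible as long as no triangle is degenerate.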
Patent References Cited: 6100998 August 2000 Nagao
8783901 July 2014 Zoorob
9584811 February 2017 Su
2004/0096104 May 2004 Terekhov
2005/0184998 August 2005 Yang
2007/0086027 April 2007 Tin
2008/0259400 October 2008 Hersch
2009/0285475 November 2009 Suzuki
2010/0060911 March 2010 Marcu
2010/0086226 April 2010 Rozzi
2014/0105277 April 2014 Bivolarsky
2015/0103902 April 2015 Li
2015/0116740 April 2015 Miyanaga
2016/0309154 October 2016 Rusanovskyy
2016/0366449 December 2016 Stessen
2018/0007372 January 2018 Lasserre
1875398 December 2006
103563376 February 2014
103827956 May 2014
104486605 April 2015
Other References: Farbman et al., Coordinates for instant image cloning, ACM Trans. Graph., Jul. 2009, pp. 1-9, 28(3):67. cited by applicant
Ford, Adrian, Colour space conversions, Aug. 1998, pp. 1-31. cited by applicant
Klaus, Simon, Farbe im Digitalen Publizieren, Springer, 2008, English abstract translation. cited by applicant
Luthra et al., Draft requirements and explorations for HDR and WCG content distribution, International Organisation for Standardisation, Apr. 2014, pp. 1-22, Valencia, Spain. cited by applicant
Luthra et al., Test sequences and anchor generation for HDR and wide gamut content distribution, International Organisation for Standardisation, Jul. 2014, pp. 1-15, Sapporo, Japan. cited by applicant
Pedzisz, Maciej, Beyond BT.709, SMPTE Motion Imaging Journal, 2014, pp. 1-8, 123:18-25. cited by applicant
Schubert, E. Fred, Chapter 17: Colorimetry, http://www.ecse.rpi.edu/~schubert/Light-Emitting-Diodes-dot-org/chap17/chap17.htm, last accessed Mar. 21, 2016, pp. 1-15. cited by applicant
Sullivan, et al., Overview of the high efficiency video coding (HEVC) standard, IEEE Transactions on Circuits and Systems for Video Technology, Dec. 12, 2012, pp. 1649-1668, 22(12). cited by applicant
First Office Action in Chinese Patent Application No. 2016102814925, dated Aug. 3, 2018. cited by applicant
Assistant Examiner: Sullivan, Tyler W.
Primary Examiner: Czekaj, Dave
Attorney, Agent or Firm: Sheppard Mullin Richter & Hampton LLP
Accession Number: edspgr.10257526
Database: USPTO Patent Grants
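The abstract states that the chromaticity transforms reduce chroma coding errors by enlarging the distribution of code words. The Python sketch below shows the underlying effect in one dimension: values confined to a narrow sub-range use few code words when quantized directly, and more code words (with a smaller round-trip error) when first stretched to the full unit range. The 10-bit depth, the 0.2 to 0.4 range, and the linear stretch are assumptions chosen for illustration; the linear stretch merely stands in for the patent's chromaticity transform, and all names are hypothetical.

```python
def quantize(x: float, bits: int = 10) -> int:
    """Uniformly quantize x in [0, 1] to an integer code word."""
    levels = (1 << bits) - 1
    x = min(max(x, 0.0), 1.0)  # clamp to the representable range
    return int(x * levels + 0.5)


def dequantize(code: int, bits: int = 10) -> float:
    """Map a code word back to a value in [0, 1]."""
    return code / ((1 << bits) - 1)


# Assumed sub-range occupied by one chroma component before any transform.
LO, HI = 0.2, 0.4


def stretch(x: float) -> float:
    """Linearly expand [LO, HI] to [0, 1] (stand-in for the chroma transform)."""
    return (x - LO) / (HI - LO)


def unstretch(y: float) -> float:
    """Inverse of stretch."""
    return LO + y * (HI - LO)


samples = [LO + (HI - LO) * i / 9999 for i in range(10000)]

# Quantizing directly: only the code words inside [LO, HI] are ever used.
direct_codes = {quantize(s) for s in samples}
direct_err = max(abs(dequantize(quantize(s)) - s) for s in samples)

# Quantizing after the stretch: the same values spread over more code words.
stretched_codes = {quantize(stretch(s)) for s in samples}
stretched_err = max(
    abs(unstretch(dequantize(quantize(stretch(s)))) - s) for s in samples
)

print(len(direct_codes), len(stretched_codes))
print(direct_err, stretched_err)
```

In this toy setting the stretched path uses more distinct code words and shows a smaller worst-case reconstruction error, which is the intuition behind mapping the chromaticity triangle onto a larger region of the available code space.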