Efficient encoding/decoding of audio signals

Saved in:
Bibliographic Details
Title: Efficient encoding/decoding of audio signals
Patent Number: 9,280,980
Publication Date: March 08, 2016
Appl. No: 13/982515
Application Filed: February 09, 2011
Abstract: A method for encoding of an audio signal comprises performing (214) of a transform of the audio signal. An energy offset is selected (216) for each of the first subbands. An energy measure of a first reference band within a low band of an encoding of a synthesis signal is obtained (212). The first high band is encoded (220) by providing quantization indices representing a respective scalar quantization of a spectrum envelope in the first subbands of the first high band relative to the energy measure of the first reference band by use of the selected energy offset. An encoder apparatus comprises means for carrying out the steps of the method. Corresponding decoder methods and apparatuses are also described.
Inventors: Grancharov, Volodya (Solna, SE); Norvell, Erik (Stockholm, SE); Sverrisson, Sigurdur (Kungsängen, SE)
Assignees: Telefonaktiebolaget L M Ericsson (publ) (Stockholm, SE)
Claim: 1. A method, in an audio encoding device, for encoding of an audio signal, the method comprising: obtaining a low band synthesis signal of an encoding of said audio signal; obtaining a first energy measure of a first reference band within a low band (LB) in said low band synthesis signal; performing a transform of said audio signal into a transform domain; selecting an energy offset from a set of at least two predetermined energy offsets for each of a plurality of first subbands of a first high band (HB- 1) of said audio signal in said transform domain, said first high band (HB- 1) being situated at higher frequencies than said low band (LB); and encoding said first high band (HB- 1), wherein said encoding of said first high band (HB- 1) comprises providing a first set of quantization indices representing a respective scalar quantization of a spectrum envelope in said plurality of first subbands of said first high band (HB- 1) relative to said first energy measure, said first set of quantization indices being given with a respective said selected energy offset, and wherein said encoding of said first high band (HB- 1) further comprises providing a parameter defining the used energy offset; obtaining a second energy measure of a second reference band within said low band (LB) in said low band synthesis signal; and encoding a second high band (HB- 2) of said audio signal in said transform domain, said second high band (HB- 2) being situated in frequency between said low band (LB) and said first high band (HB- 1), and wherein said encoding of said second high band (HB- 2) comprises providing a second set of quantization indices representing a respective scalar quantization of a spectrum envelope in a plurality of second subbands of said second high band (HB- 2) relative to said second energy measure.
Claim: 2. The method of claim 1 , wherein said selecting an energy offset is dependent on a power distribution of said audio signal in a frequency domain.
Claim: 3. The method of claim 1 , wherein said selecting an energy offset is based on an open loop procedure, comprising determining a parameter characterizing a power distribution of said low band synthesis signal in a frequency domain, whereby said selecting is based on said determined parameter.
Claim: 4. The method of claim 1 , wherein said encoding in turn comprises providing one first set of said quantization indices for each predetermined energy offset range; and said selecting an energy offset in turn comprises: calculating a quantization error for each of said first sets of quantization indices; and selecting the first set of said quantization indices that gives the smallest quantization error.
Claim: 5. The method of claim 1 , wherein said transform encoding is a Modified Discrete Cosine Transform.
Claim: 6. The method of claim 1 , wherein a low frequency end of said first high band (HB- 1) is 8 kHz.
Claim: 7. The method of claim 1 , wherein a high frequency end of said first high band is (HB- 1) 11.6 kHz.
Claim: 8. The method according claim 1 , wherein said first high band (HB- 1) comprises five first subbands.
Claim: 9. The method of claim 1 , wherein said low band (LB) ranges from 0-6.4 kHz.
Claim: 10. The method of claim 1 , wherein said first reference band comprises entire said low band (LB).
Claim: 11. The method of claim 1 , wherein said first reference band ranges from 0-5.9 kHz.
Claim: 12. The method of claim 1 , wherein said low band synthesis signal is based on an encoding by a Code-Excited Linear Prediction encoder.
Claim: 13. The method of claim 1 , wherein the quantization indices of said second set of quantization indices are restricted in increased energy direction.
Claim: 14. The method of claim 13 , wherein said energy restriction of said quantization indices is reduced with increasing frequency of said second subbands.
Claim: 15. The method of claim 1 , wherein said second high band (HB- 2) ranges between 6.4 and 8 kHz.
Claim: 16. The method of claim 1 , wherein said second reference band ranges between 5.9 and 6.4 kHz.
Claim: 17. The method of claim 1 , wherein said second high band (HB- 2) comprises three second subbands.
Claim: 18. A method, in an audio decoding device, for decoding of an audio signal, the method comprising: receiving an encoding of said audio signal, said encoding representing a first set of quantization indices of a spectrum envelope in a plurality of first subbands of a first high band (HB- 1) of said audio signal, said first set of quantization indices representing energies relative to a first energy measure, said encoding further representing a parameter defining a used energy offset, wherein said encoding further represents a second set of quantization indices of a spectrum envelope in a plurality of second subbands of a second high band (HB- 2) of said audio signal, said second set of quantization indices representing energies relative to a second energy measure; obtaining a low band synthesis signal of an encoding of said audio signal; obtaining said first energy measure as an energy measure of a first reference band within a low band (LB) in said low band synthesis signal, said first high band (HB- 1) being situated at higher frequencies than said low band (LB) and said second high band (HB 2) being situated in frequency between said low band (LB) and said first high band (HB- 1); selecting an energy offset from a set of at least two predetermined energy offsets for each of said first subbands based on said parameter defining said used energy offset; reconstructing a signal in a transform domain by determining a spectrum envelope in said first high band (HB- 1) from said first set of quantization indices corresponding to said first subbands, by use of said selected energy offset and said first energy measure, for each of said first subbands of said first high band (HB- 1); and performing an inverse transform based on at least said reconstructed signal in said transform domain into said audio signal; obtaining said second energy measure as an energy measure of a second reference band within said low band (LB) in said low band synthesis signal; and wherein said reconstructing said signal in said transform domain further comprises determining a spectrum envelope in said second high band (HB- 1) from said second set of quantization indices corresponding to said second subbands by use of said second energy measure for each of said second subbands of said second high band (HB- 2).
Claim: 19. The method of claim 18 , wherein said transform encoding is a Modified Discrete Cosine Transform.
Claim: 20. The method of claim 18 , wherein a low frequency end of said first high (HB- 1) band is 8 kHz.
Claim: 21. The method of claim 18 , wherein a high frequency end of said first high (HB- 1) band is 11.6 kHz.
Claim: 22. The method of claim 18 , wherein said first high (HB- 1) band comprises five first subbands.
Claim: 23. The method of claim 18 , wherein said low band (LB) ranges from 0-6.4 kHz.
Claim: 24. The method of claim 18 , wherein said first reference band comprises entire said low band (LB).
Claim: 25. The method of claim 18 , wherein said first reference band ranges from 0-5.9 kHz.
Claim: 26. The method of claim 18 , wherein said low band synthesis signal is based on an encoding by a Code-Excited Linear Prediction encoder.
Claim: 27. The method of claim 18 , wherein the quantization indices of said second set of quantization indices are restricted in increased energy direction.
Claim: 28. The method of claim 27 , wherein said energy restriction of said quantization indices is reduced with increasing frequency of said second subbands.
Claim: 29. The method of claim 18 , wherein said second high band (HB- 2) ranges between 6.4 and 8 kHz.
Claim: 30. The method of claim 18 , wherein said second reference band ranges between 5.9 and 6.4 kHz.
Claim: 31. The method of claim 18 , wherein said second high band (HB- 2) comprises three second subbands.
Claim: 32. An encoder apparatus for encoding of an audio signal, comprising: a transform encoder configured to perform a transform of said audio signal into a transform domain; a selector configured to select an energy offset from a set of at least two predetermined energy offsets for each of a plurality of first subbands of a first high band (HB- 1) of said audio signal in said transform domain; a synthesizer configured to obtain a low band synthesis signal of an encoding of said audio signal; an energy reference block, connected to said synthesize and configured to obtain a first energy measure of a first reference band within a low band (LB) in said low band synthesis signal, said first high band HB- 1) being situated at higher frequencies than said low band (LB); an encoder block, connected to said selector and said energy reference block and configured to encode said first high band (HB- 1) so as to provide a first set of quantization indices representing a respective scalar quantization of a spectrum envelope in said plurality of first subbands of said first high band (HB- 1) relative to said first energy measure, said first set of quantization indices being given with a respective said selected energy offset, and so as to provide a parameter defining the used energy offset; wherein said energy reference block is further configured to obtain a second energy measure of a second reference band within said low band (LB) of said low band synthesis signal; wherein said encoder block is further configured to encode a second high band (HB- 2) of said audio signal in said transform domain, said second high band (HB- 2) being situated in frequency between said low band (LB) and said first high band (HB- 1), wherein said encoder block is configured to encode the second high band (HB- 2) so as to provide a second set of quantization indices representing a respective scalar quantization of a spectrum envelope in a plurality of second subbands of said second high band (HB- 2) relative to said second energy measure.
Claim: 33. The encoder apparatus of claim 32 , wherein said selector is configured to select an energy offset in dependence on a power distribution of said audio signal in a frequency domain.
Claim: 34. The encoder apparatus of claim 32 , wherein said selector is configured to determine a parameter characterizing a power distribution of said low band synthesis signal in a frequency domain and to select an energy offset based on said determined parameter.
Claim: 35. The encoder apparatus of claim 32 , wherein said encoder block is configured to provide one first set of said quantization indices for each predetermined energy offset range; and said selector is configured to receive said first sets of quantization indices for all predetermined energy offset ranges and comprises a calculation block configured to calculate a quantization error for each of said first sets of quantization indices and a selection block configured to select the first set of said quantization indices giving the smallest quantization error.
Claim: 36. The encoder apparatus of claim 32 , wherein said transform encoder is a Modified Discrete Cosine Transform encoder.
Claim: 37. A network node comprising the encoder apparatus of claim 32 .
Claim: 38. A decoder apparatus for decoding of an audio signal, the decoder apparatus comprising: an input block configured to receive an encoding of said audio signal, said encoding representing a first set of quantization indices of a spectrum envelope in a plurality of first subbands of a first high band (HB- 1) of said audio signal, said first set of quantization indices representing energies relative to a first energy measure, said encoding further representing a parameter defining a used energy offset, said encoding further representing a second set of quantization indices of a spectrum envelope in a plurality of second subbands of a second high band (HB- 2) of said audio signal, said second set of quantization indices representing energies relative to a second energy measure; a synthesizer configured to obtain a low band synthesis signal of an encoding of said audio signal; an energy reference block, connected to said synthesizer and configured to obtain said first energy measure as an energy measure of a first reference band within a low band (LB) in said low band synthesis signal, said first high band (HB- 1) being situated at higher frequencies than said low band (LB) and said second high band (HB- 2) being situated in frequency between said low band (LB) and said first high band (HB- 1); a selector, connected to said input block and configured to select an energy offset from a set of at least two predetermined energy offsets for each of said first subbands based on said parameter defining said used energy offset; a reconstruction block, connected to said input block, said selector, and said energy reference block, and configured to reconstruct a signal in a transform domain by determining a spectrum envelope in said first high band (HB- 1) from said first set of quantization indices corresponding to said first subbands, by use of said selected energy offset and said first energy measure, for each of said first subbands of said first high band (HB- 1); and an inverse transform decoder, connected to said reconstruction block and configured to perform an inverse transform based on at least said reconstructed signal in said transform domain into said audio signal; wherein said energy reference block is further configured to obtain said second energy measure as an energy measure of a second reference band within said low band (LB) of said low band synthesis signal; and wherein said reconstruction block is further configured to determine a spectrum envelope in said second high band (HB- 1) from said second set of quantization indices corresponding to said second subbands by use of said second energy measure for each of said second subbands of said second high band (HB- 2).
Claim: 39. The decoder apparatus of claim 38 , wherein said inverse transform decoder is a Modified Discrete Cosine inverse Transform decoder.
Claim: 40. A network node comprising the decoder apparatus of claim 38 .
Patent References Cited: 5621856 April 1997 Akagiri
7272556 September 2007 Aguilar et al.
8352279 January 2013 Gao
2004/0019492 January 2004 Tucker et al.
2005/0165611 July 2005 Mehrotra et al.
2008/0027718 January 2008 Krishnan et al.
2008/0052068 February 2008 Aguilar et al.
2010/0063812 March 2010 Gao
2010/0286990 November 2010 Biswas et al.
2011/0238425 September 2011 Neuendorf et al.
2013/0030797 January 2013 Gao
2013/0096930 April 2013 Neuendorf et al.
0878790 November 1998
01233496 September 1989
09172376 June 1997
2001343997 December 2001
2004027998 April 2004
2006048203 May 2006
2009059632 May 2009
2010042024 April 2010
Primary Examiner: Chawan, Vijay B
Attorney, Agent or Firm: Murphy, Bilak & Homiller, PLLC
Accession Number: edspgr.09280980
Database: USPTO Patent Grants
Description
Abstract:A method for encoding of an audio signal comprises performing (214) of a transform of the audio signal. An energy offset is selected (216) for each of the first subbands. An energy measure of a first reference band within a low band of an encoding of a synthesis signal is obtained (212). The first high band is encoded (220) by providing quantization indices representing a respective scalar quantization of a spectrum envelope in the first subbands of the first high band relative to the energy measure of the first reference band by use of the selected energy offset. An encoder apparatus comprises means for carrying out the steps of the method. Corresponding decoder methods and apparatuses are also described.