Apparatus and method for error concealment in low-delay unified speech and audio coding

Gespeichert in:
Bibliographische Detailangaben
Titel: Apparatus and method for error concealment in low-delay unified speech and audio coding
Patent Number: 9,384,739
Publikationsdatum: July 05, 2016
Appl. No: 13/966536
Application Filed: August 14, 2013
Abstract: An apparatus for generating spectral replacement values for an audio signal has a buffer unit for storing previous spectral values relating to a previously received error-free audio frame. Moreover, the apparatus includes a concealment frame generator for generating the spectral replacement values, when a current audio frame has not been received or is erroneous. The previously received error-free audio frame has filter information, the filter information having associated a filter stability value indicating a stability of a prediction filter. The concealment frame generator is adapted to generate the spectral replacement values based on the previous spectral values and based on the filter stability value.
Inventors: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V. (Munich, DE); Technische Universitaet Ilmenau (Ilmenau, DE)
Assignees: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V. (Munich, DE), Technische Universitaet Ilmenau (Ilmenau, DE)
Claim: 1. An apparatus for generating spectral replacement values for an audio signal comprising: a buffer unit for storing previous spectral values relating to a previously received error-free audio signal frame, and a concealment frame generator for generating the spectral replacement values when a current audio signal frame has not been received or is erroneous, wherein the previously received error-free audio signal frame comprises filter information, the filter information comprising an associated filter stability value indicating a stability of a prediction filter, and wherein the concealment frame generator is adapted to generate the spectral replacement values based on the previous spectral values and based on the filter stability value, wherein during a playback of the audio signal, at a receiver, the current audio signal frame that has not been received in time or is erroneous is replaced with a synthesized representation of the generated spectral replacement values, wherein the apparatus is implemented using a hardware apparatus or a computer or a combination of a hardware apparatus and a computer.
Claim: 2. The apparatus according to claim 1 , wherein the concealment frame generator is adapted to generate the spectral replacement values by randomly flipping the sign of the previous spectral values.
Claim: 3. The apparatus according to claim 1 , wherein the concealment frame generator is configured to generate the spectral replacement values by multiplying each of the previous spectral values by a first gain factor when the filter stability value comprises a first value, and by multiplying each of the previous spectral values by a second gain factor, being smaller than the first gain factor, when the filter stability value comprises a second value being smaller than the first value.
Claim: 4. The apparatus according to claim 1 , wherein the concealment frame generator is adapted to generate the spectral replacement values based on the filter stability value, wherein the previously received error-free audio signal frame comprises first predictive filter coefficients of the prediction filter, wherein a predecessor frame of the previously received error-free audio signal frame comprises second predictive filter coefficients, and wherein the filter stability value depends on the first predictive filter coefficients and on the second predictive filter coefficients.
Claim: 5. The apparatus according to claim 4 , wherein the concealment frame generator is adapted to determine the filter stability value based on the first predictive filter coefficients of the previously received error-free audio signal frame and based on the second predictive filter coefficients of the predecessor frame of the previously received error-free audio signal frame.
Claim: 6. The apparatus according to claim 4 , wherein the concealment frame generator is adapted to generate the spectral replacement values based on the filter stability value, wherein the filter stability value depends on a distance measure LSF dist , and wherein the distance measure LSF dist is defined by the formula: [mathematical expression included] wherein u+1 specifies a total number of the first predictive filter coefficients of the previously received error-free audio signal frame, and wherein u+1 also specifies a total number of the second predictive filter coefficients of the predecessor frame of the previously received error-free audio signal frame, wherein ƒ i specifies the i-th filter coefficient of the first predictive filter coefficients and wherein f i (p) specifies the i-th filter coefficient of the second predictive filter coefficients.
Claim: 7. The apparatus according to claim 1 , wherein the concealment frame generator is adapted to generate the spectral replacement values furthermore based on frame class information relating to the previously received error-free audio signal frame.
Claim: 8. The apparatus according to claim 7 , wherein the concealment frame generator is adapted to generate the spectral replacement values based on the frame class information, wherein the frame class information indicates that the previously received error-free audio signal frame is classified as “artificial onset”, “onset”, “voiced transition”, “unvoiced transition”, “unvoiced” or “voiced”.
Claim: 9. The apparatus according to claim 1 , wherein the concealment frame generator is adapted to generate the spectral replacement values furthermore based on a number of consecutive frames that did not arrive at a receiver or that were erroneous, since a last error-free audio signal frame had arrived at the receiver, wherein no other error-free audio signal frames arrived at the receiver since the last error-free audio signal frame had arrived at the receiver.
Claim: 10. The apparatus according to claim 9 , wherein the concealment frame generator is adapted to calculate a fade out factor, based on the filter stability value and based on the number of consecutive frames that did not arrive at the receiver or that were erroneous, and wherein the concealment frame generator is adapted to generate the spectral replacement values by multiplying the fade out factor by at least some of the previous spectral values, or by at least some values of a group of intermediate values, wherein each one of the intermediate values depends on at least one of the previous spectral values.
Claim: 11. The apparatus according to claim 1 , wherein the concealment frame generator is adapted to generate the spectral replacement values based on the previous spectral values, based on the filter stability value and also based on a prediction gain of a temporal noise shaping.
Claim: 12. An audio signal decoder comprising: an apparatus for decoding spectral audio signal values, and an apparatus for generating spectral replacement values according to claim 1 , wherein the apparatus for decoding spectral audio signal values is adapted to decode spectral values of an audio signal based on a previously received error-free audio signal frame, wherein the apparatus for decoding spectral audio signal values is furthermore adapted to store the spectral values of the audio signal in the buffer unit of the apparatus for generating spectral replacement values, and wherein the apparatus for generating spectral replacement values is adapted to generate the spectral replacement values based on the spectral values stored in the buffer unit, when a current audio signal frame has not been received or is erroneous, wherein the apparatus for decoding spectral audio signal values is implemented using a hardware apparatus or a computer or a combination of a hardware apparatus and a computer.
Claim: 13. An audio signal decoder, comprising: a decoding unit for generating first intermediate spectral values based on a received error-free audio signal frame, a temporal noise shaping unit for conducting temporal noise shaping on the first intermediate spectral values to acquire second intermediate spectral values, a prediction gain calculator for calculating a prediction gain of the temporal noise shaping depending on the first intermediate spectral values and depending on the second intermediate spectral values, an apparatus according to claim 1 , for generating spectral replacement values when a current audio signal frame has not been received or is erroneous, and a values selector for storing the first intermediate spectral values in the buffer unit of the apparatus for generating spectral replacement values, if the prediction gain is greater than or equal to a threshold value, or for storing the second intermediate spectral values in the buffer unit of the apparatus for generating spectral replacement values, if the prediction gain is smaller than the threshold value; wherein the decoding unit, the temporal noise shaping unit, the prediction gain calculator, and the values selector are implemented using a hardware apparatus or a computer or a combination of a hardware apparatus and a computer.
Claim: 14. An audio signal decoder, comprising: a first decoding module for generating generated spectral values based on a received error-free audio signal frame, an apparatus for generating spectral replacement values according to claim 1 , and a processing module for processing the generated spectral values by conducting temporal noise shaping, applying noise-filling or applying a global gain, to acquire spectral audio values of the decoded audio signal, wherein the apparatus for generating spectral replacement values is adapted to generate spectral replacement values and to feed them into the processing module, when a current frame has not been received or is erroneous; and wherein the first decoding module and the processing module are implemented using a hardware apparatus or a computer or a combination of a hardware apparatus and a computer.
Claim: 15. A method for generating spectral replacement values for an audio signal comprising: storing previous spectral values relating to a previously received error-free audio signal frame, and generating the spectral replacement values when a current audio signal frame has not been received or is erroneous, wherein the previously received error-free audio signal frame comprises filter information, the filter information comprising an associated filter stability value indicating a stability of a prediction filter defined by the filter information, wherein the spectral replacement values are generated based on the previous spectral values and based on the filter stability value, wherein during a playback of the audio signal, at a receiver, the current audio signal frame that has not been received in time or is erroneous is replaced with a synthesized representation of the generated spectral replacement values, wherein the method is performed using a hardware apparatus or a computer or a combination of a hardware apparatus and a computer.
Claim: 16. A non-transitory computer-readable medium comprising a computer program for implementing the method of claim 15 , when the computer program is executed by a computer or signal processor.
Patent References Cited: 5537510 July 1996 Kim
5598506 January 1997 Wigren
5606642 February 1997 Stautner et al.
5684920 November 1997 Iwakami et al.
5848391 December 1998 Bosi et al.
5953698 September 1999 Hayata
5960389 September 1999 Jarvinen et al.
6070137 May 2000 Bloebaum et al.
6134518 October 2000 Cohen et al.
6173257 January 2001 Gao
6236960 May 2001 Peng et al.
6317117 November 2001 Goff
6532443 March 2003 Nishiguchi et al.
6587817 July 2003 Vähätalo et al.
6636829 October 2003 Benyassine et al.
6757654 June 2004 Westerlund et al.
6879955 April 2005 Rao et al.
6969309 November 2005 Carpenter
7003448 February 2006 Lauber et al.
7124079 October 2006 Johansson et al.
7249014 July 2007 Kannan et al.
7280959 October 2007 Bessette
7343283 March 2008 Ashley et al.
7363218 April 2008 Jabri et al.
7519535 April 2009 Spindola
7519538 April 2009 Villemoes et al.
7536299 May 2009 Cheng et al.
7565286 July 2009 Gracie et al.
7587312 September 2009 Kim
7707034 April 2010 Sun et al.
7711563 May 2010 Chen
7788105 August 2010 Miseki
7809556 October 2010 Goto et al.
7873511 January 2011 Herre et al.
7877253 January 2011 Krishnan et al.
7933769 April 2011 Bessette
7979271 July 2011 Bessette
7987089 July 2011 Krishnan et al.
8045572 October 2011 Li et al.
8078458 December 2011 Zopf et al.
8121831 February 2012 Oh et al.
8160274 April 2012 Bongiovi
8239192 August 2012 Kovesi et al.
8255207 August 2012 Vaillancourt et al.
8255213 August 2012 Yoshida et al.
8364472 January 2013 Ehara
8428936 April 2013 Mittal et al.
8566106 October 2013 Salami et al.
8630862 January 2014 Geiger et al.
8630863 January 2014 Son et al.
8825496 September 2014 Setiawan et al.
8954321 February 2015 Beack et al.
2002/0111799 August 2002 Bernard
2002/0184009 December 2002 Heikkinen
2003/0009325 January 2003 Kirchherr et al.
2003/0033136 February 2003 Lee
2003/0046067 March 2003 Gradl
2003/0078771 April 2003 Jung et al.
2003/0225576 December 2003 Li et al.
2004/0046236 March 2004 Collier
2004/0093368 May 2004 Lee et al.
2004/0225505 November 2004 Andersen et al.
2005/0021338 January 2005 Graboi et al.
2005/0065785 March 2005 Bessette
2005/0091044 April 2005 Ramo et al.
2005/0096901 May 2005 Uvliden et al.
2005/0130321 June 2005 Nicholson et al.
2005/0131696 June 2005 Wang et al.
2005/0154584 July 2005 Jelinek et al.
2005/0165603 July 2005 Bessette et al.
2005/0192798 September 2005 Vainio et al.
2005/0240399 October 2005 Makinen
2005/0278171 December 2005 Suppappola et al.
2006/0116872 June 2006 Byun et al.
2006/0206334 September 2006 Kapoor et al.
2006/0271356 November 2006 Vos
2006/0293885 December 2006 Gournay et al.
2007/0016404 January 2007 Kim et al.
2007/0050189 March 2007 Cruz-Zeno et al.
2007/0100607 May 2007 Villemoes
2007/0147518 June 2007 Bessette
2007/0160218 July 2007 Jakka et al.
2007/0171931 July 2007 Manjunath et al.
2007/0172047 July 2007 Coughlan et al.
2007/0225971 September 2007 Bessette
2007/0253577 November 2007 Yen et al.
2007/0282603 December 2007 Bessette
2008/0010064 January 2008 Takeuchi et al.
2008/0015852 January 2008 Kruger et al.
2008/0027719 January 2008 Kirshnan et al.
2008/0052068 February 2008 Aguilar et al.
2008/0208599 August 2008 Rosec et al.
2008/0275580 November 2008 Andersen
2009/0024397 January 2009 Ryu et al.
2009/0076807 March 2009 Xu et al.
2009/0110208 April 2009 Choo et al.
2009/0204412 August 2009 Kovesi
2009/0226016 September 2009 Fitz et al.
2009/0319283 December 2009 Schnell et al.
2009/0326930 December 2009 Kawashima et al.
2010/0017200 January 2010 Oshikiri et al.
2010/0063812 March 2010 Gao
2010/0070270 March 2010 Gao
2010/0106496 April 2010 Morii et al.
2010/0138218 June 2010 Geiger
2010/0198586 August 2010 Edler et al.
2010/0217607 August 2010 Neuendorf et al.
2010/0268542 October 2010 Kim et al.
2011/0007827 January 2011 Virette
2011/0153333 June 2011 Bessette
2011/0161088 June 2011 Bayer et al.
2011/0178795 July 2011 Bayer et al.
2011/0218797 September 2011 Mittal et al.
2011/0218799 September 2011 Mittal et al.
2011/0218801 September 2011 Vary et al.
2011/0257979 October 2011 Gao
2011/0270616 November 2011 Garudadri et al.
2011/0311058 December 2011 Oh et al.
2012/0022881 January 2012 Geiger et al.
2012/0226505 September 2012 Lin et al.
2012/0271644 October 2012 Bessette et al.
2013/0332151 December 2013 Fuchs et al.
2007/312667 April 2008
2730239 January 2010
1274456 November 2000
1344067 April 2002
1381956 November 2002
1437747 August 2003
1539137 October 2004
1539138 October 2004
101351840 October 2006
101110214 January 2008
101366077 February 2009
101371295 February 2009
101379551 March 2009
101388210 March 2009
101425292 May 2009
101483043 July 2009
101488344 July 2009
101743587 June 2010
101770775 July 2010
102008015702 August 2009
0665530 August 1995
0673566 September 1995
0758123 February 1997
0784846 July 1997
0843301 May 1998
1120775 August 2001
1852851 July 2007
2107556 July 2009
2109098 October 2009
2144230 January 2010
2911228 July 2008
H08263098 October 1996
10039898 February 1998
H10214100 August 1998
H11502318 February 1999
H1198090 April 1999
2000357000 December 2000
2002-118517 April 2002
2003501925 January 2003
2003506764 February 2003
2004513381 April 2004
2004514182 May 2004
2005534950 November 2005
2006504123 February 2006
2007065636 March 2007
2007523388 August 2007
2007525707 September 2007
2007538282 December 2007
2008-15281 January 2008
2008513822 May 2008
2008261904 October 2008
2009508146 February 2009
2009075536 April 2009
2009522588 June 2009
2009-527773 July 2009
2010530084 September 2010
2010-538314 December 2010
2010539528 December 2010
2011501511 January 2011
2011527444 October 2011
1020040043278 May 2004
1020060025203 March 2006
1020070088276 August 2007
1020100059726 June 2010
1020100134709 April 2015
2169992 June 2001
2183034 May 2002
2003118444 December 2004
2004138289 June 2005
2296377 March 2007
2302665 July 2007
2312405 December 2007
2331933 August 2008
2335809 October 2008
2008126699 February 2010
2009107161 September 2010
2009118384 November 2010
200830277 October 1996
200943279 October 1998
201032218 September 1999
380246 January 2000
469423 December 2001
I253057 April 2006
200703234 January 2007
200729156 August 2007
200841743 October 2008
I313856 August 2009
200943792 October 2009
I316225 October 2009
I 320172 February 2010
201009810 March 2010
201009812 March 2010
I324762 May 2010
201027517 July 2010
201030735 August 2010
201040943 November 2010
I333643 November 2010
201103009 January 2011
92/22891 December 1992
95/10890 April 1995
95/30222 November 1995
96/29696 September 1996
00/31719 June 2000
0075919 December 2000
02/101724 December 2002
WO-02101722 December 2002
2004027368 April 2004
2005041169 May 2005
2005078706 August 2005
2005081231 September 2005
2005112003 November 2005
2006126844 November 2006
WO-2007051548 May 2007
2007083931 July 2007
WO-2007073604 July 2007
WO 2007073604 July 2007
WO2007/096552 August 2007
WO-2008013788 October 2008
2008/157296 December 2008
WO-2009029032 March 2009
2009077321 October 2009
2009121499 October 2009
2010/003563 January 2010
2010003491 January 2010
WO-2010/003491 January 2010
WO-2010040522 April 2010
2010059374 May 2010
2010/081892 July 2010
2010093224 August 2010
2011/006369 January 2011
WO-2010003532 February 2011
2011/048117 April 2011
WO-2011048094 April 2011
2011/147950 December 2011



































Other References: A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70, ITU-T Recommendation G.729—Annex B, International Telecommunication Union, Nov. 1996, pp. 1-16. cited by applicant
Martin, R., Spectral Subtraction Based on Minimum Statistics, Proceedings of European Signal Processing Conference (EUSIPCO), Edinburg, Scotland, Great Britain, Sep. 1994, pp. 1182-1185. cited by applicant
“Digital Cellular Telecommunications System (Phase 2+); Universal Mobile Telecommunications System (UMTS); LTE; Speech codec speech processing functions; Adaptive Multi-Rate-Wideband (AMR-)WB Speech Codec; Transcoding Functions (3GPP TS 26.190 version 9.0.0”, Technical Specification, European Telecommunications Standards Institute (ETSI) 650, Route Des Lucioles; F-06921 Sophia-Antipolis; France; No. V.9.0.0, Jan. 1, 2012, 54 Pages. cited by applicant
“IEEE Signal Processing Letters”, IEEE Signgal Processing Society. vol. 15. ISSN 1070-9908., 2008, 9 Pages. cited by applicant
“Information Technology—MPEG Audio Technologies—Part 3: Unified Speech and Audio Coding”, ISO/IEC JTC 1/SC 29 ISO/IEC DIS 23003-3, Feb. 9, 2011, 233 Pages. cited by applicant
“WD7 of USAC”, International Organisation for Standardisation Organisation Internationale De Normailisation. ISO/IEC JTC1/SC29/WG11. Coding of Moving Pictures and Audio. Dresden, Germany., Apr. 2010, 148 Pages. cited by applicant
3GPP, , “3rd Generation Partnership Project; Technical Specification Group Service and System Aspects. Audio Codec Processing Functions. Exteneded AMR Wideband Codec; Transcoding functions (Release 6).”, 3GPP Draft; 26.290, V2.0.0 3rd Generation Partnership Project (3GPP), Mobile Competence Centre; Valbonne, France., Sep. 2004, pp. 1-85. cited by applicant
Ashley, J et al., “Wideband Coding of Speech Using a Scalable Pulse Codebook”, 2000 IEEE Speech Coding Proceedings., Sep. 17, 2000, pp. 148-150. cited by applicant
Bessette, B et al., “The Adaptive Multirate Wideband Speech Codec (AMR-WB)”, IEEE Transactions on Speech and Audio Processing, IEEE Service Center. New York. vol. 10, No. 8., Nov. 1, 2002, pp. 620-636. cited by applicant
Bessette, B et al., “Universal Speech/Audio Coding Using Hybrid ACELP/TCX Techniques”, ICASSP 2005 Proceedings. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3,, Jan. 2005, pp. 301-304. cited by applicant
Bessette, B et al., “Wideband Speech and Audio Codec at 16/24/32 Kbit/S Using Hybrid ACELP/TCX Techniques”, 1999 IEEE Speech Coding Proceedings. Porvoo, Finland., Jun. 20, 1999, pp. 7-9. cited by applicant
Ferreira, A et al., “Combined Spectral Envelope Normalization and Subtraction of Sinusoidal Components in the ODFTand MDCT Frequency Domains”, 2001 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics., Oct. 2001, pp. 51-54. cited by applicant
Fischer, et al., “Enumeration Encoding and Decoding Algorithms for Pyramid Cubic Lattice and Trellis Codes”, IEEE Transactions on Information Theory. IEEE Press, USA, vol. 41, No. 6, Part 2., Nov. 1, 1995, pp. 2056-2061. cited by applicant
Hermansky, H et al., “Perceptual linear predictive (PLP) analysis of speech”, J. Acoust. Soc. Amer. 87 (4)., Apr. 1990, pp. 1738-1751. cited by applicant
Hofbauer, K et al., “Estimating Frequency and Amplitude of Sinusoids in Harmonic Signals—A Survey and the Use of Shifted Fourier Transforms”, Graz: Graz University of Technology; Graz University of Music and Dramatic Arts; Diploma Thesis, Apr. 2004, 111 pages. cited by applicant
Lanciani, C et al., “Subband-Domain Filtering of MPEG Audio Signals”, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Phoenix, , AZ, USA., Mar. 15, 1999, pp. 917-920. cited by applicant
Lauber, P et al., “Error Concealment for Compressed Digital Audio”, Presented at the 111th AES Convention. Paper 5460. New York, USA., Sep. 21, 2001, 12 Pages. cited by applicant
Lee, Ick Don et al., “A Voice Activity Detection Algorithm for Communication Systems with Dynamically Varying Background Acoustic Noise”, Dept. of Electrical Engineering, 1998 IEEE, May 18-21, 1998, pp. 1214-1218. cited by applicant
Makinen, J et al., “AMR-WB+: a New Audio Coding Standard for 3rd Generation Mobile Audio Services”, 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing. Philadelphia, PA, USA., Mar. 18, 2005, 1109-1112. cited by applicant
Motlicek, P et al., “Audio Coding Based on Long Temporal Contexts”, Rapport de recherche de l'IDIAP 06-30, Apr. 2006, pp. 1-10. cited by applicant
Neuendorf, M et al., “A Novel Scheme for Low Bitrate Unified Speech Audio Coding—MPEG RMO”, AES 126th Convention. Convention Paper 7713. Munich, Germany, May 1, 2009, 13 Pages. cited by applicant
Neuendorf, M et al., “Completion of Core Experiment on unification of USAC Windowing and Frame Transitions”, International Organisation for Standardisation Organisation Internationale De Normalisation ISO/IEC JTC1/SC29/WG11. Coding of Moving Pictures and Audio. Kyoto, Japan., Jan. 2010, 52 Pages. cited by applicant
Neuendorf, M et al., “Unified Speech and Audio Coding Scheme for High Quality at Low Bitrates”, ICASSP 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Piscataway, NJ, USA., Apr. 19, 2009, 4 Pages. cited by applicant
Patwardhan, P et al., “Effect of Voice Quality on Frequency-Warped Modeling of Vowel Spectra”, Speech Communication. vol. 48, No. 8., Aug. 2006, pp. 1009-1023. cited by applicant
Ryan, D et al., “Reflected Simplex Codebooks for Limited Feedback MIMO Beamforming”, IEEE. XP31506379A., Jun. 14-18, 2009, 6 Pages. cited by applicant
Sjoberg, J et al., “RTP Payload Format for the Extended Adaptive Multi-Rate Wideband (AMR-WB+) Audio Codec”, Memo. The Internet Society. Network Working Group. Category: Standards Track., Jan. 2006, pp. 1-38. cited by applicant
Terriberry, T et al., “A Multiply-Free Enumeration of Combinations with Replacement and Sign”, IEEE Signal Processing Letters. vol. 15, 2008, 11 Pages. cited by applicant
Terriberry, T et al., “Pulse Vector Coding”, Retrieved from the internet on Oct. 12, 2012. XP55025946. URL:http://people.xiph.org/˜tterribe/notes/cwrs.html, Dec. 1, 2007, 4 Pages. cited by applicant
Virette, D et al., “Enhanced Pulse Indexing CE for ACELP in USAC”, Organisation Internationale De Normalisation ISO/IEC JTC1/SC29/WG11. MPEG2012/M19305. Coding of Moving Pictures and Audio. Daegu, Korea., Jan. 2011, 13 Pages. cited by applicant
Wang, F et al., “Frequency Domain Adaptive Postfiltering for Enhancement of Noisy Speech”, Speech Communication 12. Elsevier Science Publishers. Amsterdam, North-Holland. vol. 12, No. 1., Mar. 1993, 41-56. cited by applicant
Waterschoot, T et al., “Comparison of Linear Prediction Models for Audio Signals”, EURASIP Journal on Audio, Speech, and Music Processing. vol. 21., Dec. 2008, 27 pages. cited by applicant
Zernicki, T et al., “Report on CE on Improved Tonal Component Coding in eSBR”, International Organisation for Standardisation Organisation Internationale De Normalisation ISO/IEC JTC1/SC29/WG11. Coding of Moving Pictures and Audio. Daegu, South Korea, Jan. 2011, 20 Pages. cited by applicant
3GPP, TS 26.290 Version 9.0.0; Digital cellular telecommunications system (Phase 2+); Universal Mobile Telecommunications System (UMTS); LTE; Audio codec processing functions; Extended Adaptive Multi-Rate-Wideband (AMR-WB+) codec; Transcoding functions (3GPP TS 26.290 version 9.0.0 release 9), Jan. 2010, Chapter 5.3, pp. 24-39. cited by applicant
Britanak, et al., “A new fast algorithm for the unified forward and inverse MDCT/MDST computation”, Signal Processing, vol. 82, Mar. 2002, pp. 433-459. cited by applicant
Herley, C. et al., “Tilings of the Time-Frequency Plane: Construction of Arbitrary Orthogonal Bases and Fast Tilings Algorithms”, IEEE Transactions on Signal Processing, vol. 41, No. 12, Dec. 1993, pp. 3341-3359. cited by applicant
Lefebvre, R. et al., “High quality coding of wideband audio signals using transform coded excitation (TCX)”, 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr. 19-22, 1994, pp. I/193 to I/196 (4 pages). cited by applicant
Primary Examiner: Shah, Paras D
Attorney, Agent or Firm: Glenn, Michael A.
Perkins Coie LLP
Dokumentencode: edspgr.09384739
Datenbank: USPTO Patent Grants