High Parameter Frequency Resolution Encoding Scheme for Spatial Audio Objects Using Stacked Sparse Autoencoder

Object-based audio systems have become common in recent years as they provide the flexibility for many auditory scenarios, such as virtual reality games, interactive theater, and spatial audio communication. For saving bitrates, multiple audio objects are compressed into a mono downmix signal and si...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Neural processing letters Jg. 54; H. 2; S. 817 - 833
Hauptverfasser: Wu, Yulin, Hu, Ruimin, Wang, Xiaochen, Hu, Chenhao, Ke, Shanfa
Format: Journal Article
Sprache:Englisch
Veröffentlicht: New York Springer US 01.04.2022
Springer Nature B.V
Schlagworte:
ISSN:1370-4621, 1573-773X
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Object-based audio systems have become common in recent years as they provide the flexibility for many auditory scenarios, such as virtual reality games, interactive theater, and spatial audio communication. For saving bitrates, multiple audio objects are compressed into a mono downmix signal and side information parameters. However, side information parameter frequency resolution is too low to cause aliasing distortion. To overcome this issue, a new encoding scheme based on high parameter frequency resolution (224 sub-bands in a frame) is proposed in this paper. The side information parameters with high frequency resolution are compressed and reconstructed via SSAE (stacked sparse autoencoder) neural network and further used for recovering the audio objects. The performance of the proposed method is compared against existing SAOC (spatial audio object coding) methods at the same overall bitrate, judged by both objective and subjective results. The evaluation shows that our approach can facilitate the high quality of spatial audio objects.
AbstractList Object-based audio systems have become common in recent years as they provide the flexibility for many auditory scenarios, such as virtual reality games, interactive theater, and spatial audio communication. For saving bitrates, multiple audio objects are compressed into a mono downmix signal and side information parameters. However, side information parameter frequency resolution is too low to cause aliasing distortion. To overcome this issue, a new encoding scheme based on high parameter frequency resolution (224 sub-bands in a frame) is proposed in this paper. The side information parameters with high frequency resolution are compressed and reconstructed via SSAE (stacked sparse autoencoder) neural network and further used for recovering the audio objects. The performance of the proposed method is compared against existing SAOC (spatial audio object coding) methods at the same overall bitrate, judged by both objective and subjective results. The evaluation shows that our approach can facilitate the high quality of spatial audio objects.
Author Ke, Shanfa
Hu, Ruimin
Wang, Xiaochen
Wu, Yulin
Hu, Chenhao
Author_xml – sequence: 1
  givenname: Yulin
  surname: Wu
  fullname: Wu, Yulin
  organization: National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Hubei Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University
– sequence: 2
  givenname: Ruimin
  orcidid: 0000-0002-5872-3872
  surname: Hu
  fullname: Hu, Ruimin
  email: hrm@whu.edu.cn
  organization: National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Hubei Key Laboratory of Multimedia and Network Communication Engineering, Wuhan University
– sequence: 3
  givenname: Xiaochen
  surname: Wang
  fullname: Wang, Xiaochen
  organization: National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Research Institute of Wuhan University in Shenzhen
– sequence: 4
  givenname: Chenhao
  surname: Hu
  fullname: Hu, Chenhao
  organization: National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University
– sequence: 5
  givenname: Shanfa
  surname: Ke
  fullname: Ke, Shanfa
  organization: National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University
BookMark eNp9kMtOAjEUhhuDiYC-gKsmrkd7GZiZJSEgJiQakcRd02nPwODQYlsWvL0dxsTEBatzFt93Lv8A9Yw1gNA9JY-UkOzJU0rGPCGMJrEZFUl-hfp0lPEky_hnL_Y8I0k6ZvQGDbzfERI1RvrILOrNFr9JJ_cQwOG5g-8jGHXC7-Btcwy1NXhmlNW12eCV2sIecGUdXh1kqGWDJ0ddW_xa7kAFj9f-jAWpvkC3jPMQkWChHQHuFl1XsvFw91uHaD2ffUwXyfL1-WU6WSaK0yIkFSGpplmVlpqqnFEiq1RzlsWiAFReMEVyXZZpVlLN04ixsiyUpiBVkZOSD9FDN_fgbPzHB7GzR2fiSsEKmvOUszGNFOso5az3DipxcPVeupOgRLS5ii5XEXMV51xFHqX8n6TqINucgpN1c1nlnerjHrMB93fVBesHZdKQkQ
CitedBy_id crossref_primary_10_1007_s11063_024_11691_0
crossref_primary_10_1016_j_eswa_2024_123323
Cites_doi 10.1016/j.neucom.2015.08.104
10.3390/app7121301
10.1109/TASLP.2015.2419980
10.17743/jaes.2014.0049
10.1109/CC.2017.8068762
10.1109/TMM.2011.2168197
10.1007/s11042-020-10339-0
10.1007/s11042-019-7409-7
10.1109/TASL.2010.2092429
10.1109/TSA.2005.858005
10.1109/JSTSP.2015.2411578
10.1109/TSA.2003.818108
10.1109/TASL.2012.2211015
10.1007/s11042-018-6273-1
10.1016/j.asoc.2020.107003
10.1109/TII.2019.2949355
10.1515/cait-2015-0074
10.1109/APSIPA.2015.7415383
10.1109/I4Tech48345.2020.9102675
10.1109/ICASSP.2013.6637653
10.1007/978-3-030-37731-1_54
10.1109/ICME51207.2021.9428471
10.1109/CVPR46437.2021.00496
10.1007/978-3-319-53547-0_31
10.1109/ICACSIS.2014.7065868
10.1109/ICASSP.2017.7952247
10.1109/ICME51207.2021.9428227
10.1109/ICME51207.2021.9428297
10.1109/ICME.2007.4285045
10.1109/ICASSP.2017.7952254
10.1109/ICASSP39728.2021.9414585
10.1109/ICASSP40776.2020.9054412
10.1007/978-3-030-67832-6_5
10.1007/s00521-021-05933-8
ContentType Journal Article
Copyright The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021
Copyright Springer Nature B.V. Apr 2022
Copyright_xml – notice: The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021
– notice: Copyright Springer Nature B.V. Apr 2022
DBID AAYXX
CITATION
8FE
8FG
AFKRA
ARAPS
AZQEC
BENPR
BGLVJ
CCPQU
DWQXO
GNUQQ
HCIFZ
JQ2
K7-
P5Z
P62
PHGZM
PHGZT
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
PSYQQ
DOI 10.1007/s11063-021-10659-8
DatabaseName CrossRef
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Central UK/Ireland
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials
ProQuest Central
Technology collection
ProQuest One Community College
ProQuest Central
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Premium
ProQuest One Academic
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic (retired)
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest One Psychology
DatabaseTitle CrossRef
Advanced Technologies & Aerospace Collection
ProQuest One Psychology
Computer Science Database
ProQuest Central Student
Technology Collection
ProQuest One Academic Middle East (New)
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
ProQuest One Academic Eastern Edition
SciTech Premium Collection
ProQuest One Community College
ProQuest Technology Collection
ProQuest SciTech Collection
ProQuest Central China
ProQuest Central
Advanced Technologies & Aerospace Database
ProQuest One Applied & Life Sciences
ProQuest One Academic UKI Edition
ProQuest Central Korea
ProQuest Central (New)
ProQuest One Academic
ProQuest One Academic (New)
DatabaseTitleList Advanced Technologies & Aerospace Collection

Database_xml – sequence: 1
  dbid: P5Z
  name: Advanced Technologies & Aerospace Database
  url: https://search.proquest.com/hightechjournals
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1573-773X
EndPage 833
ExternalDocumentID 10_1007_s11063_021_10659_8
GrantInformation_xml – fundername: National Nature Science Foundation of China
  grantid: U1803262
– fundername: Basic Research Project of Science and Technology Plan of Shenzhen
  grantid: JCYJ20170818143246278
– fundername: National Key R&D Program of China
  grantid: 2017YFB1002803
– fundername: National Key R&D Program of China
  grantid: 2017YFB1002803; 2017YFB1002803
GroupedDBID -4Z
-5F
-5G
-BR
-EM
-Y2
-~C
.86
.DC
.VR
06D
0R~
0VY
123
1N0
1SB
2.D
203
28-
29N
2J2
2JN
2JY
2KG
2LR
2P1
2VQ
2~H
30V
4.4
406
408
409
40D
40E
53G
5QI
5VS
67Z
6NX
8TC
8UJ
95-
95.
95~
96X
AAAVM
AABHQ
AAHNG
AAIAL
AAJKR
AAJSJ
AAKKN
AANZL
AAOBN
AARHV
AARTL
AATVU
AAUYE
AAWCG
AAYIU
AAYOK
AAYQN
AAYTO
AAYZH
ABAKF
ABBBX
ABBXA
ABDZT
ABECU
ABEEZ
ABFTD
ABFTV
ABHLI
ABHQN
ABJNI
ABJOX
ABKCH
ABKTR
ABMNI
ABMOR
ABMQK
ABNWP
ABQBU
ABQSL
ABSXP
ABTEG
ABTHY
ABTKH
ABTMW
ABULA
ABWNU
ABXPI
ACACY
ACBXY
ACGFS
ACHSB
ACHXU
ACKNC
ACMDZ
ACMLO
ACOKC
ACOMO
ACSNA
ACULB
ACZOJ
ADHHG
ADHIR
ADIMF
ADINQ
ADKNI
ADKPE
ADRFC
ADTPH
ADURQ
ADYFF
ADZKW
AEBTG
AEFIE
AEFQL
AEGAL
AEGNC
AEJHL
AEJRE
AEKMD
AENEX
AEOHA
AEPYU
AESKC
AETLH
AEVLU
AEXYK
AFBBN
AFEXP
AFGCZ
AFGXO
AFKRA
AFLOW
AFQWF
AFWTZ
AFZKB
AGAYW
AGDGC
AGGDS
AGJBK
AGMZJ
AGQEE
AGQMX
AGRTI
AGWIL
AGWZB
AGYKE
AHAVH
AHBYD
AHKAY
AHSBF
AHYZX
AIAKS
AIIXL
AILAN
AITGF
AJBLW
AJRNO
AJZVZ
ALMA_UNASSIGNED_HOLDINGS
ALWAN
AMKLP
AMXSW
AMYLF
AMYQR
AOCGG
ARAPS
ARMRJ
ASPBG
AVWKF
AXYYD
AYJHY
AZFZN
B-.
BA0
BBWZM
BDATZ
BENPR
BGLVJ
BGNMA
C24
C6C
CAG
CCPQU
COF
CS3
CSCUP
DDRTE
DL5
DNIVK
DPUIP
DU5
EBLON
EBS
EIOEI
EJD
ESBYG
FEDTE
FERAY
FFXSO
FIGPU
FINBP
FNLPD
FRRFC
FSGXE
FWDCC
GGCAI
GGRSB
GJIRD
GNWQR
GQ6
GQ7
GQ8
GXS
H13
HCIFZ
HF~
HG5
HG6
HMJXF
HQYDN
HRMNR
HVGLF
HZ~
I09
IHE
IJ-
IKXTQ
ITM
IWAJR
IXC
IXE
IZIGR
IZQ
I~X
I~Z
J-C
J0Z
JBSCW
JCJTX
JZLTJ
K7-
KDC
KOV
KOW
LAK
LLZTM
M4Y
MA-
N2Q
NB0
NDZJH
NPVJJ
NQJWS
NU0
O9-
O93
O9G
O9I
O9J
OAM
OVD
P19
P2P
P9O
PF0
PSYQQ
PT5
QOK
QOS
R4E
R89
R9I
RHV
RNI
RNS
ROL
RPX
RSV
RZC
RZE
RZK
S16
S1Z
S26
S27
S28
S3B
SAP
SCJ
SCLPG
SDH
SDM
SHX
SISQX
SNE
SNPRN
SNX
SOHCF
SOJ
SPH
SPISZ
SRMVM
SSLCW
STPWE
SZN
T13
T16
TEORI
TSG
TSK
TSV
TUC
U2A
UG4
UOJIU
UTJUX
UZXMN
VC2
VFIZW
W23
W48
WK8
YLTOR
Z45
Z7R
Z7X
Z81
Z83
Z88
Z8M
Z8R
Z8U
Z8W
Z92
ZMTXR
~EX
77I
AASML
AAYXX
ABDBE
ABFSG
ACSTC
ADHKG
AEZWR
AFFHD
AFHIU
AGQPQ
AHPBZ
AHWEU
AIXLP
AYFIA
CITATION
PHGZM
PHGZT
PQGLB
8FE
8FG
AZQEC
DWQXO
GNUQQ
JQ2
P62
PKEHL
PQEST
PQQKQ
PQUKI
PRINS
ID FETCH-LOGICAL-c319t-f004d17f4bd1c8210af4d327af4ceec892c08dbb47b1d344bd2bb9cd1eac980b3
IEDL.DBID K7-
ISICitedReferencesCount 3
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000708365700001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1370-4621
IngestDate Sat Oct 18 23:14:53 EDT 2025
Tue Nov 18 22:24:01 EST 2025
Sat Nov 29 02:27:52 EST 2025
Fri Feb 21 02:46:45 EST 2025
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords SSAE
Frequency resolution
SAOC
Aliasing distortion
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c319t-f004d17f4bd1c8210af4d327af4ceec892c08dbb47b1d344bd2bb9cd1eac980b3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ORCID 0000-0002-5872-3872
PQID 2918343261
PQPubID 2043838
PageCount 17
ParticipantIDs proquest_journals_2918343261
crossref_primary_10_1007_s11063_021_10659_8
crossref_citationtrail_10_1007_s11063_021_10659_8
springer_journals_10_1007_s11063_021_10659_8
PublicationCentury 2000
PublicationDate 20220400
2022-04-00
20220401
PublicationDateYYYYMMDD 2022-04-01
PublicationDate_xml – month: 4
  year: 2022
  text: 20220400
PublicationDecade 2020
PublicationPlace New York
PublicationPlace_xml – name: New York
– name: Dordrecht
PublicationTitle Neural processing letters
PublicationTitleAbbrev Neural Process Lett
PublicationYear 2022
Publisher Springer US
Springer Nature B.V
Publisher_xml – name: Springer US
– name: Springer Nature B.V
References Wang, Yao, Zhao (CR34) 2016; 184
CR18
CR17
CR39
CR16
Hu, Wang, Hu, Wu (CR19) 2021; 80
CR38
CR37
Bosi, Brandenburg, Quackenbush, Fielder, Akagiri, Fuchs, Dietz, Herre, Davidson, Oikawa (CR5) 1997; 45
Li, Lei, Wang, Jiang, Liu (CR26) 2021; 101
CR12
CR10
CR32
CR30
Zheng, Ritz, Xi (CR45) 2013; 21
Faller, Baumgarte (CR9) 2003; 11
Wu, Hu, Wang, Ke, Wang (CR35) 2017; 14
Yang, Jia, Wang, Zhang (CR41) 2015; 15
Herre, Hilpert, Kuntz, Plogsties (CR15) 2015; 62
Ando (CR1) 2011; 19
Herre, Purnhagen, Koppens, Hellmuth, Engdegard, Hilpert, Villemoes, Terentiv, Falch, Holzer, Valero, Resch, Mundt, Oh (CR13) 2012; 60
CR2
CR3
CR6
CR8
Jia, Yang, Bao, Zheng, Ritz (CR22) 2015; 23
Bosi, Goldberg (CR4) 2012
CR7
CR29
CR28
CR27
CR24
CR46
Herre, Hilpert, Kuntz, Plogsties (CR14) 2015; 9
Shi, Luo, He, Li, Liu, Li (CR31) 2020; 16
CR44
CR21
CR43
CR20
CR42
CR40
Gnouma, Ladjailia, Ejbali, Zaied (CR11) 2019; 78
Wu, Hu, Wang, Ke (CR36) 2019; 78
Kim, Seo, Beack, Kang, Hahn (CR25) 2011; 13
Jia, Zhang, Bao, Zheng (CR23) 2017; 7
Vincent, Gribonval, Févotte (CR33) 2006; 14
E Vincent (10659_CR33) 2006; 14
Y Li (10659_CR26) 2021; 101
Y Wang (10659_CR34) 2016; 184
T Wu (10659_CR35) 2017; 14
T Wu (10659_CR36) 2019; 78
C Hu (10659_CR19) 2021; 80
10659_CR18
A Ando (10659_CR1) 2011; 19
10659_CR17
10659_CR39
10659_CR16
10659_CR38
J Herre (10659_CR14) 2015; 9
Z Yang (10659_CR41) 2015; 15
M Bosi (10659_CR4) 2012
M Jia (10659_CR23) 2017; 7
10659_CR10
10659_CR32
M Bosi (10659_CR5) 1997; 45
10659_CR30
10659_CR37
10659_CR2
10659_CR12
10659_CR6
10659_CR3
10659_CR40
10659_CR7
10659_CR8
M Jia (10659_CR22) 2015; 23
J Herre (10659_CR15) 2015; 62
J Herre (10659_CR13) 2012; 60
M Gnouma (10659_CR11) 2019; 78
C Shi (10659_CR31) 2020; 16
K Kim (10659_CR25) 2011; 13
10659_CR29
10659_CR28
10659_CR27
10659_CR44
10659_CR21
10659_CR43
10659_CR20
10659_CR42
C Faller (10659_CR9) 2003; 11
10659_CR24
X Zheng (10659_CR45) 2013; 21
10659_CR46
References_xml – year: 2012
  ident: CR4
  publication-title: Introduction to digital audio coding and standards
– ident: CR18
– ident: CR43
– volume: 184
  start-page: 232
  year: 2016
  end-page: 242
  ident: CR34
  article-title: Auto-encoder based dimensionality reduction
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2015.08.104
– ident: CR39
– ident: CR2
– ident: CR16
– ident: CR37
– volume: 7
  start-page: 1301
  issue: 12
  year: 2017
  end-page: 1312
  ident: CR23
  article-title: A psychoacoustic-based multiple audio object coding approach via intra-object sparsity
  publication-title: Appl Sci
  doi: 10.3390/app7121301
– ident: CR12
– ident: CR30
– volume: 23
  start-page: 1082
  issue: 6
  year: 2015
  end-page: 1095
  ident: CR22
  article-title: Encoding multiple audio objects using intra-object sparsity
  publication-title: IEEE/ACM Transactions Audio Speech Lang Process
  doi: 10.1109/TASLP.2015.2419980
– ident: CR10
– ident: CR6
– ident: CR29
– ident: CR8
– volume: 62
  start-page: 821
  issue: 12
  year: 2015
  end-page: 830
  ident: CR15
  article-title: MPEG-H audio-the new standard for universal spatial/3D audio coding
  publication-title: Audio Eng Soc (AES)
  doi: 10.17743/jaes.2014.0049
– volume: 14
  start-page: 32
  issue: 9
  year: 2017
  end-page: 41
  ident: CR35
  article-title: High quality audio object coding framework based on non-negative matrix factorization
  publication-title: China Commun
  doi: 10.1109/CC.2017.8068762
– ident: CR40
– volume: 60
  start-page: 655
  issue: 9
  year: 2012
  end-page: 673
  ident: CR13
  article-title: MPEG spatial audio object coding-The ISO/MPEG standard for efficient coding of interactive audio scenes
  publication-title: Audio Eng Soc (AES)
– volume: 13
  start-page: 1208
  issue: 6
  year: 2011
  end-page: 1216
  ident: CR25
  article-title: Spatial audio object coding with two-step coding structure for interactive audio service
  publication-title: IEEE Transactions Multimedia
  doi: 10.1109/TMM.2011.2168197
– ident: CR27
– ident: CR42
– ident: CR21
– ident: CR46
– volume: 80
  start-page: 18717
  issue: 12
  year: 2021
  end-page: 18733
  ident: CR19
  article-title: Audio object coding based on n-step residual compensating
  publication-title: Multimedia Tools Appl
  doi: 10.1007/s11042-020-10339-0
– volume: 78
  start-page: 20723
  issue: 15
  year: 2019
  end-page: 20738
  ident: CR36
  article-title: Audio object coding based on optimal parameter frequency resolution
  publication-title: Multimedia Tools Appl
  doi: 10.1007/s11042-019-7409-7
– ident: CR44
– volume: 19
  start-page: 1467
  issue: 6
  year: 2011
  end-page: 1475
  ident: CR1
  article-title: Conversion of multichannel sound signal maintaining physical properties of sound in reproduced sound field
  publication-title: IEEE Transactions Audio Speech Lang Process
  doi: 10.1109/TASL.2010.2092429
– volume: 14
  start-page: 1462
  issue: 4
  year: 2006
  end-page: 1469
  ident: CR33
  article-title: Performance measurement in blind audio source separation
  publication-title: IEEE Transactions Audio Speech Lang Process
  doi: 10.1109/TSA.2005.858005
– volume: 9
  start-page: 770
  issue: 5
  year: 2015
  end-page: 779
  ident: CR14
  article-title: MPEG-H 3D audio-the new standard for coding of immersive spatial audio
  publication-title: IEEE J Sel Topics Signal Process
  doi: 10.1109/JSTSP.2015.2411578
– volume: 11
  start-page: 520
  issue: 6
  year: 2003
  end-page: 531
  ident: CR9
  article-title: Binaural cue coding-part II: schemes and applications
  publication-title: IEEE Transactions Speech Audio Process
  doi: 10.1109/TSA.2003.818108
– ident: CR3
– ident: CR38
– volume: 21
  start-page: 29
  issue: 1
  year: 2013
  end-page: 38
  ident: CR45
  article-title: Encoding navigable speech sources: a psychoacoustic-based analysis-by-synthesis approach
  publication-title: IEEE Transactions Audio Speech Lang Process
  doi: 10.1109/TASL.2012.2211015
– ident: CR17
– volume: 45
  start-page: 789
  issue: 10
  year: 1997
  end-page: 814
  ident: CR5
  article-title: ISO/IEC MPEG-2 advanced audio coding
  publication-title: Audio Eng Soc (AES)
– ident: CR32
– volume: 78
  start-page: 2157
  issue: 2
  year: 2019
  end-page: 2179
  ident: CR11
  article-title: Stacked sparse autoencoder and history of binary motion image for human activity recognition
  publication-title: Multimedia Tools Appl
  doi: 10.1007/s11042-018-6273-1
– volume: 101
  start-page: 107003
  year: 2021
  ident: CR26
  article-title: Embedded stacked group sparse autoencoder ensemble with L1 regularization and manifold reduction
  publication-title: Appl Soft Comput
  doi: 10.1016/j.asoc.2020.107003
– ident: CR7
– volume: 16
  start-page: 5150
  issue: 8
  year: 2020
  end-page: 5159
  ident: CR31
  article-title: Tool wear prediction via multidimensional stacked sparse autoencoders with feature fusion
  publication-title: IEEE Transactions Ind Inform
  doi: 10.1109/TII.2019.2949355
– ident: CR28
– ident: CR24
– volume: 15
  start-page: 135
  issue: 6
  year: 2015
  end-page: 146
  ident: CR41
  article-title: Multi-stage encoding scheme for multiple audio objects using compressed sensing
  publication-title: Cybern Information Technol
  doi: 10.1515/cait-2015-0074
– ident: CR20
– volume: 60
  start-page: 655
  issue: 9
  year: 2012
  ident: 10659_CR13
  publication-title: Audio Eng Soc (AES)
– ident: 10659_CR40
  doi: 10.1109/APSIPA.2015.7415383
– ident: 10659_CR24
  doi: 10.1109/I4Tech48345.2020.9102675
– ident: 10659_CR46
  doi: 10.1109/ICASSP.2013.6637653
– ident: 10659_CR6
– volume: 16
  start-page: 5150
  issue: 8
  year: 2020
  ident: 10659_CR31
  publication-title: IEEE Transactions Ind Inform
  doi: 10.1109/TII.2019.2949355
– volume: 62
  start-page: 821
  issue: 12
  year: 2015
  ident: 10659_CR15
  publication-title: Audio Eng Soc (AES)
  doi: 10.17743/jaes.2014.0049
– volume: 14
  start-page: 1462
  issue: 4
  year: 2006
  ident: 10659_CR33
  publication-title: IEEE Transactions Audio Speech Lang Process
  doi: 10.1109/TSA.2005.858005
– ident: 10659_CR10
– volume: 23
  start-page: 1082
  issue: 6
  year: 2015
  ident: 10659_CR22
  publication-title: IEEE/ACM Transactions Audio Speech Lang Process
  doi: 10.1109/TASLP.2015.2419980
– ident: 10659_CR16
  doi: 10.1007/978-3-030-37731-1_54
– ident: 10659_CR18
  doi: 10.1109/ICME51207.2021.9428471
– ident: 10659_CR39
  doi: 10.1109/CVPR46437.2021.00496
– volume: 11
  start-page: 520
  issue: 6
  year: 2003
  ident: 10659_CR9
  publication-title: IEEE Transactions Speech Audio Process
  doi: 10.1109/TSA.2003.818108
– volume: 7
  start-page: 1301
  issue: 12
  year: 2017
  ident: 10659_CR23
  publication-title: Appl Sci
  doi: 10.3390/app7121301
– volume: 21
  start-page: 29
  issue: 1
  year: 2013
  ident: 10659_CR45
  publication-title: IEEE Transactions Audio Speech Lang Process
  doi: 10.1109/TASL.2012.2211015
– ident: 10659_CR2
  doi: 10.1007/978-3-319-53547-0_31
– ident: 10659_CR8
  doi: 10.1109/ICACSIS.2014.7065868
– volume: 78
  start-page: 20723
  issue: 15
  year: 2019
  ident: 10659_CR36
  publication-title: Multimedia Tools Appl
  doi: 10.1007/s11042-019-7409-7
– ident: 10659_CR32
  doi: 10.1109/ICASSP.2017.7952247
– volume-title: Introduction to digital audio coding and standards
  year: 2012
  ident: 10659_CR4
– ident: 10659_CR44
– ident: 10659_CR29
– volume: 14
  start-page: 32
  issue: 9
  year: 2017
  ident: 10659_CR35
  publication-title: China Commun
  doi: 10.1109/CC.2017.8068762
– volume: 19
  start-page: 1467
  issue: 6
  year: 2011
  ident: 10659_CR1
  publication-title: IEEE Transactions Audio Speech Lang Process
  doi: 10.1109/TASL.2010.2092429
– volume: 45
  start-page: 789
  issue: 10
  year: 1997
  ident: 10659_CR5
  publication-title: Audio Eng Soc (AES)
– ident: 10659_CR27
– ident: 10659_CR21
– ident: 10659_CR37
  doi: 10.1109/ICME51207.2021.9428227
– ident: 10659_CR17
  doi: 10.1109/ICME51207.2021.9428297
– ident: 10659_CR7
– ident: 10659_CR12
  doi: 10.1109/ICME.2007.4285045
– volume: 9
  start-page: 770
  issue: 5
  year: 2015
  ident: 10659_CR14
  publication-title: IEEE J Sel Topics Signal Process
  doi: 10.1109/JSTSP.2015.2411578
– volume: 78
  start-page: 2157
  issue: 2
  year: 2019
  ident: 10659_CR11
  publication-title: Multimedia Tools Appl
  doi: 10.1007/s11042-018-6273-1
– ident: 10659_CR30
  doi: 10.1109/ICASSP.2017.7952254
– ident: 10659_CR3
  doi: 10.1109/ICASSP39728.2021.9414585
– volume: 80
  start-page: 18717
  issue: 12
  year: 2021
  ident: 10659_CR19
  publication-title: Multimedia Tools Appl
  doi: 10.1007/s11042-020-10339-0
– ident: 10659_CR43
  doi: 10.1109/ICASSP40776.2020.9054412
– ident: 10659_CR38
  doi: 10.1007/978-3-030-67832-6_5
– volume: 101
  start-page: 107003
  year: 2021
  ident: 10659_CR26
  publication-title: Appl Soft Comput
  doi: 10.1016/j.asoc.2020.107003
– volume: 13
  start-page: 1208
  issue: 6
  year: 2011
  ident: 10659_CR25
  publication-title: IEEE Transactions Multimedia
  doi: 10.1109/TMM.2011.2168197
– volume: 15
  start-page: 135
  issue: 6
  year: 2015
  ident: 10659_CR41
  publication-title: Cybern Information Technol
  doi: 10.1515/cait-2015-0074
– ident: 10659_CR42
  doi: 10.1007/s00521-021-05933-8
– ident: 10659_CR20
– volume: 184
  start-page: 232
  year: 2016
  ident: 10659_CR34
  publication-title: Neurocomputing
  doi: 10.1016/j.neucom.2015.08.104
– ident: 10659_CR28
SSID ssj0010020
Score 2.2935123
Snippet Object-based audio systems have become common in recent years as they provide the flexibility for many auditory scenarios, such as virtual reality games,...
SourceID proquest
crossref
springer
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 817
SubjectTerms Algorithms
Artificial Intelligence
Audio data
Coding
Complex Systems
Computational Intelligence
Computer Science
Data compression
Methods
Neural networks
Parameters
Sparsity
Virtual reality
SummonAdditionalLinks – databaseName: SpringerLINK Contemporary 1997-Present
  dbid: RSV
  link: http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnR3LSsQwMOjqwYvrE1dXycGbFto03aRHkV08rQs-2FtpHsUFbaXtCv69M31sUVTQQ2khk6RkknlkXoScx8CFXC58x7o6cLjlypEhfBnAN0j7iUqEqYpNiOlUzufhrAkKK1pv99YkWVHqLtgNtBe0OaIL1ggvr9bJRoDZZlBHv3tc2Q5QAqrULOE6fMS8JlTm-zE-s6NOxvxiFq24zaT_v__cIduNdEmv6u2wS9Zsukf6beUG2hzkfZKiewedxeiZhQ2TvPaofqd4nV9vRjpOdYacDbo92RdLQbylWMF4gTMszSKjtwpvcQpa-R1QkFuBJBiEyQsLIGWGWTKNzQ_Iw2R8f33jNJUXHA1HsnQSwJ_xRMKV8bQErTBOuPGZgBcwVS1Dpl1plOJCecbnAMaUCrXxgIyH0lX-IemlWWqPCGWBVDrmcRJoyVmslSssPAkHzS00ig2I1yIg0k1acqyO8Rx1CZVxQSNY0Kha0EgOyMWqz2udlONX6GGL16g5oEXEQqBlHGRXb0AuWzx2zT-Pdvw38BOyxTBgovL1GZJemS_tKdnUb-WiyM-qjfsBKd7n9Q
  priority: 102
  providerName: Springer Nature
Title High Parameter Frequency Resolution Encoding Scheme for Spatial Audio Objects Using Stacked Sparse Autoencoder
URI https://link.springer.com/article/10.1007/s11063-021-10659-8
https://www.proquest.com/docview/2918343261
Volume 54
WOSCitedRecordID wos000708365700001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVPQU
  databaseName: Advanced Technologies & Aerospace Database
  customDbUrl:
  eissn: 1573-773X
  dateEnd: 20241214
  omitProxy: false
  ssIdentifier: ssj0010020
  issn: 1370-4621
  databaseCode: P5Z
  dateStart: 19970201
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/hightechjournals
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: Computer Science Database
  customDbUrl:
  eissn: 1573-773X
  dateEnd: 20241214
  omitProxy: false
  ssIdentifier: ssj0010020
  issn: 1370-4621
  databaseCode: K7-
  dateStart: 19970201
  isFulltext: true
  titleUrlDefault: http://search.proquest.com/compscijour
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl:
  eissn: 1573-773X
  dateEnd: 20241214
  omitProxy: false
  ssIdentifier: ssj0010020
  issn: 1370-4621
  databaseCode: BENPR
  dateStart: 19970201
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVAVX
  databaseName: SpringerLINK Contemporary 1997-Present
  customDbUrl:
  eissn: 1573-773X
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0010020
  issn: 1370-4621
  databaseCode: RSV
  dateStart: 19970101
  isFulltext: true
  titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22
  providerName: Springer Nature
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LT9wwEB6Vx4ELtFDEAkU-9EatJl4vdk4I0K562q4oVIhLFD8iVoJkSRYk_j0zicMKJLhwiBPJEyfSN54Zz4w9AD8z1EKRVH3uIzvg0kvDdYJPDvFGaz83uXJNsQk1Huurq2QSHG51SKvsZGIjqF1pyUf-WyTIfBKNjfh4ds-pahRFV0MJjSVYiQUKYQrKKv4SRSBbqFlwqYjLIxGHTTPt1jlcC1EEkxK6jsgV9loxLazNNwHSRu-MNj77x19hPVic7KRlkW_wxRebsNFVc2Bhcm9BQSkfbJJRthZ1jKo2y_qJkYu_ZVA2LGxJ2g5fu_F3nqHJy6iq8ZS-8OCmJftryLNTsyYXgaEti2LCEU1VeySZl3RypvPVd7gcDS_O_vBQjYFbnKZzniOmLla5NC62GleKWS5dXyi8oaK1OhE20s4YqUzs-hLJhDGJdTGK9kRHpr8Ny0VZ-B1gYqCNzWSWD6yWIrMmUh6vXOJqLnFG9CDuoEhtOKqcKmbcpotDlgm-FOFLG_hS3YPDl3dm7UEdH1Lvd5ilYdLW6QKwHvzqUF90vz_a7sej7cGaoE0TTb7PPizPqwf_A1bt43xaVwewcjocT84PGtbFdjK4xvb83_9nL0b1Ww
linkProvider ProQuest
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMw1V1JT9tAFH6iUKm9ELqgBkI7BzhRq_ZkgseHqqqAKFFCyCFIERfjWaxGam1qm1b8KX5j3_OSiErNjUMPli3NeLzMN2-ZtwEcRsiFXOF3HevqniOsUI4M8MrgfKO0H6vYN2WxCX8ykfN5MN2AhyYWhtwqG5pYEmqTatoj_8QDBJ9AYcP7cvvToapRZF1tSmhUsBjZ-9-osuWfh2c4v0ec989npwOnrirgaIRb4cT4bsbzY6GMpyVqPFEsTJf7eEKGoWXAtSuNUsJXnukK7MaVCrTxkEQF0lVdHPcZbAnBXaqYMO1dL60WJHuVCp7vOuKEe3WQThWqh7oXWUzJgeyEtt4eM8KVdPuXQbbkc_3W__aHdmC7lqjZ12oJvIINm7yGVlOtgtXE6w0k5NLCphF5o1FDP6u8yO8ZmTCqBcjOE50SN8fbvtkflqFIz6hq84KecGcWKbtUtHOVs9LXgqGsjmTQUJ8st9ilSCkzqLHZW7h6kq_ehc0kTew7YLwnlY5EFPe0FDzSyvUtHrFAbTUwirfBa6Y-1HUqdqoI8j1cJZEmuIQIl7CESyjbcLy857ZKRLK2d6fBSFgTpTxcAaQNHxuUrZr_Pdre-tE-wIvB7GIcjoeT0T685BQgUvo2dWCzyO7sATzXv4pFnr0vlwuDm6dG3x-jzlJK
linkToPdf http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1ZS8NAEB68EF-8xXrug28aTLbbZvMo2qIotXjRt5A9ggVNS5oK_ntnclgVFcSHkMDOJmFnd2dm55sZgIMIpZAr_LpjXd1whBXKkQE-GeQ3avuxin2TF5vwOx3Z6wXdD1H8Odq9ckkWMQ2UpSnJjocmPp4EvqElQ_5HgmM16SBrGmYFWjIE6rq5fXj3I5A2lJtcvuuIJvfKsJnv3_FZNE30zS8u0lzytJf-_8_LsFhqneykmCYrMGWTVViqKjqwcoGvQUKwD9aNCLFFDe20QFq_MjrmLyYpayV6QBIPuz3aZ8tQ7WVU2bhPXxib_oBdKzrdGbEcj8BQn8WtwhBNOrJIkg0oe6ax6Trct1t3p-dOWZHB0bhUMydGvhrPj4UynpZoLUaxMHXu4w2FrZYB1640SglfeaYukIwrFWjj4fYeSFfVN2AmGSR2ExhvSKUjEcUNLQWPtHJ9i1cs0KILjOI18CpmhLpMV05VM57CSaJlGtAQBzTMBzSUNTh87zMsknX8Sr1T8TgsF-4o5AHucQJ1Wq8GRxVPJ80_v23rb-T7MN89a4dXF53LbVjgFFORw4F2YCZLx3YX5vRL1h-le_l8fgOBOvO9
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=High+Parameter+Frequency+Resolution+Encoding+Scheme+for+Spatial+Audio+Objects+Using+Stacked+Sparse+Autoencoder&rft.jtitle=Neural+processing+letters&rft.date=2022-04-01&rft.pub=Springer+Nature+B.V&rft.issn=1370-4621&rft.eissn=1573-773X&rft.volume=54&rft.issue=2&rft.spage=817&rft.epage=833&rft_id=info:doi/10.1007%2Fs11063-021-10659-8
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1370-4621&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1370-4621&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1370-4621&client=summon