Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction

Rank-constrained spatial covariance matrix estimation (RCSCME) is a state-of-the-art blind speech extraction method applied to cases where one directional target speech and diffuse noise are mixed. In this paper, we proposed a new algorithmic extension of RCSCME. RCSCME complements a deficient one r...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998) S. 806 - 810
Hauptverfasser: Kondo, Yuto, Kubo, Yuki, Takamune, Norihiro, Kitamura, Daichi, Saruwatari, Hiroshi
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: IEEE 06.06.2021
Schlagworte:
ISSN:2379-190X
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Rank-constrained spatial covariance matrix estimation (RCSCME) is a state-of-the-art blind speech extraction method applied to cases where one directional target speech and diffuse noise are mixed. In this paper, we proposed a new algorithmic extension of RCSCME. RCSCME complements a deficient one rank of the diffuse noise spatial covariance matrix, which cannot be estimated via preprocessing such as independent low-rank matrix analysis, and estimates the source model parameters simultaneously. In the conventional RC- SCME, a direction of the deficient basis is fixed in advance and only the scale is estimated; however, the candidate of this deficient basis is not unique in general. In the proposed RCSCM model, the deficient basis itself can be accurately estimated as a vector variable by solving a vector optimization problem. Also, we derive new update rules based on the EM algorithm. We confirm that the proposed method outperforms conventional methods under several noise conditions.
AbstractList Rank-constrained spatial covariance matrix estimation (RCSCME) is a state-of-the-art blind speech extraction method applied to cases where one directional target speech and diffuse noise are mixed. In this paper, we proposed a new algorithmic extension of RCSCME. RCSCME complements a deficient one rank of the diffuse noise spatial covariance matrix, which cannot be estimated via preprocessing such as independent low-rank matrix analysis, and estimates the source model parameters simultaneously. In the conventional RC- SCME, a direction of the deficient basis is fixed in advance and only the scale is estimated; however, the candidate of this deficient basis is not unique in general. In the proposed RCSCM model, the deficient basis itself can be accurately estimated as a vector variable by solving a vector optimization problem. Also, we derive new update rules based on the EM algorithm. We confirm that the proposed method outperforms conventional methods under several noise conditions.
Author Takamune, Norihiro
Kubo, Yuki
Kitamura, Daichi
Saruwatari, Hiroshi
Kondo, Yuto
Author_xml – sequence: 1
  givenname: Yuto
  surname: Kondo
  fullname: Kondo, Yuto
  organization: The University of Tokyo,Tokyo,Japan
– sequence: 2
  givenname: Yuki
  surname: Kubo
  fullname: Kubo, Yuki
  organization: The University of Tokyo,Tokyo,Japan
– sequence: 3
  givenname: Norihiro
  surname: Takamune
  fullname: Takamune, Norihiro
  organization: The University of Tokyo,Tokyo,Japan
– sequence: 4
  givenname: Daichi
  surname: Kitamura
  fullname: Kitamura, Daichi
  organization: National Institute of Technology,Kagawa College,Kagawa,Japan
– sequence: 5
  givenname: Hiroshi
  surname: Saruwatari
  fullname: Saruwatari, Hiroshi
  organization: The University of Tokyo,Tokyo,Japan
BookMark eNp9kM1OAjEUhavRRECewE1fYLA_M9N2KSOoCagRTdyRS-dOqGJLpo3BR_FtHSILV65OcnLul3tOn5z44JEQytmIc2Yu76qrxeJRGiX0SDDBRybnea7MERkapXlnc1WyojgmPSGVybhhr2ekH-MbY0yrXPfI9zU2zjr0iY4hukgnMbkPSC54Ghp6H1xEuth2BmxoFT6hdeAt0jmk1u1oE1r6BP49q4KPqQXnsf4n_gc-x7QONXWejjfO748Q7ZpOdh3F7hPn5LSBTcThQQfkZTp5rm6z2cNNV3yWOcFkyqRZMWCF0aaQhV6BwCY3HGtrLLd1iblitjSoa5CScYmikYglYlFyAL7ickAufrkOEZfbtnuw_VoelpQ_tqduIg
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/ICASSP39728.2021.9414479
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9781728176055
1728176050
EISSN 2379-190X
EndPage 810
ExternalDocumentID 9414479
Genre orig-research
GroupedDBID 23M
6IE
6IF
6IH
6IK
6IL
6IM
6IN
AAJGR
AAWTH
ABLEC
ACGFS
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IJVOP
IPLJI
M43
OCL
RIE
RIL
RIO
RNS
ID FETCH-LOGICAL-i203t-39b0a059895358ba2ef491edc9c1cd6e470c69e8da33013e2f3ee6ee561aa1b13
IEDL.DBID RIE
ISICitedReferencesCount 1
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000704288401010&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 02:39:02 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i203t-39b0a059895358ba2ef491edc9c1cd6e470c69e8da33013e2f3ee6ee561aa1b13
PageCount 5
ParticipantIDs ieee_primary_9414479
PublicationCentury 2000
PublicationDate 2021-June-6
PublicationDateYYYYMMDD 2021-06-06
PublicationDate_xml – month: 06
  year: 2021
  text: 2021-June-6
  day: 06
PublicationDecade 2020
PublicationTitle Proceedings of the ... IEEE International Conference on Acoustics, Speech and Signal Processing (1998)
PublicationTitleAbbrev ICASSP
PublicationYear 2021
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0008748
Score 2.1608553
Snippet Rank-constrained spatial covariance matrix estimation (RCSCME) is a state-of-the-art blind speech extraction method applied to cases where one directional...
SourceID ieee
SourceType Publisher
StartPage 806
SubjectTerms Acoustics
Analytical models
Blind speech extraction
Conferences
diffuse noise
EM algorithm
Estimation
Pareto optimization
Signal processing
Signal processing algorithms
spatial covariance matrix
Title Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction
URI https://ieeexplore.ieee.org/document/9414479
WOSCitedRecordID wos000704288401010&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PS8MwFA7b8KAXf2zib3LwaLe1Sdvk6OZEDxvDKew20vQVi9DK2o39K_63vqRzThDBWwl9CSRp3vfS972PkGuQnpIsiB0IMTbhGDI4kYoUHobCTzj4wou4FZsIRyMxncpxjdxsuDAAYJPPoG0e7b_8ONcLc1XWkRzhfyjrpB6GQcXV2py6IuTiK1OnKzuP_dvJZIzO1jP5W57bXtv-EFGxPuR-_3-jH5DWNxmPjjdu5pDUIDsie1t1BJvk4w5MIQg0pz1VpAUd4IdbcRJpntBRnhZAjfgwbjbaz5cYH5vFpkNTn39FEbfSJ5W9OUa902pGQPzH61udD63-NE0z2kO4aowA9CsdrMp5RZhokZf7wXP_wVlrLjip12Wlw2TUVQi5hPSZLyLlQcKlC7GW2tVxADzs6kCCiBVj5gbVSxhAAIAwTCk3ctkxaWR5BieEJq5WGO7pUAOCLuWJGKFkohDicewN9ClpmkmevVdlNWbr-T37vfmc7Jp1tFlawQVplPMFXJIdvSzTYn5l98InycO43g
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1bS8MwFA5zCuqLl028mwcf7damaZs8ujnZcBvDTdjbSNNTLEIru7G_4r_1pJtzggi-ldCTQJLmfCc93_kIuQXJlHT9yIIAYxOOIYMVqlDhYSi8mIMnWMhzsYmg2xXDoewVyN2aCwMAefIZVMxj_i8_yvTMXJVVJUf4H8gtsu1xzuwlW2t97oqAi69cHVtWW_X7fr-H7paZDC7mVFbWP2RUci_yePC_8Q9J-ZuOR3trR3NECpAek_2NSoIl8vEAphQEmtOamiQT2sBPd8lKpFlMu1kyAWrkh3G70Xo2xwjZLDftmAr9C4rIlT6r9M0y-p25agREf7y-0XknV6CmSUprCFiNEYB-pY3FdLykTJTJy2NjUG9aK9UFK2G2O7VcGdoKQZeQnuuJUDGIuXQg0lI7OvKBB7b2JYhIua65Q2WxC-ADIBBTygkd94QU0yyFU0JjRysM-HSgAWGXYiJCMBkrBHkcewN9Rkpmkkfvy8Iao9X8nv_efEN2m4NOe9RudZ8uyJ5Z0zxny78kxel4BldkR8-nyWR8ne-LT4zdvCU
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+...+IEEE+International+Conference+on+Acoustics%2C+Speech+and+Signal+Processing+%281998%29&rft.atitle=Deficient+Basis+Estimation+of+Noise+Spatial+Covariance+Matrix+for+Rank-Constrained+Spatial+Covariance+Matrix+Estimation+Method+in+Blind+Speech+Extraction&rft.au=Kondo%2C+Yuto&rft.au=Kubo%2C+Yuki&rft.au=Takamune%2C+Norihiro&rft.au=Kitamura%2C+Daichi&rft.date=2021-06-06&rft.pub=IEEE&rft.eissn=2379-190X&rft.spage=806&rft.epage=810&rft_id=info:doi/10.1109%2FICASSP39728.2021.9414479&rft.externalDocID=9414479