Separating time-frequency sources from time-domain convolutive mixtures using non-negative matrix factorization

This paper addresses the problem of under-determined audio source separation in multichannel reverberant mixtures. We target a semiblind scenario assuming that the mixing filters are known. Source separation is performed from the time-domain mixture signals in order to accurately model the convoluti...

Full description

Saved in:
Bibliographic Details
Published in:IEEE Workshop on Applications of Signal Processing to Audio and Acoustics : proceedings pp. 264 - 268
Main Authors: Leglaive, Simon, Badeau, Roland, Richard, Gael
Format: Conference Proceeding
Language:English
Published: IEEE 01.10.2017
Subjects:
ISSN:1947-1629
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract This paper addresses the problem of under-determined audio source separation in multichannel reverberant mixtures. We target a semiblind scenario assuming that the mixing filters are known. Source separation is performed from the time-domain mixture signals in order to accurately model the convolutive mixing process. The source signals are however modeled as latent variables in a time-frequency domain. In a previous paper we proposed to use the modified discrete cosine transform. The present paper generalizes the method to the use of the odd-frequency short-time Fourier transform. In this domain, the source coefficients are modeled as centered complex Gaussian random variables whose variances are structured by means of a non-negative matrix factorization model. The inference procedure relies on a variational expectation-maximization algorithm. In the experiments we discuss the choice of the source representation and we show that the proposed approach outperforms two methods from the literature.
AbstractList This paper addresses the problem of under-determined audio source separation in multichannel reverberant mixtures. We target a semiblind scenario assuming that the mixing filters are known. Source separation is performed from the time-domain mixture signals in order to accurately model the convolutive mixing process. The source signals are however modeled as latent variables in a time-frequency domain. In a previous paper we proposed to use the modified discrete cosine transform. The present paper generalizes the method to the use of the odd-frequency short-time Fourier transform. In this domain, the source coefficients are modeled as centered complex Gaussian random variables whose variances are structured by means of a non-negative matrix factorization model. The inference procedure relies on a variational expectation-maximization algorithm. In the experiments we discuss the choice of the source representation and we show that the proposed approach outperforms two methods from the literature.
Author Badeau, Roland
Richard, Gael
Leglaive, Simon
Author_xml – sequence: 1
  givenname: Simon
  surname: Leglaive
  fullname: Leglaive, Simon
  organization: LTCI, Univ. Paris-Saclay, Paris, France
– sequence: 2
  givenname: Roland
  surname: Badeau
  fullname: Badeau, Roland
  organization: LTCI, Univ. Paris-Saclay, Paris, France
– sequence: 3
  givenname: Gael
  surname: Richard
  fullname: Richard, Gael
  organization: LTCI, Univ. Paris-Saclay, Paris, France
BookMark eNotkN1KAzEUhKMo2NY-QW_2BbbmJJtNcrkU_6CgUMXLEpOzJdJNajZbqk9vpb0amBk-hhmTqxADEjIDOgeg-u6jWb02zZxRkHMFklJeX5CplgoEVzXUnMElGYGuZAk10zdk3PdflAqmKjoicYU7k0z2YVNk32HZJvweMNifoo9DstgXbYrdKXOxMz4UNoZ93A7Z77Ho_CEP6dga-n_EcVsZcGNOmcnJH4rW2ByT_z2aMdyS69Zse5yedULeH-7fFk_l8uXxedEsSw9S5FJbA5UDqYFb7ZhiQnwqiZVGNBS1q-rKcMurlhnqTKukko5ZLpwDoUEYPiGzE9cj4nqXfGfSz_r8D_8DxyJfQg
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/WASPAA.2017.8170036
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9781538616321
1538616327
EISSN 1947-1629
EndPage 268
ExternalDocumentID 8170036
Genre orig-research
GroupedDBID -~X
29I
6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
M43
OCL
RIE
RIL
RNS
ID FETCH-LOGICAL-i175t-9ca14d17913c9d28255b87e49eea0e9d464a3c34f2a0daf8787d2c35dd15915a3
IEDL.DBID RIE
ISICitedReferencesCount 6
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000426939000054&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 02:37:51 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i175t-9ca14d17913c9d28255b87e49eea0e9d464a3c34f2a0daf8787d2c35dd15915a3
PageCount 5
ParticipantIDs ieee_primary_8170036
PublicationCentury 2000
PublicationDate 2017-Oct.
PublicationDateYYYYMMDD 2017-10-01
PublicationDate_xml – month: 10
  year: 2017
  text: 2017-Oct.
PublicationDecade 2010
PublicationTitle IEEE Workshop on Applications of Signal Processing to Audio and Acoustics : proceedings
PublicationTitleAbbrev WASPAA
PublicationYear 2017
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0052840
Score 2.1062844
Snippet This paper addresses the problem of under-determined audio source separation in multichannel reverberant mixtures. We target a semiblind scenario assuming that...
SourceID ieee
SourceType Publisher
StartPage 264
SubjectTerms Audio source separation
Conferences
Convolution
Discrete Fourier transforms
non-negative matrix factorization
Random variables
reverberant mixtures
Source separation
Time-domain analysis
variational inference
Title Separating time-frequency sources from time-domain convolutive mixtures using non-negative matrix factorization
URI https://ieeexplore.ieee.org/document/8170036
WOSCitedRecordID wos000426939000054&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07T8MwED61FQMsPFpEeckDI26TOi-PFaJiQFWlguhWOfa1ytAE9aXy7_HFoYDEwhbZii3dOXeO_X3fAdxFIhXKRj6e6DTmgfF9Lm2i5CqJSD9d2yRUSuY_x8NhMpnIUQ3u91wYRCzBZ9ihx_Iu3xR6Q0dl3VJMTkR1qMdx5LhaX1E3tGHWq1SFfE923_rjUb9P0K24U732q35KmT4Gx_-b-ARa3zw8NtpnmFOoYX4GRz8kBJtQjNGpd-dzRnXi-WzpwNEfzJ3LrxgxSFyfKRYqyxkhzcsVt0W2yHZ0h7BiBICfs7zIeY5z5fpIvn_HXEmeiq_ZgtfB48vDE6-KKPDM7gzWXGrlWxfE0hdaGmKqhmkSYyARlYfSBFGghBbBrKc8o2aJ_YBNT4vQGLvR8UMlzqFh58YLYGQDgSjTsEeqcvZXxAglVITSDmTXQhuaZLrpu9PJmFZWu_y7-QoOyTsOGHcNjfVygzdwoLfrbLW8LZ37CaK_qMI
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LTwIxEJ4gmqgXH2h824NHCyzdV4_ESDAiIQEjN9LdDmQP7BpewX9vZ7uiJl68bdpsm8x0Z7rt930DcOeLSCgT-XgYRwF3teNwaRIlV6FP-umxSUK5ZH4n6HbD4VD2SnC_4cIgYg4-wyo95nf5OouXdFRWy8XkhL8F21Q5q2BrfcVdzwTaeqEr5NRl7a3Z7zWbBN4KqsWLvyqo5AmkdfC_qQ_h5JuJx3qbHHMEJUyPYf-HiGAFsj5a_e50wqhSPB_PLDz6g9mT-TkjDont09lUJSkjrHm-5lbIpsmabhHmjCDwE5ZmKU9xomwfCfivmS3KUzA2T-C19Th4aPOijAJPzN5gwWWsHOOEQDoilpq4ql4UBuhKRFVHqV3fVSIW7rih6lqNQ_MJ60YsPK3NVsfxlDiFspkbz4CRDQSijLwG6cqZnxEtlFA-SjOQWQ3nUCHTjd6tUsaosNrF3823sNsevHRGnafu8yXskacsTO4KyovZEq9hJ14tkvnsJnf0J5XyrAs
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE+Workshop+on+Applications+of+Signal+Processing+to+Audio+and+Acoustics+%3A+proceedings&rft.atitle=Separating+time-frequency+sources+from+time-domain+convolutive+mixtures+using+non-negative+matrix+factorization&rft.au=Leglaive%2C+Simon&rft.au=Badeau%2C+Roland&rft.au=Richard%2C+Gael&rft.date=2017-10-01&rft.pub=IEEE&rft.eissn=1947-1629&rft.spage=264&rft.epage=268&rft_id=info:doi/10.1109%2FWASPAA.2017.8170036&rft.externalDocID=8170036