Separating time-frequency sources from time-domain convolutive mixtures using non-negative matrix factorization

Bibliographic Details
Published in:	IEEE Workshop on Applications of Signal Processing to Audio and Acoustics: proceedings, pp. 264-268
Main Authors:	Leglaive, Simon; Badeau, Roland; Richard, Gael
Format:	Conference Proceeding
Language:	English
Published:	IEEE 01.10.2017
Subjects:	Audio source separation; Conferences; Convolution; Discrete Fourier transforms; non-negative matrix factorization; Random variables; reverberant mixtures; Source separation; Time-domain analysis; variational inference
ISSN:	1947-1629

Abstract	This paper addresses the problem of under-determined audio source separation in multichannel reverberant mixtures. We target a semiblind scenario in which the mixing filters are assumed to be known. Source separation is performed from the time-domain mixture signals in order to accurately model the convolutive mixing process. The source signals, however, are modeled as latent variables in a time-frequency domain. In a previous paper, we proposed using the modified discrete cosine transform. The present paper generalizes the method to the odd-frequency short-time Fourier transform. In this domain, the source coefficients are modeled as centered complex Gaussian random variables whose variances are structured by means of a non-negative matrix factorization model. The inference procedure relies on a variational expectation-maximization algorithm. In the experiments, we discuss the choice of the source representation and show that the proposed approach outperforms two methods from the literature.
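The two modeling ingredients named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy matrices W and H, and the toy filters are all hypothetical, and the sketch deliberately omits both the odd-frequency short-time Fourier transform that links the time-frequency coefficients to the time-domain sources and the variational expectation-maximization inference. It only shows (1) variances built as a non-negative matrix product, from which centered complex Gaussian coefficients are drawn, and (2) a time-domain convolutive mixture with known filters.

```python
import random


def nmf_variances(W, H):
    """Variance v[f][n] = sum_k W[f][k] * H[k][n], with non-negative W and H."""
    K = len(H)
    return [[sum(W[f][k] * H[k][n] for k in range(K)) for n in range(len(H[0]))]
            for f in range(len(W))]


def sample_coefficients(V, rng):
    """Draw centered circular complex Gaussian coefficients with variances V."""
    # Real and imaginary parts each carry half of the variance.
    return [[complex(rng.gauss(0.0, (v / 2) ** 0.5), rng.gauss(0.0, (v / 2) ** 0.5))
             for v in row] for row in V]


def convolve(a, s):
    """Full linear convolution of a filter a with a signal s."""
    y = [0.0] * (len(a) + len(s) - 1)
    for i, ai in enumerate(a):
        for t, st in enumerate(s):
            y[i + t] += ai * st
    return y


def mix(filters, sources):
    """x_i = sum_j a_ij * s_j, where filters[i][j] maps source j to channel i.

    Assumes all filters share one length and all sources share one length.
    """
    mixtures = []
    for channel_filters in filters:
        parts = [convolve(a, s) for a, s in zip(channel_filters, sources)]
        mixtures.append([sum(vals) for vals in zip(*parts)])
    return mixtures


# Toy values (assumptions for illustration only).
W = [[1.0, 0.5], [0.0, 2.0]]            # frequency bins x components
H = [[1.0, 0.0, 2.0], [0.5, 1.0, 0.0]]  # components x time frames
V = nmf_variances(W, H)
S = sample_coefficients(V, random.Random(0))

# Under-determined toy mixture: two sources, one channel, known filters.
x = mix([[[1.0, 0.5], [0.0, 1.0]]], [[1.0, 2.0, 3.0], [1.0, 1.0, 1.0]])
```

In this toy setting the mixture has fewer channels than sources, which is the under-determined case the abstract refers to; the real method would recover the sources by inferring their time-frequency coefficients from such mixtures.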
Affiliation:	LTCI, Univ. Paris-Saclay, Paris, France (Leglaive, Simon; Badeau, Roland; Richard, Gael)
DOI	10.1109/WASPAA.2017.8170036
Discipline	Engineering
EISBN	9781538616321 1538616327
ISICitedReferencesCount	6
PageCount	5
PublicationTitleAbbrev	WASPAA
URI	https://ieeexplore.ieee.org/document/8170036