An integrated speech-background model for robust speaker identification

A procedure for text-independent speaker identification in noisy environments where the interfering background signals cannot be characterized using traditional broadband or impulsive noise models is examined. In the procedure, both the speaker and the background processes are modeled using mixtures...

Celý popis

Uložené v:

Podrobná bibliografia
Vydané v:	[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing Ročník 2; s. 185 - 188 vol.2
Hlavní autori:	Reynolds, D.A., Rose, R.C.
Médium:	Konferenčný príspevok..
Jazyk:	English
Vydavateľské údaje:	IEEE 1992
Predmet:	Background noise Gaussian processes Noise robustness Parameter estimation Signal processing Speech analysis Speech enhancement Speech processing Working environment noise
ISBN:	9780780305328, 0780305329
ISSN:	1520-6149
On-line prístup:	Získať plný text
Tagy:	Pridať tag Žiadne tagy, Buďte prvý, kto otaguje tento záznam!

Abstract	A procedure for text-independent speaker identification in noisy environments where the interfering background signals cannot be characterized using traditional broadband or impulsive noise models is examined. In the procedure, both the speaker and the background processes are modeled using mixtures of Gaussians. Speaker and background models are integrated into a unified statistical framework allowing the decoupling of the underlying speech process from the noise corrupted observations via the expectation-minimization algorithm. Using this formalism, speaker model parameters are estimated in the presence of the background process, and a scoring procedure is implemented for computing the speaker likelihood in the noise corrupted environment. The performance was evaluated using a 16-speaker conversational speech database with both speech babble and white noise background processes.< >
AbstractList	A procedure for text-independent speaker identification in noisy environments where the interfering background signals cannot be characterized using traditional broadband or impulsive noise models is examined. In the procedure, both the speaker and the background processes are modeled using mixtures of Gaussians. Speaker and background models are integrated into a unified statistical framework allowing the decoupling of the underlying speech process from the noise corrupted observations via the expectation-minimization algorithm. Using this formalism, speaker model parameters are estimated in the presence of the background process, and a scoring procedure is implemented for computing the speaker likelihood in the noise corrupted environment. The performance was evaluated using a 16-speaker conversational speech database with both speech babble and white noise background processes.< >
Author	Reynolds, D.A. Rose, R.C.
Author_xml	– sequence: 1 givenname: D.A. surname: Reynolds fullname: Reynolds, D.A. organization: MIT Lincoln Lab., Lexington, MA, USA – sequence: 2 givenname: R.C. surname: Rose fullname: Rose, R.C. organization: MIT Lincoln Lab., Lexington, MA, USA
BookMark	eNotT81KAzEYDFjBWvcFesoLbP3ys9nNsRStQkGhei7Z5EuNbZOSTQ--vSt1GJjLMD_3ZBJTRELmDBaMgX58XS232_cF05ovOFfQ6RtS6baDkQIawbsJmbKGQ62Y1HekGoZvGCEb1ko-JetlpCEW3GdT0NHhjGi_6t7Ywz6nS3T0lBweqU-Z5tRfhvJnMQfMNDiMJfhgTQkpPpBbb44DVv86I5_PTx-rl3rzth43buow9pW6M71RILGx2FoBRgAysE5pYQwqL6H1vvNS64Y51zPLYPQoROG1RseFmJH5NTcg4u6cw8nkn931ufgFf-1P4w
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/ICASSP.1992.226089
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EndPage	188 vol.2
ExternalDocumentID	226089
GroupedDBID	23M 29P 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR AAWTH ABLEC ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP IPLJI M43 OCL RIE RIL RNS
ID	FETCH-LOGICAL-i174t-8aba604e5ce7c30a30e10cd693aae6f407ff8f49951ddb1c100a36ee3f99ed233
IEDL.DBID	RIE
ISBN	9780780305328 0780305329
ISICitedReferencesCount	0
ISICitedReferencesURI	http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=226089&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN	1520-6149
IngestDate	Tue Aug 26 21:35:18 EDT 2025
IsPeerReviewed	false
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i174t-8aba604e5ce7c30a30e10cd693aae6f407ff8f49951ddb1c100a36ee3f99ed233
ParticipantIDs	ieee_primary_226089
PublicationCentury	1900
PublicationDate	19920000
PublicationDateYYYYMMDD	1992-01-01
PublicationDate_xml	– year: 1992 text: 19920000
PublicationDecade	1990
PublicationTitle	[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing
PublicationTitleAbbrev	ICASSP
PublicationYear	1992
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0000451742 ssj0008748
Score	1.3369006
Snippet	A procedure for text-independent speaker identification in noisy environments where the interfering background signals cannot be characterized using...
SourceID	ieee
SourceType	Publisher
StartPage	185
SubjectTerms	Background noise Gaussian processes Noise robustness Parameter estimation Signal processing Speech analysis Speech enhancement Speech processing Working environment noise
Title	An integrated speech-background model for robust speaker identification
URI	https://ieeexplore.ieee.org/document/226089
Volume	2
WOSCitedRecordID	wos226089&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NS8NAEF1s8aAXtVb8Zg9et81287F7LMWqIKVQhd7KfsxiKaQlSf397m7SquDFWxIWEnZC3rzJvDcIPcQeRRx0EwOpJnECMZEOpwlTlBqlskRDiPRrNpnw-VxMG5_toIUBgNB8Bj1_GP7lm7Xe-lJZ36UKERct1MqytJZq7csp3iYlsLzmI8yzMDjLoZNnR7EIjJ37t5sNRGO8szvnOzFNJPovo-FsNvUavkGvvt2vsSsBdcYn_3reU9T9Vu_h6R6XztAB5B10_MN48Bw9DXO8N4owuNwA6A-ipF55lUducBiQg11Ci4u12paVXyJXUOCladqLQkS76H38-DZ6Js1IBbJ021MRLpVMoxhcCDLNIskioJE2qWBSQmodu7OWW8eCEmqMoppGbk0KwKwQYAaMXaB2vs7hEmEujPVmgtqKJKYGlKbWunzDEVJmVCyvUMfvyGJTu2Ys6s24_vPqDTqq22B9aeMWtatiC3foUH9Wy7K4D5H-Av26o3o
linkProvider	IEEE
linkToHtml	http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NT8JAEN0omqgXFTF-uwevCy1b2t0jISJEJCRgwo3sx2wkJIWU4u93d1tQEy_e2maTNjtN37zpvDcIPUUORSx0Ew2xIlELIiIsThMqw1BLmbQU-EgPkuGQTad8VPpsey0MAPjmM6i7Q_8vXy_VxpXKGjZVCBjfRwducFYp1toVVJxRiud55WeYJX50lsUnx48i7jk7c-83bfLSemd7zrZymoA3-p32eDxyKr5mvbjhr8ErHne6p_964jNU-9bv4dEOmc7RHqRVdPLDevACvbRTvLOK0Hi9AlAfRAq1cDqPVGM_IgfblBZnS7lZ526JWECG57psMPIxraH37vOk0yPlUAUyt9uTEyakiIMIbBASRQNBAwgDpWNOhYDYWH5nDDOWB7VCrWWowsCuiQGo4Rx0k9JLVEmXKVwhzLg2zk5QGd6KQg1ShcbYjMNSUqplJK5R1e3IbFX4ZsyKzbj58-ojOupN3gazQX_4eouOi6ZYV-i4Q5U828A9OlSf-XydPfiofwG23qbD
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=%5BProceedings%5D+ICASSP-92%3A+1992+IEEE+International+Conference+on+Acoustics%2C+Speech%2C+and+Signal+Processing&rft.atitle=An+integrated+speech-background+model+for+robust+speaker+identification&rft.au=Reynolds%2C+D.A.&rft.au=Rose%2C+R.C.&rft.date=1992-01-01&rft.pub=IEEE&rft.isbn=9780780305328&rft.issn=1520-6149&rft.volume=2&rft.spage=185&rft.epage=188+vol.2&rft_id=info:doi/10.1109%2FICASSP.1992.226089&rft.externalDocID=226089
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1520-6149&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1520-6149&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1520-6149&client=summon