Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm

Speaker recognition is the process of identifying the proper speaker by analyzing the spectral shape of the speech signal. This process is done by extracting the desired features and matching the features of the speech signal. In this paper, we adopted the Mel frequency cepstrum coefficient (MFCC) t...

Full description

Saved in:

Bibliographic Details
Published in:	Circuits, systems, and signal processing Vol. 38; no. 6; pp. 2810 - 2828
Main Authors:	Mallikarjunan, M., Karmali Radha, P., Bharath, K. P., Muthu, Rajesh Kumar
Format:	Journal Article
Language:	English
Published:	New York Springer US 01.06.2019 Springer Nature B.V
Subjects:	Algorithms Circuits and Systems Covariance Electrical Engineering Electronics and Microelectronics Engineering Feature extraction Instrumentation Matching Probabilistic models Signal processing Signal,Image and Speech Processing Speech recognition Vector quantization Feature extraction MFCC Modified VQ-LBG GMM–UBM
ISSN:	0278-081X, 1531-5878
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Speaker recognition is the process of identifying the proper speaker by analyzing the spectral shape of the speech signal. This process is done by extracting the desired features and matching the features of the speech signal. In this paper, we adopted the Mel frequency cepstrum coefficient (MFCC) technique for extracting the features from the speaker speech sample. These cepstrum coefficients are named as extracted features. The extracted MFCC features are given as input to the modified vector quantization via Linde–Buzo–Gray (modified VQ-LBG) process and expectation maximization (EM) algorithm. Vector quantization technique is mainly used for feature matching where a separate codebook will be generated for each speaker. The EM algorithm is utilized to develop the Gaussian mixture model–universal background model (GMM–UBM). In GMM–UBM model, k means cluster is summed up to consolidate data about the covariance structure of the information and the focuses of the inert Gaussians. From our analysis, the modified VQ-LBG algorithm gives better performance compared to the GMM–UBM model.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0278-081X 1531-5878
DOI:	10.1007/s00034-018-0992-4