Audio signal quality enhancement using multi-layered convolutional neural network based auto encoder–decoder

In this research article, a multi-layered convolutional neural network (MLCNN) based auto-CODEC for audio signal enhancement which is utilizing the Mel-frequency cepstral coefficients (MFCC) has been proposed. The proposed MLCNN takes the input as MFCC with different frames from the noise contaminat...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:International journal of speech technology Ročník 24; číslo 2; s. 425 - 437
Hlavní autoři: Raj, Shivangi, Prakasam, P., Gupta, Shubham
Médium: Journal Article
Jazyk:angličtina
Vydáno: New York Springer US 01.06.2021
Springer Nature B.V
Témata:
ISSN:1381-2416, 1572-8110
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:In this research article, a multi-layered convolutional neural network (MLCNN) based auto-CODEC for audio signal enhancement which is utilizing the Mel-frequency cepstral coefficients (MFCC) has been proposed. The proposed MLCNN takes the input as MFCC with different frames from the noise contaminated audio signal for training and testing. The proposed MLCNN models has been trained and tested as 80:20 and 70:30 ratios from the available database. The proposed method has been verified and validated MNIST database. From the validation it has been found that the proposed MLCNN model provides an accuracy of 93.25%. The performance of MLCNN has been evaluated using short-time objective intelligibility, perceptual evaluation of speech quality and Cosine similarities. The proposed MLCNN model has been compared with the reported models. Form the comparisons; it has been observed that the proposed MLCNN model outperforms other models. From the cosine similarity, it has been proved that MLCNN provides high security level which can be used for many secure applications.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1381-2416
1572-8110
DOI:10.1007/s10772-021-09809-z