Harmonizing Voices: Enhancing Speech Recognition Through Integrated Phonological Features in Bengali

This study reveals a novel avenue for advancing speech recognition accuracy by integrating phonological features. Despite remarkable progress, speech recognition systems encounter challenges with varying accents and speech patterns. This study proposes an innovative approach incorporating phonologic...

Bibliographic details
Published in: 2023 3rd International Conference on Smart Generation Computing, Communication and Networking (SMART GENCON), pp. 1-8
Main authors: Bhowmik, Tanmay; Choudhury, Amitava
Format: Conference paper
Language: English
Publication details: IEEE, 29 December 2023
Abstract This study presents an approach to improving speech recognition accuracy by integrating phonological features. Despite remarkable progress, speech recognition systems still struggle with varying accents and speech patterns. The proposed approach incorporates phonological attributes such as stress patterns, phoneme duration, and intonation into the recognition process. By assimilating these linguistic cues, the system aims to capture the speech nuances essential for precise understanding. Extensive experiments conducted on diverse datasets demonstrate the efficacy of this phonology-enriched approach in enhancing recognition accuracy across different speech styles and variations. The phoneme detection model is built with a deep neural network, and the classification model is based on a stacked denoising autoencoder. The outcomes underscore the potential of phonological integration in constructing adaptable and inclusive speech recognition systems, holding promise for improved communication technology in real-world multilingual scenarios. The proposed system achieved an overall accuracy of 86.19%. Classification across several places and manners of articulation was also performed. In this classification task, the system achieved 98.9% accuracy for manner of articulation and 50.2% for place of articulation.
Author Bhowmik, Tanmay
Choudhury, Amitava
Author_xml – sequence: 1
  givenname: Tanmay
  surname: Bhowmik
  fullname: Bhowmik, Tanmay
  email: tanmaybhowmik@gmail.com
  organization: Pandit Deendayal Energy University,Dept. of Computer Engineering,Gandhinagar,Gujarat,India
– sequence: 2
  givenname: Amitava
  surname: Choudhury
  fullname: Choudhury, Amitava
  email: a.choudhury2013@gmail.com
  organization: Pandit Deendayal Energy University,Dept. of Computer Engineering,Gandhinagar,Gujarat,India
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SMARTGENCON60755.2023.10442437
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9798350319125
EndPage 8
ExternalDocumentID 10442437
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i119t-9cfe2714324ca035ad413a90ed73e4d2f1643d4d6ef748a0bc6360e86fd095d3
IEDL.DBID RIE
IngestDate Wed May 01 11:50:42 EDT 2024
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i119t-9cfe2714324ca035ad413a90ed73e4d2f1643d4d6ef748a0bc6360e86fd095d3
PageCount 8
ParticipantIDs ieee_primary_10442437
PublicationCentury 2000
PublicationDate 2023-Dec.-29
PublicationDateYYYYMMDD 2023-12-29
PublicationDate_xml – month: 12
  year: 2023
  text: 2023-Dec.-29
  day: 29
PublicationDecade 2020
PublicationTitle 2023 3rd International Conference on Smart Generation Computing, Communication and Networking (SMART GENCON)
PublicationTitleAbbrev SMART GENCON
PublicationYear 2023
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8554466
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Adaptation models
Feature extraction
inclusive recognition
phonological features
place and manner of articulation
Robustness
speech attributes
Speech enhancement
Speech recognition
Stress
Task analysis
Usability
Title Harmonizing Voices: Enhancing Speech Recognition Through Integrated Phonological Features in Bengali
URI https://ieeexplore.ieee.org/document/10442437