IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
Uloženo v:
| Název: | IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography |
|---|---|
| Autoři: | Xiangrui Wang, Lu Tang, Qibin Zheng, Xilin Yang, Zhiyuan Lu |
| Zdroj: | Sensors, Vol 23, Iss 13, p 5775 (2023) |
| Informace o vydavateli: | MDPI AG |
| Rok vydání: | 2023 |
| Sbírka: | Directory of Open Access Journals: DOAJ Articles |
| Témata: | sign language recognition, surface electromyogram, inception network, residual module, dilated convolution, Chemical technology, TP1-1185 |
| Popis: | Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used in most CNN-based studies inadequately captures the features of the input data, we propose a novel inception architecture with a residual module and dilated convolution (IRDC-net) to enlarge the receptive fields and enrich the feature maps, applying it to SLR tasks for the first time. This work first transformed the time domain signal into a time–frequency domain using discrete Fourier transformation. Second, an IRDC-net was constructed to recognize ten Chinese sign language signs. Third, the tandem CNN networks VGG-net and ResNet-18 were compared with our proposed parallel structure network, IRDC-net. Finally, the public dataset Ninapro DB1 was utilized to verify the generalization performance of the IRDC-net. The results showed that after transforming the time domain sEMG signal into the time–frequency domain, the classification accuracy (acc) increased from 84.29% to 91.70% when using the IRDC-net on our sign language dataset. Furthermore, for the time–frequency information of the public dataset Ninapro DB1, the classification accuracy reached 89.82%; this value is higher than that achieved in other recent studies. As such, our findings contribute to research into SLR tasks and to improving deaf and hearing-impaired people’s daily lives. |
| Druh dokumentu: | article in journal/newspaper |
| Jazyk: | English |
| Relation: | https://www.mdpi.com/1424-8220/23/13/5775; https://doaj.org/toc/1424-8220; https://doaj.org/article/3b2b3373faec4f0bbd5c30335b2b2465 |
| DOI: | 10.3390/s23135775 |
| Dostupnost: | https://doi.org/10.3390/s23135775 https://doaj.org/article/3b2b3373faec4f0bbd5c30335b2b2465 |
| Přístupové číslo: | edsbas.2E34B0F2 |
| Databáze: | BASE |
| FullText | Text: Availability: 0 CustomLinks: – Url: https://doi.org/10.3390/s23135775# Name: EDS - BASE (s4221598) Category: fullText Text: View record from BASE – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Wang%20X Name: ISI Category: fullText Text: Nájsť tento článok vo Web of Science Icon: https://imagesrvr.epnet.com/ls/20docs.gif MouseOverText: Nájsť tento článok vo Web of Science |
|---|---|
| Header | DbId: edsbas DbLabel: BASE An: edsbas.2E34B0F2 RelevancyScore: 944 AccessLevel: 3 PubType: Academic Journal PubTypeId: academicJournal PreciseRelevancyScore: 943.653564453125 |
| IllustrationInfo | |
| Items | – Name: Title Label: Title Group: Ti Data: IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography – Name: Author Label: Authors Group: Au Data: <searchLink fieldCode="AR" term="%22Xiangrui+Wang%22">Xiangrui Wang</searchLink><br /><searchLink fieldCode="AR" term="%22Lu+Tang%22">Lu Tang</searchLink><br /><searchLink fieldCode="AR" term="%22Qibin+Zheng%22">Qibin Zheng</searchLink><br /><searchLink fieldCode="AR" term="%22Xilin+Yang%22">Xilin Yang</searchLink><br /><searchLink fieldCode="AR" term="%22Zhiyuan+Lu%22">Zhiyuan Lu</searchLink> – Name: TitleSource Label: Source Group: Src Data: Sensors, Vol 23, Iss 13, p 5775 (2023) – Name: Publisher Label: Publisher Information Group: PubInfo Data: MDPI AG – Name: DatePubCY Label: Publication Year Group: Date Data: 2023 – Name: Subset Label: Collection Group: HoldingsInfo Data: Directory of Open Access Journals: DOAJ Articles – Name: Subject Label: Subject Terms Group: Su Data: <searchLink fieldCode="DE" term="%22sign+language+recognition%22">sign language recognition</searchLink><br /><searchLink fieldCode="DE" term="%22surface+electromyogram%22">surface electromyogram</searchLink><br /><searchLink fieldCode="DE" term="%22inception+network%22">inception network</searchLink><br /><searchLink fieldCode="DE" term="%22residual+module%22">residual module</searchLink><br /><searchLink fieldCode="DE" term="%22dilated+convolution%22">dilated convolution</searchLink><br /><searchLink fieldCode="DE" term="%22Chemical+technology%22">Chemical technology</searchLink><br /><searchLink fieldCode="DE" term="%22TP1-1185%22">TP1-1185</searchLink> – Name: Abstract Label: Description Group: Ab Data: Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used in most CNN-based studies inadequately captures the features of the input data, we propose a novel inception architecture with a residual module and dilated convolution (IRDC-net) to enlarge the receptive fields and enrich the feature maps, applying it to SLR tasks for the first time. This work first transformed the time domain signal into a time–frequency domain using discrete Fourier transformation. Second, an IRDC-net was constructed to recognize ten Chinese sign language signs. Third, the tandem CNN networks VGG-net and ResNet-18 were compared with our proposed parallel structure network, IRDC-net. Finally, the public dataset Ninapro DB1 was utilized to verify the generalization performance of the IRDC-net. The results showed that after transforming the time domain sEMG signal into the time–frequency domain, the classification accuracy (acc) increased from 84.29% to 91.70% when using the IRDC-net on our sign language dataset. Furthermore, for the time–frequency information of the public dataset Ninapro DB1, the classification accuracy reached 89.82%; this value is higher than that achieved in other recent studies. As such, our findings contribute to research into SLR tasks and to improving deaf and hearing-impaired people’s daily lives. – Name: TypeDocument Label: Document Type Group: TypDoc Data: article in journal/newspaper – Name: Language Label: Language Group: Lang Data: English – Name: NoteTitleSource Label: Relation Group: SrcInfo Data: https://www.mdpi.com/1424-8220/23/13/5775; https://doaj.org/toc/1424-8220; https://doaj.org/article/3b2b3373faec4f0bbd5c30335b2b2465 – Name: DOI Label: DOI Group: ID Data: 10.3390/s23135775 – Name: URL Label: Availability Group: URL Data: https://doi.org/10.3390/s23135775<br />https://doaj.org/article/3b2b3373faec4f0bbd5c30335b2b2465 – Name: AN Label: Accession Number Group: ID Data: edsbas.2E34B0F2 |
| PLink | https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.2E34B0F2 |
| RecordInfo | BibRecord: BibEntity: Identifiers: – Type: doi Value: 10.3390/s23135775 Languages: – Text: English Subjects: – SubjectFull: sign language recognition Type: general – SubjectFull: surface electromyogram Type: general – SubjectFull: inception network Type: general – SubjectFull: residual module Type: general – SubjectFull: dilated convolution Type: general – SubjectFull: Chemical technology Type: general – SubjectFull: TP1-1185 Type: general Titles: – TitleFull: IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography Type: main BibRelationships: HasContributorRelationships: – PersonEntity: Name: NameFull: Xiangrui Wang – PersonEntity: Name: NameFull: Lu Tang – PersonEntity: Name: NameFull: Qibin Zheng – PersonEntity: Name: NameFull: Xilin Yang – PersonEntity: Name: NameFull: Zhiyuan Lu IsPartOfRelationships: – BibEntity: Dates: – D: 01 M: 01 Type: published Y: 2023 Identifiers: – Type: issn-locals Value: edsbas – Type: issn-locals Value: edsbas.oa Titles: – TitleFull: Sensors, Vol 23, Iss 13, p 5775 (2023 Type: main |
| ResultId | 1 |
Nájsť tento článok vo Web of Science