IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography

Uloženo v:
Podrobná bibliografie
Název: IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
Autoři: Xiangrui Wang, Lu Tang, Qibin Zheng, Xilin Yang, Zhiyuan Lu
Zdroj: Sensors, Vol 23, Iss 13, p 5775 (2023)
Informace o vydavateli: MDPI AG
Rok vydání: 2023
Sbírka: Directory of Open Access Journals: DOAJ Articles
Témata: sign language recognition, surface electromyogram, inception network, residual module, dilated convolution, Chemical technology, TP1-1185
Popis: Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used in most CNN-based studies inadequately captures the features of the input data, we propose a novel inception architecture with a residual module and dilated convolution (IRDC-net) to enlarge the receptive fields and enrich the feature maps, applying it to SLR tasks for the first time. This work first transformed the time domain signal into a time–frequency domain using discrete Fourier transformation. Second, an IRDC-net was constructed to recognize ten Chinese sign language signs. Third, the tandem CNN networks VGG-net and ResNet-18 were compared with our proposed parallel structure network, IRDC-net. Finally, the public dataset Ninapro DB1 was utilized to verify the generalization performance of the IRDC-net. The results showed that after transforming the time domain sEMG signal into the time–frequency domain, the classification accuracy (acc) increased from 84.29% to 91.70% when using the IRDC-net on our sign language dataset. Furthermore, for the time–frequency information of the public dataset Ninapro DB1, the classification accuracy reached 89.82%; this value is higher than that achieved in other recent studies. As such, our findings contribute to research into SLR tasks and to improving deaf and hearing-impaired people’s daily lives.
Druh dokumentu: article in journal/newspaper
Jazyk: English
Relation: https://www.mdpi.com/1424-8220/23/13/5775; https://doaj.org/toc/1424-8220; https://doaj.org/article/3b2b3373faec4f0bbd5c30335b2b2465
DOI: 10.3390/s23135775
Dostupnost: https://doi.org/10.3390/s23135775
https://doaj.org/article/3b2b3373faec4f0bbd5c30335b2b2465
Přístupové číslo: edsbas.2E34B0F2
Databáze: BASE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://doi.org/10.3390/s23135775#
    Name: EDS - BASE (s4221598)
    Category: fullText
    Text: View record from BASE
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Wang%20X
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsbas
DbLabel: BASE
An: edsbas.2E34B0F2
RelevancyScore: 944
AccessLevel: 3
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 943.653564453125
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Xiangrui+Wang%22">Xiangrui Wang</searchLink><br /><searchLink fieldCode="AR" term="%22Lu+Tang%22">Lu Tang</searchLink><br /><searchLink fieldCode="AR" term="%22Qibin+Zheng%22">Qibin Zheng</searchLink><br /><searchLink fieldCode="AR" term="%22Xilin+Yang%22">Xilin Yang</searchLink><br /><searchLink fieldCode="AR" term="%22Zhiyuan+Lu%22">Zhiyuan Lu</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: Sensors, Vol 23, Iss 13, p 5775 (2023)
– Name: Publisher
  Label: Publisher Information
  Group: PubInfo
  Data: MDPI AG
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2023
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: Directory of Open Access Journals: DOAJ Articles
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22sign+language+recognition%22">sign language recognition</searchLink><br /><searchLink fieldCode="DE" term="%22surface+electromyogram%22">surface electromyogram</searchLink><br /><searchLink fieldCode="DE" term="%22inception+network%22">inception network</searchLink><br /><searchLink fieldCode="DE" term="%22residual+module%22">residual module</searchLink><br /><searchLink fieldCode="DE" term="%22dilated+convolution%22">dilated convolution</searchLink><br /><searchLink fieldCode="DE" term="%22Chemical+technology%22">Chemical technology</searchLink><br /><searchLink fieldCode="DE" term="%22TP1-1185%22">TP1-1185</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used in most CNN-based studies inadequately captures the features of the input data, we propose a novel inception architecture with a residual module and dilated convolution (IRDC-net) to enlarge the receptive fields and enrich the feature maps, applying it to SLR tasks for the first time. This work first transformed the time domain signal into a time–frequency domain using discrete Fourier transformation. Second, an IRDC-net was constructed to recognize ten Chinese sign language signs. Third, the tandem CNN networks VGG-net and ResNet-18 were compared with our proposed parallel structure network, IRDC-net. Finally, the public dataset Ninapro DB1 was utilized to verify the generalization performance of the IRDC-net. The results showed that after transforming the time domain sEMG signal into the time–frequency domain, the classification accuracy (acc) increased from 84.29% to 91.70% when using the IRDC-net on our sign language dataset. Furthermore, for the time–frequency information of the public dataset Ninapro DB1, the classification accuracy reached 89.82%; this value is higher than that achieved in other recent studies. As such, our findings contribute to research into SLR tasks and to improving deaf and hearing-impaired people’s daily lives.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: article in journal/newspaper
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: NoteTitleSource
  Label: Relation
  Group: SrcInfo
  Data: https://www.mdpi.com/1424-8220/23/13/5775; https://doaj.org/toc/1424-8220; https://doaj.org/article/3b2b3373faec4f0bbd5c30335b2b2465
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.3390/s23135775
– Name: URL
  Label: Availability
  Group: URL
  Data: https://doi.org/10.3390/s23135775<br />https://doaj.org/article/3b2b3373faec4f0bbd5c30335b2b2465
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsbas.2E34B0F2
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.2E34B0F2
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.3390/s23135775
    Languages:
      – Text: English
    Subjects:
      – SubjectFull: sign language recognition
        Type: general
      – SubjectFull: surface electromyogram
        Type: general
      – SubjectFull: inception network
        Type: general
      – SubjectFull: residual module
        Type: general
      – SubjectFull: dilated convolution
        Type: general
      – SubjectFull: Chemical technology
        Type: general
      – SubjectFull: TP1-1185
        Type: general
    Titles:
      – TitleFull: IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Xiangrui Wang
      – PersonEntity:
          Name:
            NameFull: Lu Tang
      – PersonEntity:
          Name:
            NameFull: Qibin Zheng
      – PersonEntity:
          Name:
            NameFull: Xilin Yang
      – PersonEntity:
          Name:
            NameFull: Zhiyuan Lu
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Type: published
              Y: 2023
          Identifiers:
            – Type: issn-locals
              Value: edsbas
            – Type: issn-locals
              Value: edsbas.oa
          Titles:
            – TitleFull: Sensors, Vol 23, Iss 13, p 5775 (2023
              Type: main
ResultId 1