MP-GestLSTM: real time gesture detection using MediaPipe and LSTM.

Uložené v:
Podrobná bibliografia
Názov: MP-GestLSTM: real time gesture detection using MediaPipe and LSTM.
Autori: Varshini, T.S., Rukmani, P.
Zdroj: Systems Science & Control Engineering; Dec2025, Vol. 13 Issue 1, p1-18, 18p
Predmety: SIGN language, LONG short-term memory, DEEP learning, BODY language
Abstrakt: Sign language is a crucial form of communication for individuals with hearing impairments; thus, the development of accurate and efficient sign language recognition systems is of significant importance. The main aim of the work is to develop an accurate and efficient sign language recognition model using MediaPipe (MP) and deep learning techniques. MediaPipe (BlazePose) is a top-down model with an encoder-decoder architecture to predict the keypoints for all joints. MediaPipe detects 33 body(pose) landmarks, 21 hand landmarks, and 478 3-dimensional face landmarks. A custom dataset called MP-Gest consists of 800 videos belonging to 20 classes at the word level of American Sign Language (ASL) is generated by two signers. A selection of MediaPipe keypoints is extracted from this custom dataset and used to train a Long Short-Term Memory (LSTM)-based custom model called MP-GestLSTM. The MP-GestLSTM model achieved a validation accuracy of 93.75% and testing accuracy of 94.38%, indicating that the model outperformed existing research with minimal error. The proposed model gives enhanced recognition accuracy using a simple custom architecture and contributes to the advancement of sign language recognition by overcoming the existing limitations. [ABSTRACT FROM AUTHOR]
Copyright of Systems Science & Control Engineering is the property of Taylor & Francis Ltd and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Databáza: Complementary Index
Buďte prvý, kto okomentuje tento záznam!
Najprv sa musíte prihlásiť.