Multi-talker Verbal Interaction for Humanoid Robots

Working in multi-talker mode is viable under certain conditions, such as the fusion of audio and video stimuli along with smart adaptive beamforming of received audio signals. In this article, the authors verify part of the researched novel framework, which focuses on adapting to dynamic interlocuto...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:International Conference on Methods and Models in Automation Robot. (Online) s. 521 - 526
Hlavní autori: Klin, Bartlomiej, Beniak, Ryszard, Podpora, Michal, Gardecki, Arkadiusz, Rut, Joanna
Médium: Konferenčný príspevok..
Jazyk:English
Vydavateľské údaje: IEEE 27.08.2024
Predmet:
ISSN:2835-2807
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Working in multi-talker mode is viable under certain conditions, such as the fusion of audio and video stimuli along with smart adaptive beamforming of received audio signals. In this article, the authors verify part of the researched novel framework, which focuses on adapting to dynamic interlocutor's location changes in the engagement zone of humanoid robots during the multi-talker conversation. After evaluating the framework, the authors confirm the necessity of a complementary and independent method of increasing the interlocutor's signal isolation accuracy. It is necessary when video analysis performance plummets. The authors described the leading cause as insufficient performance during dynamic conversations. The video analysis cannot derive a new configuration when the interlocutor's speech apparatus moves beyond the expected margin and the video frame rate drops.
ISSN:2835-2807
DOI:10.1109/MMAR62187.2024.10680820