Multi-talker Verbal Interaction for Humanoid Robots

Working in multi-talker mode is viable under certain conditions, such as the fusion of audio and video stimuli along with smart adaptive beamforming of received audio signals. In this article, the authors verify part of the researched novel framework, which focuses on adapting to dynamic interlocuto...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:International Conference on Methods and Models in Automation Robot. (Online) s. 521 - 526
Hlavní autoři: Klin, Bartlomiej, Beniak, Ryszard, Podpora, Michal, Gardecki, Arkadiusz, Rut, Joanna
Médium: Konferenční příspěvek
Jazyk:angličtina
Vydáno: IEEE 27.08.2024
Témata:
ISSN:2835-2807
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Working in multi-talker mode is viable under certain conditions, such as the fusion of audio and video stimuli along with smart adaptive beamforming of received audio signals. In this article, the authors verify part of the researched novel framework, which focuses on adapting to dynamic interlocutor's location changes in the engagement zone of humanoid robots during the multi-talker conversation. After evaluating the framework, the authors confirm the necessity of a complementary and independent method of increasing the interlocutor's signal isolation accuracy. It is necessary when video analysis performance plummets. The authors described the leading cause as insufficient performance during dynamic conversations. The video analysis cannot derive a new configuration when the interlocutor's speech apparatus moves beyond the expected margin and the video frame rate drops.
ISSN:2835-2807
DOI:10.1109/MMAR62187.2024.10680820