elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Context-Aware Speech Interface for Human-Robot Interaction

Tülin, İzer Kaptan (2017) Context-Aware Speech Interface for Human-Robot Interaction. DLR-Interner Bericht. DLR-IB-RM-OP-2017-271. Master's. Technische Universität München.

[img] PDF - Only accessible within DLR
5MB

Abstract

The focus of this thesis is the emerging field of human-robot interaction (HRI). Speech is the most intuitive way of communication between humans. This thesis aims at introducing speech recognition capabilities to RAZER, an intuitive graphical human-robot interface with an integrated framework. Additionally, the architecture of the framework is extended generically in order to support multiple interaction methods. While interaction via a graphical user interface (GUI) is effective, it may not be suitable for the cases where the user needs their hands to be free. Thus, further interaction methods are needed as an alternative or as complementary to the GUI. In the scope of this thesis, different speech recognition tools and speech recognition limitations are analyzed. A custom language model is created for speech decoding. An I/O service is implemented in order to support different interaction methods. A speech interaction concept is implemented via CMU Pocketsphinx and the custom language model created. For the evaluation of the system, a user study is conducted. The system is evaluated in terms of usefulness and efficiency. Furthermore, Google Speech API and CMU Pocketsphinx tools are compared using both generic and customized language models for Pocketsphinx by means of accuracy. The results suggest that speech interface is a useful complement to the GUI. According to accuracy tests, CMU Pocketsphinx with the customized language model is the most accurate amongst three tools. Google Speech API is the second best tool and CMU Pocketsphinx with the generic language model delivered the worst results.

Item URL in elib:https://elib.dlr.de/117405/
Document Type:Monograph (DLR-Interner Bericht, Master's)
Title:Context-Aware Speech Interface for Human-Robot Interaction
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Tülin, İzer KaptanUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:December 2017
Refereed publication:No
Open Access:No
Status:Published
Keywords:Speech recognition, Human-Robot Interface
Institution:Technische Universität München
Department:Fakultät für Informatik
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Space System Technology
DLR - Research area:Raumfahrt
DLR - Program:R SY - Space System Technology
DLR - Research theme (Project):R - Vorhaben Intelligente Mobilität (old)
Location: Oberpfaffenhofen
Institutes and Institutions:Institute of Robotics and Mechatronics (since 2013) > Cognitive Robotics
Deposited By: Steinmetz, Franz
Deposited On:19 Dec 2017 15:01
Last Modified:24 Jan 2020 11:12

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.