Context-Aware Speech Interface for Human-Robot Interaction

Tülin, İzer Kaptan (2017) Context-Aware Speech Interface for Human-Robot Interaction. Master's. DLR-Interner Bericht. DLR-IB-RM-OP-2017-271. (In Press)

Abstract

The focus of this thesis is the emerging field of human-robot interaction (HRI). Speech is the most intuitive means of communication between humans. This thesis aims to introduce speech recognition capabilities to RAZER, an intuitive graphical human-robot interface with an integrated framework. Additionally, the architecture of the framework is extended generically to support multiple interaction methods. While interaction via a graphical user interface (GUI) is effective, it may not be suitable when the user needs their hands free. Thus, further interaction methods are needed as an alternative or a complement to the GUI. In the scope of this thesis, different speech recognition tools and the limitations of speech recognition are analyzed. A custom language model is created for speech decoding. An I/O service is implemented to support different interaction methods. A speech interaction concept is implemented using CMU Pocketsphinx and this custom language model. To evaluate the system, a user study is conducted, assessing usefulness and efficiency. Furthermore, the Google Speech API and CMU Pocketsphinx are compared in terms of accuracy, with Pocketsphinx tested using both a generic and a customized language model. The results suggest that a speech interface is a useful complement to the GUI. According to the accuracy tests, CMU Pocketsphinx with the customized language model is the most accurate of the three configurations. The Google Speech API ranks second, and CMU Pocketsphinx with the generic language model delivers the worst results.

Item URL in elib:https://elib.dlr.de/117405/
Document Type:Monograph (DLR-Interner Bericht, Master's)
Title:Context-Aware Speech Interface for Human-Robot Interaction
Authors:
Authors | Institution or Email of Authors | Authors ORCID iD
Tülin, İzer Kaptan | tulinizer (at) gmail.com | UNSPECIFIED
Date:December 2017
Refereed publication:No
Open Access:No
Gold Open Access:No
In SCOPUS:No
In ISI Web of Science:No
Status:In Press
Keywords:Speech recognition, Human-Robot Interface
Institution:Technische Universität München
Department:Fakultät für Informatik
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Space Technology
DLR - Research area:Space
DLR - Program:R SY - Space System Technology
DLR - Research theme (Project):R - Project Intelligent Mobility
Location: Oberpfaffenhofen
Institutes and Institutions:Institute of Robotics and Mechatronics (since 2013) > Cognitive Robotics
Deposited By: Steinmetz, Franz
Deposited On:19 Dec 2017 15:01
Last Modified:19 Dec 2017 15:01
