elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition

Oualil, Youssef and Schulder, Marc and Helmke, Hartmut and Schmidt, Anna and Klakow, Dietrich (2015) Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition. Interspeech 2015, 2015-09-06 - 2015-09-10, Dresden, Deutschland.

Full text not available from this repository.

Abstract

The use of prior situational/contextual knowledge about a given task can significantly improve automatic speech recognition (ASR) performance. This is typically done through adaptation of acoustic or language models if data is available or using knowledge-based rescoring. The main adaptation techniques, however, are either domain-specific, which makes them inadequate for other tasks, or static and offline, and therefore cannot deal with dynamic knowledge. To circumvent this problem, we propose a real-time system which dynamically integrates situational context into ASR. The context integration is done either post-recognition, in which case a weighted Levenshtein distance between the ASR hypotheses and the context information based on the ASR confidence scores is proposed to extract the most likely sequence of spoken words, or pre-recognition, where the search space is adjusted to the new situational knowledge through adaptation of the finite state machine modeling the spoken language. Experiments conducted on 3 hours of Air Traffic Control (ATC) data achieved a 51% reduction of the Command Error Rate (CmdER) which is used as evaluation metric in the ATC domain.

Item URL in elib:https://elib.dlr.de/96937/
Document Type:Conference or Workshop Item (Speech)
Title:Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Oualil, YoussefUdSUNSPECIFIEDUNSPECIFIED
Schulder, MarcUdSUNSPECIFIEDUNSPECIFIED
Helmke, HartmutUNSPECIFIEDhttps://orcid.org/0000-0002-1939-0200UNSPECIFIED
Schmidt, AnnaUdSUNSPECIFIEDUNSPECIFIED
Klakow, DietrichUdSUNSPECIFIEDUNSPECIFIED
Date:September 2015
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In SCOPUS:No
In ISI Web of Science:No
Page Range:pp. 1-5
Status:Published
Keywords:speech recognition, situational context, Levenshtein distance
Event Title:Interspeech 2015
Event Location:Dresden, Deutschland
Event Type:international Conference
Event Start Date:6 September 2015
Event End Date:10 September 2015
Organizer:Technische Universität Berlin
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Aeronautics
HGF - Program Themes:air traffic management and operations
DLR - Research area:Aeronautics
DLR - Program:L AO - Air Traffic Management and Operation
DLR - Research theme (Project):L - Efficient Flight Guidance (old)
Location: Braunschweig
Institutes and Institutions:Institute of Flight Guidance > Controller Assistance
Deposited By: Diederich, Kerstin
Deposited On:02 Jul 2015 15:09
Last Modified:24 Apr 2024 20:02

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.