Oualil, Youssef and Schulder, Marc and Helmke, Hartmut and Schmidt, Anna and Klakow, Dietrich (2015) Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition. Interspeech 2015, 2015-09-06 - 2015-09-10, Dresden, Germany.
Full text not available from this repository.
Abstract
The use of prior situational/contextual knowledge about a given task can significantly improve automatic speech recognition (ASR) performance. This is typically done through adaptation of acoustic or language models if data is available or using knowledge-based rescoring. The main adaptation techniques, however, are either domain-specific, which makes them inadequate for other tasks, or static and offline, and therefore cannot deal with dynamic knowledge. To circumvent this problem, we propose a real-time system which dynamically integrates situational context into ASR. The context integration is done either post-recognition, in which case a weighted Levenshtein distance between the ASR hypotheses and the context information based on the ASR confidence scores is proposed to extract the most likely sequence of spoken words, or pre-recognition, where the search space is adjusted to the new situational knowledge through adaptation of the finite state machine modeling the spoken language. Experiments conducted on 3 hours of Air Traffic Control (ATC) data achieved a 51% reduction of the Command Error Rate (CmdER) which is used as evaluation metric in the ATC domain.
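The post-recognition idea in the abstract (matching ASR hypotheses against context information with a confidence-weighted Levenshtein distance) can be illustrated with a minimal sketch. This is not the authors' implementation: the cost scheme (editing a hypothesis word costs its confidence score, inserting a context word costs a fixed `insert_cost`), the function names `weighted_levenshtein` and `closest_context_command`, and the example words are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple


def weighted_levenshtein(
    hyp: Sequence[str],
    conf: Sequence[float],
    ref: Sequence[str],
    insert_cost: float = 1.0,
) -> float:
    """Edit distance from hyp to ref.

    Assumed cost scheme (hypothetical, not taken from the paper):
    deleting or substituting a hypothesis word costs its confidence,
    so words the recognizer is unsure about are cheap to change.
    """
    if len(hyp) != len(conf):
        raise ValueError("one confidence score per hypothesis word is required")
    n, m = len(hyp), len(ref)
    # dist[i][j]: cheapest way to turn hyp[:i] into ref[:j]
    dist = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = dist[i - 1][0] + conf[i - 1]  # delete a hypothesis word
    for j in range(1, m + 1):
        dist[0][j] = dist[0][j - 1] + insert_cost  # insert a context word
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = 0.0 if hyp[i - 1] == ref[j - 1] else conf[i - 1]
            dist[i][j] = min(
                dist[i - 1][j - 1] + sub,        # match or substitute
                dist[i - 1][j] + conf[i - 1],    # delete
                dist[i][j - 1] + insert_cost,    # insert
            )
    return dist[n][m]


def closest_context_command(
    hyp: Sequence[str],
    conf: Sequence[float],
    candidates: Sequence[Sequence[str]],
) -> Tuple[float, List[str]]:
    """Return (distance, candidate) for the context entry nearest to hyp."""
    scored = [(weighted_levenshtein(hyp, conf, c), list(c)) for c in candidates]
    return min(scored, key=lambda pair: pair[0])


if __name__ == "__main__":
    # Hypothetical hypothesis with per-word confidences; "tree" is uncertain.
    hyp = ["descend", "flight", "level", "tree", "zero"]
    conf = [0.9, 0.9, 0.9, 0.2, 0.9]
    candidates = [
        ["descend", "flight", "level", "three", "zero"],
        ["climb", "flight", "level", "tree", "zero"],
    ]
    print(closest_context_command(hyp, conf, candidates))
```

Under this assumed scheme, a candidate that differs from the hypothesis only at a low-confidence word is preferred over one that contradicts a high-confidence word, which is the intuition behind weighting the distance by ASR confidence.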
| Item URL in elib: | https://elib.dlr.de/96937/ |
|---|---|
| Document Type: | Conference or Workshop Item (Speech) |
| Title: | Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition |
| Authors: | Oualil, Youssef; Schulder, Marc; Helmke, Hartmut; Schmidt, Anna; Klakow, Dietrich |
| Date: | September 2015 |
| Refereed publication: | Yes |
| Open Access: | No |
| Gold Open Access: | No |
| In SCOPUS: | No |
| In ISI Web of Science: | No |
| Page Range: | pp. 1-5 |
| Status: | Published |
| Keywords: | speech recognition, situational context, Levenshtein distance |
| Event Title: | Interspeech 2015 |
| Event Location: | Dresden, Germany |
| Event Type: | International Conference |
| Event Start Date: | 6 September 2015 |
| Event End Date: | 10 September 2015 |
| Organizer: | Technische Universität Berlin |
| HGF - Research field: | Aeronautics, Space and Transport |
| HGF - Program: | Aeronautics |
| HGF - Program Themes: | air traffic management and operations |
| DLR - Research area: | Aeronautics |
| DLR - Program: | L AO - Air Traffic Management and Operation |
| DLR - Research theme (Project): | L - Efficient Flight Guidance (old) |
| Location: | Braunschweig |
| Institutes and Institutions: | Institute of Flight Guidance > Controller Assistance |
| Deposited By: | Diederich, Kerstin |
| Deposited On: | 02 Jul 2015 15:09 |
| Last Modified: | 24 Apr 2024 20:02 |