Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition

Oualil, Youssef und Schulder, Marc und Helmke, Hartmut und Schmidt, Anna und Klakow, Dietrich (2015) Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition. Interspeech 2015, 2015-09-06 - 2015-09-10, Dresden, Deutschland.

Dieses Archiv kann nicht den Volltext zur Verfügung stellen.

Kurzfassung

The use of prior situational/contextual knowledge about a given task can significantly improve automatic speech recognition (ASR) performance. This is typically done through adaptation of acoustic or language models if data is available or using knowledge-based rescoring. The main adaptation techniques, however, are either domain-specific, which makes them inadequate for other tasks, or static and offline, and therefore cannot deal with dynamic knowledge. To circumvent this problem, we propose a real-time system which dynamically integrates situational context into ASR. The context integration is done either post-recognition, in which case a weighted Levenshtein distance between the ASR hypotheses and the context information based on the ASR confidence scores is proposed to extract the most likely sequence of spoken words, or pre-recognition, where the search space is adjusted to the new situational knowledge through adaptation of the finite state machine modeling the spoken language. Experiments conducted on 3 hours of Air Traffic Control (ATC) data achieved a 51% reduction of the Command Error Rate (CmdER) which is used as evaluation metric in the ATC domain.

elib-URL des Eintrags:

https://elib.dlr.de/96937/

Dokumentart:

Konferenzbeitrag (Vortrag)

Titel:

Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition

Autoren:

Autoren	Institution oder E-Mail-Adresse	Autoren-ORCID-iD	ORCID Put Code
Oualil, Youssef	UdS	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Schulder, Marc	UdS	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Helmke, Hartmut	Hartmut.Helmke (at) dlr.de	https://orcid.org/0000-0002-1939-0200	NICHT SPEZIFIZIERT
Schmidt, Anna	UdS	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Klakow, Dietrich	UdS	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT

Datum:

September 2015

Referierte Publikation:

Open Access:

Nein

Gold Open Access:

Nein

In SCOPUS:

Nein

In ISI Web of Science:

Nein

Seitenbereich:

Seiten 1-5

Status:

veröffentlicht

Stichwörter:

speech recognition, situational context, Levenshtein distance

Veranstaltungstitel:

Interspeech 2015

Veranstaltungsort:

Dresden, Deutschland

Veranstaltungsart:

internationale Konferenz

Veranstaltungsbeginn:

6 September 2015

Veranstaltungsende:

10 September 2015

Veranstalter :

Technische Universität Berlin

HGF - Forschungsbereich:

Luftfahrt, Raumfahrt und Verkehr

HGF - Programm:

Luftfahrt

HGF - Programmthema:

Luftverkehrsmanagement und Flugbetrieb

DLR - Schwerpunkt:

Luftfahrt

DLR - Forschungsgebiet:

L AO - Air Traffic Management and Operation

DLR - Teilgebiet (Projekt, Vorhaben):

L - Effiziente Flugführung (alt)

Standort:

Braunschweig

Institute & Einrichtungen:

Institut für Flugführung > Lotsenassistenz

Hinterlegt von:

Diederich, Kerstin

Hinterlegt am:

02 Jul 2015 15:09

Letzte Änderung:

24 Apr 2024 20:02

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags