Oualil, Youssef und Schulder, Marc und Helmke, Hartmut und Schmidt, Anna und Klakow, Dietrich (2015) Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition. Interspeech 2015, 2015-09-06 - 2015-09-10, Dresden, Deutschland.
Dieses Archiv kann nicht den Volltext zur Verfügung stellen.
Kurzfassung
The use of prior situational/contextual knowledge about a given task can significantly improve automatic speech recognition (ASR) performance. This is typically done through adaptation of acoustic or language models if data is available or using knowledge-based rescoring. The main adaptation techniques, however, are either domain-specific, which makes them inadequate for other tasks, or static and offline, and therefore cannot deal with dynamic knowledge. To circumvent this problem, we propose a real-time system which dynamically integrates situational context into ASR. The context integration is done either post-recognition, in which case a weighted Levenshtein distance between the ASR hypotheses and the context information based on the ASR confidence scores is proposed to extract the most likely sequence of spoken words, or pre-recognition, where the search space is adjusted to the new situational knowledge through adaptation of the finite state machine modeling the spoken language. Experiments conducted on 3 hours of Air Traffic Control (ATC) data achieved a 51% reduction of the Command Error Rate (CmdER) which is used as evaluation metric in the ATC domain.
| elib-URL des Eintrags: | https://elib.dlr.de/96937/ | ||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dokumentart: | Konferenzbeitrag (Vortrag) | ||||||||||||||||||||||||
| Titel: | Real-Time Integration of Dynamic Context Information for Improving Automatic Speech Recognition | ||||||||||||||||||||||||
| Autoren: |
| ||||||||||||||||||||||||
| Datum: | September 2015 | ||||||||||||||||||||||||
| Referierte Publikation: | Ja | ||||||||||||||||||||||||
| Open Access: | Nein | ||||||||||||||||||||||||
| Gold Open Access: | Nein | ||||||||||||||||||||||||
| In SCOPUS: | Nein | ||||||||||||||||||||||||
| In ISI Web of Science: | Nein | ||||||||||||||||||||||||
| Seitenbereich: | Seiten 1-5 | ||||||||||||||||||||||||
| Status: | veröffentlicht | ||||||||||||||||||||||||
| Stichwörter: | speech recognition, situational context, Levenshtein distance | ||||||||||||||||||||||||
| Veranstaltungstitel: | Interspeech 2015 | ||||||||||||||||||||||||
| Veranstaltungsort: | Dresden, Deutschland | ||||||||||||||||||||||||
| Veranstaltungsart: | internationale Konferenz | ||||||||||||||||||||||||
| Veranstaltungsbeginn: | 6 September 2015 | ||||||||||||||||||||||||
| Veranstaltungsende: | 10 September 2015 | ||||||||||||||||||||||||
| Veranstalter : | Technische Universität Berlin | ||||||||||||||||||||||||
| HGF - Forschungsbereich: | Luftfahrt, Raumfahrt und Verkehr | ||||||||||||||||||||||||
| HGF - Programm: | Luftfahrt | ||||||||||||||||||||||||
| HGF - Programmthema: | Luftverkehrsmanagement und Flugbetrieb | ||||||||||||||||||||||||
| DLR - Schwerpunkt: | Luftfahrt | ||||||||||||||||||||||||
| DLR - Forschungsgebiet: | L AO - Air Traffic Management and Operation | ||||||||||||||||||||||||
| DLR - Teilgebiet (Projekt, Vorhaben): | L - Effiziente Flugführung (alt) | ||||||||||||||||||||||||
| Standort: | Braunschweig | ||||||||||||||||||||||||
| Institute & Einrichtungen: | Institut für Flugführung > Lotsenassistenz | ||||||||||||||||||||||||
| Hinterlegt von: | Diederich, Kerstin | ||||||||||||||||||||||||
| Hinterlegt am: | 02 Jul 2015 15:09 | ||||||||||||||||||||||||
| Letzte Änderung: | 24 Apr 2024 20:02 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags