DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances

Ohneiser, Oliver and Sarfjoo, Saeed and Helmke, Hartmut and Shetty, Shruthi and Motlicek, Petr and Kleinert, Matthias and Ehr, Heiko and Murauskas, Sarunas (2021) Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances. In: 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021, 1, 266 -270. INTERSPEECH 2021, 2021-08-30 - 2021-09-03, Brno, Tschechien (hybrid). doi: 10.21437/Interspeech.2021-935. ISBN 978-171383690-2. ISSN 2308-457X.

Full text not available from this repository.

Official URL: https://www.isca-speech.org/archive/interspeech_2021/ohneiser21_interspeech.html


The maturity of automatic speech recognition (ASR) systems at controller working positions is currently a highly relevant technological topic in air traffic control (ATC). However, ATC service providers are less interested in pure word error rate (WER). They want to see benefits of ASR applications for ATC. Such applications transform recognized word sequences into semantic meanings, i.e., a number of related concepts such as callsign, type, value, unit, etc., which are combined to form commands. Digitized concepts or recognized commands can enter ATC systems based on an ontology for utterance annotation agreed between European ATC stakeholders. Command recognition (CR) has already been performed in approach control. However, spoken utterances of tower controllers are longer, include more free speech, and contain other command types than in approach. An automatic CR rate of 95.8% is achievable on perfect word recognition, i.e., manually transcribed audio recordings (gold transcriptions), taken from Lithuanian controllers in a multiple remote tower environment. This paper presents CR results for various speech-to-text models with different WERs on tower utterances. Although WERs were around 9%, we achieve CR rates of 85%. CR rates only slightly decrease with higher WERs, which enables to bring ASR applications closer to operational ATC environment.

Item URL in elib:https://elib.dlr.de/143893/
Document Type:Conference or Workshop Item (Speech)
Title:Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Ohneiser, OliverUNSPECIFIEDhttps://orcid.org/0000-0002-5411-691XUNSPECIFIED
Helmke, HartmutUNSPECIFIEDhttps://orcid.org/0000-0002-1939-0200UNSPECIFIED
Kleinert, MatthiasUNSPECIFIEDhttps://orcid.org/0000-0002-0782-4147UNSPECIFIED
Murauskas, SarunasOro navigacija (ON)UNSPECIFIEDUNSPECIFIED
Journal or Publication Title:22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In ISI Web of Science:Yes
Page Range:266 -270
Keywords:speech recognition; speech understanding; command recognition rate; air traffic control; tower utterances
Event Title:INTERSPEECH 2021
Event Location:Brno, Tschechien (hybrid)
Event Type:international Conference
Event Start Date:30 August 2021
Event End Date:3 September 2021
Organizer:ISCA, Brno University of Technology (BUT)
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Aeronautics
HGF - Program Themes:Air Transportation and Impact
DLR - Research area:Aeronautics
DLR - Program:L AI - Air Transportation and Impact
DLR - Research theme (Project):L - Human Factors
Location: Braunschweig
Institutes and Institutions:Institute of Flight Guidance > Controller Assistance
Deposited By: Ohneiser, Oliver
Deposited On:14 Sep 2021 10:22
Last Modified:24 Apr 2024 20:43

Repository Staff Only: item control page

Help & Contact
electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.