elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR

Prasad, Amrutha and Zuluaga-Gomez, Juan Pablo and Motlicek, Petr and Ohneiser, Oliver and Helmke, Hartmut and Sarfjoo, Saeed and Nigmatulina, Iuliia (2021) Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR. Interspeech 2021 Satellite Workshop "Automatic Speech Recognition in Air Traffic Management (ASR-ATM)", 2021-08-30, Brno, Tschechien (hybrid).

Full text not available from this repository.

Official URL: https://www.haawaii.de/wp/wp-content/uploads/2021/08/Multi_task_learning_Interspeech2021.pdf

Abstract

Assistant Based Speech Recognition (ABSR) for air traffic control is generally trained by pooling both Air Traffic Controller (ATCO) and pilot data. In practice, this is motivated by the fact that the proportion of pilot data is lesser compared to ATCO while their standard language of communication is similar. However, due to data imbalance of ATCO and pilot and their varying acoustic conditions, the ASR performance is usually significantly better for ATCOs than pilots. In this paper, we propose to (1) split the ATCO and pilot data using an automatic approach exploiting ASR transcripts, and (2) consider ATCO and pilot ASR as two separate tasks for Acoustic Model (AM) training. For speaker role classification of ATCO and pilot data, a hypothesized ASR transcript is generated with a seed model, subsequently used to classify the speaker role based on the knowledge extracted from grammar defined by International Civil Aviation Organization (ICAO). This approach provides an average speaker role identification accuracy of 83% for ATCO and pilot. Finally, we show that training AMs separately for each task, or using a multitask approach is well suited for this data compared to AM trained by pooling all data.

Item URL in elib:https://elib.dlr.de/143895/
Document Type:Conference or Workshop Item (Speech)
Title:Grammar Based Identification Of Speaker Role For Improving ATCO And Pilot ASR
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Prasad, AmruthaIdiap, BUTUNSPECIFIEDUNSPECIFIED
Zuluaga-Gomez, Juan PabloIdiap, EPFLUNSPECIFIEDUNSPECIFIED
Motlicek, PetrUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Ohneiser, OliverUNSPECIFIEDhttps://orcid.org/0000-0002-5411-691XUNSPECIFIED
Helmke, HartmutUNSPECIFIEDhttps://orcid.org/0000-0002-1939-0200UNSPECIFIED
Sarfjoo, SaeedIdiapUNSPECIFIEDUNSPECIFIED
Nigmatulina, IuliiaIdiapUNSPECIFIEDUNSPECIFIED
Date:2021
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In SCOPUS:No
In ISI Web of Science:No
Status:Published
Keywords:assistant based speech recognition; air traffic management; multitask acoustic model; speaker classification
Event Title:Interspeech 2021 Satellite Workshop "Automatic Speech Recognition in Air Traffic Management (ASR-ATM)"
Event Location:Brno, Tschechien (hybrid)
Event Type:Workshop
Event Date:30 August 2021
Organizer:BUT, DLR, Idiap
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Aeronautics
HGF - Program Themes:Air Transportation and Impact
DLR - Research area:Aeronautics
DLR - Program:L AI - Air Transportation and Impact
DLR - Research theme (Project):L - Human Factors
Location: Braunschweig
Institutes and Institutions:Institute of Flight Guidance > Controller Assistance
Deposited By: Ohneiser, Oliver
Deposited On:14 Sep 2021 10:22
Last Modified:24 Apr 2024 20:43

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.