Creating Artificial Voice Audio - Text-To-Speech for Air Traffic Control

Ohneiser, Oliver and Shetty, Shruthi and Ehr, Heiko and Hobein, Stephanie (2025) Creating Artificial Voice Audio - Text-To-Speech for Air Traffic Control. DLR-Interner Bericht. DLR-IB-FL-BS-2025-37. (Unpublished)

Full text not available from this repository.

Abstract

Automatic speech recognition and understanding (ASRU) has proven to effectively support human operators in air traffic control (ATC), especially by reducing workload when automated functions take over formerly human tasks. Still, verbal communication remains a time-dominant function in ATC. After the COVID-19 pandemic, the aviation industry faced a shortage of air traffic controllers (ATCos) and pilots, highlighting a significant problem: managing the resources needed to train new ATCos and pilots. We therefore explore technologies beyond ASRU that can support the operational work and training of ATCos and pilots, focussing on spoken communication. Given the dramatic improvements of speech-to-text (STT) and text-to-speech (TTS) models in recent years, we analyse how TTS can support human aviation operators. To this end, we (1) create new TTS models by fine-tuning existing out-of-domain models, (2) generate new synthetic audio data to be used ad hoc for verbalizing ATC utterances or for training and fine-tuning STT models, and (3) describe future applications of 'Speech Understanding, Generation, and Recognition' (SUGAR) in ATC, such as supporting simulation pilots with readback generation and supporting ATCos with voice command generation. This report is a guide on how to perform these three steps in-house to customize TTS models, artificial audio utterances, and downstream applications of SUGAR.

Item URL in elib: https://elib.dlr.de/214878/
Document Type: Monograph (DLR-Interner Bericht)
Title: Creating Artificial Voice Audio - Text-To-Speech for Air Traffic Control
Authors:
Authors | Institution or Email of Authors | Author's ORCID iD | ORCID Put Code
Ohneiser, Oliver | Oliver.Ohneiser (at) dlr.de | https://orcid.org/0000-0002-5411-691X | UNSPECIFIED
Shetty, Shruthi | shruthi.shetty (at) dlr.d | UNSPECIFIED | UNSPECIFIED
Ehr, Heiko | Heiko.Ehr (at) dlr.de | UNSPECIFIED | UNSPECIFIED
Hobein, Stephanie | stephanie.hobein (at) dlr.de | UNSPECIFIED | UNSPECIFIED
Date: 2025
Refereed publication: No
Open Access: No
Status: Unpublished
Keywords: Text-To-Speech; Speech-To-Text
HGF - Research field: Aeronautics, Space and Transport
HGF - Program: Aeronautics
HGF - Program Themes: Air Transportation and Impact
DLR - Research area: Aeronautics
DLR - Program: L AI - Air Transportation and Impact
DLR - Research theme (Project): L - Integrated Flight Guidance
Location: Braunschweig
Institutes and Institutions: Institute of Flight Guidance > Controller Assistance
Institute of Flight Guidance > ATM-Simulation
Deposited By: Ohneiser, Oliver
Deposited On: 06 Aug 2025 11:31
Last Modified: 06 Aug 2025 11:31
