Ohneiser, Oliver and Shetty, Shruthi and Ehr, Heiko and Hobein, Stephanie (2025) Creating Artificial Voice Audio - Text-To-Speech for Air Traffic Control. DLR-Interner Bericht. DLR-IB-FL-BS-2025-37. (Unpublished)
Full text not available from this repository.
Abstract
Automatic speech recognition and understanding (ASRU) has proven to support human operators in air traffic control (ATC) effectively, especially by reducing workload when automated functions take over formerly human tasks. Still, verbal communication remains a time-dominant function in ATC. After the COVID-19 pandemic, the aviation industry faced a shortage of air traffic controllers (ATCos) and pilots, highlighting a significant problem: managing the resources for training new ATCos and pilots. We therefore explore technologies beyond ASRU that can support the operational work and training of ATCos and pilots, focussing on spoken communication. Given the dramatic improvements of speech-to-text (STT) and text-to-speech (TTS) models in recent years, we analyse how TTS can support human aviation operators. To this end, we (1) create new TTS models by fine-tuning existing out-of-domain models, (2) generate new synthetic audio data to be used ad hoc for verbalizing ATC utterances or for training/fine-tuning STT models, and (3) describe future applications of 'Speech Understanding, Generation, and Recognition' (SUGAR) in ATC, such as supporting simulation pilots with readback generation and supporting ATCos with voice command generation. This report is a guide on how to perform these three steps in-house to customize TTS models, artificial audio utterances, and downstream applications of SUGAR.
| Item URL in elib: | https://elib.dlr.de/214878/ |
|---|---|
| Document Type: | Monograph (DLR-Interner Bericht) |
| Title: | Creating Artificial Voice Audio - Text-To-Speech for Air Traffic Control |
| Authors: | Ohneiser, Oliver; Shetty, Shruthi; Ehr, Heiko; Hobein, Stephanie |
| Date: | 2025 |
| Refereed publication: | No |
| Open Access: | No |
| Status: | Unpublished |
| Keywords: | Text-To-Speech; Speech-To-Text |
| HGF - Research field: | Aeronautics, Space and Transport |
| HGF - Program: | Aeronautics |
| HGF - Program Themes: | Air Transportation and Impact |
| DLR - Research area: | Aeronautics |
| DLR - Program: | L AI - Air Transportation and Impact |
| DLR - Research theme (Project): | L - Integrated Flight Guidance |
| Location: | Braunschweig |
| Institutes and Institutions: | Institute of Flight Guidance > Controller Assistance; Institute of Flight Guidance > ATM-Simulation |
| Deposited By: | Ohneiser, Oliver |
| Deposited On: | 06 Aug 2025 11:31 |
| Last Modified: | 06 Aug 2025 11:31 |