elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Impressum | Datenschutz | Barrierefreiheit | Kontakt | English
Schriftgröße: [-] Text [+]

Text-to-Speech Application for Training of Aviation Radio Telephony Communication Operators

Ohneiser, Oliver und Ahmed, Umair (2025) Text-to-Speech Application for Training of Aviation Radio Telephony Communication Operators. IEEE Transactions on Aerospace and Electronic Systems, 61 (2), Seiten 4542-4560. IEEE - Institute of Electrical and Electronics Engineers. doi: 10.1109/TAES.2024.3504493. ISSN 0018-9251.

[img] PDF - Postprintversion (akzeptierte Manuskriptversion)
1MB

Kurzfassung

Air Traffic Control (ATC) and its dedicated radio telephony communication are critical components of safe and efficient air traffic. After the COVID-19 pandemic, the aviation industry faced a shortage of air traffic controllers (ATCos) and pilots, highlighting a significant problem: managing resources for training new ATCos and pilots.This paper explores using a text-to-speech application (TTS app) to simulate aviation radio telephony communication. The app utilizes open-source pre-trained TTS models fine-tuned using publicly available ATC communication-specific datasets. It synthesizes textual ATC utterances to simulate ATCo instructions and pilot responses, creating a realistic two-way communication scenario. It includes twenty ATCo and eight pilot voice models developed using two distinct fine-tuning approaches: (1) an end-to-end TTS method leveraging deep learning techniques and (2) a voice cloning method supporting multi-lingual speech generation.The app was evaluated in an online study by 20 international study subjects, comprising 14 ATCos, 4 pilots, and 2 individuals from other aviation backgrounds. The performance of the voice models varied across different aspects of audio quality such as overall experience, clarity, pronunciation, intonation, naturalness, and speed due to more than 4100 subjective rating values. The voice cloning models were rated significantly better overall than the end-to-end models. The female voice cloning models were rated significantly better overall than the male voice cloning models-both fine-tuned with ATCo data. The majority of voice cloning models especially for ATCo utterances received average overall ratings between 4 and 4.5 out of the highest score of 5. More than 83% of ratings classified the audio articulation speed as optimal. While some issues on pauses, pronunciation, and volume consistency were noted, the general feedback, especially on realism, showed the feasibility of TTS for ATC communication training. The possibility to synthesize speech faster than real-time and initial explorations of Large Language Models for TTS show that developing operational downstream applications is on the horizon.

elib-URL des Eintrags:https://elib.dlr.de/207830/
Dokumentart:Zeitschriftenbeitrag
Titel:Text-to-Speech Application for Training of Aviation Radio Telephony Communication Operators
Autoren:
AutorenInstitution oder E-Mail-AdresseAutoren-ORCID-iDORCID Put Code
Ohneiser, OliverOliver.Ohneiser (at) dlr.dehttps://orcid.org/0000-0002-5411-691X185331785
Ahmed, UmairTU ClausthalNICHT SPEZIFIZIERTNICHT SPEZIFIZIERT
Datum:14 April 2025
Erschienen in:IEEE Transactions on Aerospace and Electronic Systems
Referierte Publikation:Ja
Open Access:Ja
Gold Open Access:Nein
In SCOPUS:Ja
In ISI Web of Science:Ja
Band:61
DOI:10.1109/TAES.2024.3504493
Seitenbereich:Seiten 4542-4560
Verlag:IEEE - Institute of Electrical and Electronics Engineers
ISSN:0018-9251
Status:veröffentlicht
Stichwörter:Air Traffic Control (ATC) Aviation Radio Telephony Communication Simulation Phraseology Text-to-Speech (TTS) Voice Cloning
HGF - Forschungsbereich:Luftfahrt, Raumfahrt und Verkehr
HGF - Programm:Luftfahrt
HGF - Programmthema:Luftverkehr und Auswirkungen
DLR - Schwerpunkt:Luftfahrt
DLR - Forschungsgebiet:L AI - Luftverkehr und Auswirkungen
DLR - Teilgebiet (Projekt, Vorhaben):L - Integrierte Flugführung
Standort: Braunschweig
Institute & Einrichtungen:Institut für Flugführung > Lotsenassistenz
Hinterlegt von: Ohneiser, Oliver
Hinterlegt am:04 Jun 2025 09:30
Letzte Änderung:04 Jun 2025 09:30

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags

Blättern
Suchen
Hilfe & Kontakt
Informationen
OpenAIRE Validator logo electronic library verwendet EPrints 3.3.12
Gestaltung Webseite und Datenbank: Copyright © Deutsches Zentrum für Luft- und Raumfahrt (DLR). Alle Rechte vorbehalten.