Wüstenbecker, Niclas and Helmke, Hartmut and Ohneiser, Oliver and Kleinert, Matthias and Daenzer, Bernhard (2025) Robust Air Traffic Control Speaker Role Classification through Combined Speaker Embeddings and Speech Understanding. In: 44th IEEE/AIAA Digital Avionics Systems Conference, DASC 2025, pages 1-10. 44th Digital Avionics Systems Conference DASC, 2025-09-14 - 2025-09-18, Montreal, Canada.
Abstract
Automatic speech recognition (ASR) systems in air traffic control need to distinguish between controller and pilot transmissions, a task called speaker role classification. This paper presents a robust approach by combining complementary classification methods: clustering of speaker embeddings from audio data and a rule-based classification system analyzing the semantics from generated transcripts. Our combined approach achieves 99% precision and 96% recall even on recordings from the operational environment, outperforming state-of-the-art text-based supervised learning models. Additionally, we demonstrate the robustness of our approach by evaluating the method across datasets from the tower, approach, and en-route domains, as well as analyzing performance on degraded ASR transcripts.
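
The idea of combining the two complementary signals can be illustrated with a minimal sketch. It is not the authors' implementation: the cue phrases, the function names `rule_based_role` and `combine`, and the majority-vote rule are all assumptions made for illustration, and the speaker cluster IDs are assumed to come from some upstream clustering of speaker embeddings that is not shown here.

```python
from collections import Counter, defaultdict

# Toy cue phrases, invented for illustration only (not from the paper).
PILOT_CUES = ("request", "wilco", "with you")
CONTROLLER_CUES = ("identified", "contact", "report")


def rule_based_role(transcript):
    """Return 'pilot', 'controller', or None if no cue or conflicting cues match."""
    text = transcript.lower()
    pilot = any(cue in text for cue in PILOT_CUES)
    controller = any(cue in text for cue in CONTROLLER_CUES)
    if pilot == controller:  # neither matched, or both matched
        return None
    return "pilot" if pilot else "controller"


def combine(cluster_ids, transcripts):
    """Assign each utterance the majority rule-based role of its speaker cluster.

    cluster_ids are assumed to be precomputed by clustering speaker embeddings.
    """
    votes = defaultdict(Counter)
    for cid, text in zip(cluster_ids, transcripts):
        role = rule_based_role(text)
        if role is not None:
            votes[cid][role] += 1
    # Clusters without any rule-based vote stay unlabeled (None).
    return [
        votes[cid].most_common(1)[0][0] if votes[cid] else None
        for cid in cluster_ids
    ]


if __name__ == "__main__":
    ids = [0, 1, 0, 2]
    texts = [
        "lufthansa one two three identified",
        "request descent lufthansa one two three",
        "speedbird four five descend flight level eight zero",
        "good morning",
    ]
    print(combine(ids, texts))
```

In this sketch the third utterance matches no text cue, but it shares a speaker cluster with the first one and therefore inherits the controller label; that is the kind of complementarity the abstract describes, shown under the toy assumptions above.
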
| elib URL of this entry: | https://elib.dlr.de/219498/ |
|---|---|
| Document type: | Conference contribution (talk) |
| Title: | Robust Air Traffic Control Speaker Role Classification through Combined Speaker Embeddings and Speech Understanding |
| Authors: | Wüstenbecker, Niclas; Helmke, Hartmut; Ohneiser, Oliver; Kleinert, Matthias; Daenzer, Bernhard |
| Date: | September 2025 |
| Published in: | 44th IEEE/AIAA Digital Avionics Systems Conference, DASC 2025 |
| Peer-reviewed publication: | Yes |
| Open Access: | Yes |
| Gold Open Access: | No |
| In SCOPUS: | No |
| In ISI Web of Science: | No |
| Page range: | pages 1-10 |
| Status: | published |
| Keywords: | Air Traffic Control, Speaker Role Classification, Speaker Diarization, Speaker Embeddings, Automatic Speech Recognition, Speech Understanding, Rule-Based Systems |
| Event title: | 44th Digital Avionics Systems Conference DASC |
| Event location: | Montreal, Canada |
| Event type: | international conference |
| Event start: | 14 September 2025 |
| Event end: | 18 September 2025 |
| HGF - Research field: | Aeronautics, Space and Transport |
| HGF - Program: | Aeronautics |
| HGF - Program theme: | Air Transportation and Impact |
| DLR - Research area: | Aeronautics |
| DLR - Program: | L AI - Air Transportation and Impact |
| DLR - Research theme (project): | L - Integrated Flight Guidance |
| Location: | Braunschweig |
| Institutes and institutions: | Institute of Flight Guidance > Controller Assistance |
| Deposited by: | Wüstenbecker, Niclas |
| Deposited on: | 24 Nov 2025 10:00 |
| Last modified: | 24 Nov 2025 10:00 |