elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Are Vision-Language Foundation Models Able to Fly?

Rüter, Joachim and Davydov, Philipp and Maienschein, Theresa Diana and Durak, Umut and Dauer, Johann C. (2025) Are Vision-Language Foundation Models Able to Fly? In: 44th AIAA DATC/IEEE Digital Avionics Systems Conference, DASC 2025. Institute of Electrical and Electronics Engineers Inc.. Digital Avionics Systems Conference, 2025-09-14, Montreal, Kanada. doi: 10.1109/DASC66011.2025.11257290. ISBN 979-833152519-4. ISSN 2155-7195.

[img] PDF - Only accessible within DLR
5MB

Abstract

Safe autonomous aircraft require accurate environment perception, which can be achieved through semantic segmentation of camera images. However, training neural networks relies on large, diverse datasets that are often unavailable in aviation. Vision-language foundation models offer a promising alternative, but their accuracy for aviation tasks is an open question as the aerial perspective might not be adequately represented in the original training data. Against this background, this paper investigates the performance of two vision-language foundation models, CLIPSeg and CAT-Seg, on an aerial image dataset. Our experiments show that the models can achieve competitive semantic segmentation performance without aviation-specific training. This paper further examines prompt engineering and discusses challenges of deploying these models in aviation. While certification and runtime constraints pose significant hurdles, our findings suggest that vision-language foundation models have potential for improving environment perception in aviation and may reduce the need for extensive training data in the future.

Item URL in elib:https://elib.dlr.de/221917/
Document Type:Conference or Workshop Item (Speech)
Title:Are Vision-Language Foundation Models Able to Fly?
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Rüter, JoachimUNSPECIFIEDhttps://orcid.org/0000-0002-5559-5481203720200
Davydov, PhilippUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Maienschein, Theresa DianaUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Durak, UmutUNSPECIFIEDhttps://orcid.org/0000-0002-2928-1710203720201
Dauer, Johann C.UNSPECIFIEDhttps://orcid.org/0000-0002-8287-2376UNSPECIFIED
Date:2025
Journal or Publication Title:44th AIAA DATC/IEEE Digital Avionics Systems Conference, DASC 2025
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:Yes
DOI:10.1109/DASC66011.2025.11257290
Publisher:Institute of Electrical and Electronics Engineers Inc.
Series Name:AIAA/IEEE Digital Avionics Systems Conference - Proceedings
ISSN:2155-7195
ISBN:979-833152519-4
Status:Published
Keywords:Semantic Segmentation, Vision-Language Models, Deep Learning, Unmanned Aircraft System, Autonomy
Event Title:Digital Avionics Systems Conference
Event Location:Montreal, Kanada
Event Type:international Conference
Event Date:14 September 2025
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Aeronautics
HGF - Program Themes:Components and Systems
DLR - Research area:Aeronautics
DLR - Program:L CS - Components and Systems
DLR - Research theme (Project):L - Unmanned Aerial Systems
Location: Braunschweig
Institutes and Institutions:Institute of Flight Systems > Unmanned Aircraft
Institute of Flight Systems > Safety Critical Systems&Systems Engineering
Institute of Flight Systems
Deposited By: Rüter, Joachim
Deposited On:27 Jan 2026 12:26
Last Modified:24 Feb 2026 09:49

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.