Multimodal Learning for Earth Observation: Automating Satellite Image Captioning with Geo-FMs

Chiarabini, Luca and Espinoza Molina, Daniela and Zappacosta, Antony and Kuzu, Ridvan Salih and Camero, Andres (2025) Multimodal Learning for Earth Observation: Automating Satellite Image Captioning with Geo-FMs. Helmholtz AI Conference 2025, 2025-06-03 - 2025-06-05, Karlsruhe.

Abstract

The automatic generation of captions for satellite images can enhance the accessibility and interpretability of Earth Observation (EO) data. In this study, we compare two approaches to image captioning: TerraMind, a model developed within the FAST-EO project specifically for satellite imagery, and BLIP-2, a generic multimodal model trained on RGB images. The dataset used, SmallMinesDS, consists of annotated satellite images from five districts in Ghana, where unregulated small-scale gold mining threatens cocoa farmlands. Our evaluation focuses on caption accuracy, specificity, and adaptability to EO imagery, highlighting the strengths and limitations of each approach in the context of environmental monitoring.

Item URL in elib: https://elib.dlr.de/215040/
Document Type: Conference or Workshop Item (Poster)
Title: Multimodal Learning for Earth Observation: Automating Satellite Image Captioning with Geo-FMs
Authors:
Authors | Institution or Email of Authors | Author's ORCID iD | ORCID Put Code
Chiarabini, Luca | UNSPECIFIED | UNSPECIFIED | UNSPECIFIED
Espinoza Molina, Daniela | UNSPECIFIED | UNSPECIFIED | UNSPECIFIED
Zappacosta, Antony | UNSPECIFIED | UNSPECIFIED | UNSPECIFIED
Kuzu, Ridvan Salih | UNSPECIFIED | https://orcid.org/0000-0002-1816-181X | UNSPECIFIED
Camero, Andres | UNSPECIFIED | https://orcid.org/0000-0002-8152-9381 | UNSPECIFIED
Date: 3 June 2025
Refereed publication: No
Open Access: Yes
Gold Open Access: No
In SCOPUS: No
In ISI Web of Science: No
Status: Published
Keywords: Satellite image captioning, Earth Observation (EO), TerraMind, Multimodal models, AI
Event Title: Helmholtz AI Conference 2025
Event Location: Karlsruhe
Event Type: national Conference
Event Start Date: 3 June 2025
Event End Date: 5 June 2025
HGF - Research field: Aeronautics, Space and Transport
HGF - Program: Space
HGF - Program Themes: Earth Observation
DLR - Research area: Raumfahrt
DLR - Program: R EO - Earth Observation
DLR - Research theme (Project): R - Artificial Intelligence, R - Optical remote sensing, R - Machine Learning, R - Remote Sensing and Geo Research
Location: Oberpfaffenhofen
Institutes and Institutions: Remote Sensing Technology Institute > EO Data Science
Deposited By: Chiarabini, Luca
Deposited On: 09 Jul 2025 13:53
Last Modified: 09 Jul 2025 13:53
