Chiarabini, Luca and Espinoza Molina, Daniela and Zappacosta, Antony and Kuzu, Ridvan Salih and Camero, Andres (2025) Multimodal Learning for Earth Observation: Automating Satellite Image Captioning with Geo-FMs. Helmholtz AI Conference 2025, 2025-06-03 - 2025-06-05, Karlsruhe.
Abstract
The automatic generation of captions for satellite images can enhance the accessibility and interpretability of Earth Observation (EO) data. In this study, we compare two approaches to image captioning: TerraMind, a model developed within the FAST-EO project specifically for satellite imagery, and BLIP-2, a generic multimodal model trained on RGB images. The dataset used, SmallMinesDS, consists of annotated satellite images from five districts in Ghana, where unregulated small-scale gold mining threatens cocoa farmlands. Our evaluation focuses on caption accuracy, specificity, and adaptability to EO imagery, highlighting the strengths and limitations of each approach in the context of environmental monitoring.
| Item URL in elib: | https://elib.dlr.de/215040/ |
|---|---|
| Document Type: | Conference or Workshop Item (Poster) |
| Title: | Multimodal Learning for Earth Observation: Automating Satellite Image Captioning with Geo-FMs |
| Authors: | Chiarabini, Luca; Espinoza Molina, Daniela; Zappacosta, Antony; Kuzu, Ridvan Salih; Camero, Andres |
| Date: | 3 June 2025 |
| Refereed publication: | No |
| Open Access: | Yes |
| Gold Open Access: | No |
| In SCOPUS: | No |
| In ISI Web of Science: | No |
| Status: | Published |
| Keywords: | Satellite image captioning, Earth Observation (EO), TerraMind, Multimodal models, AI |
| Event Title: | Helmholtz AI Conference 2025 |
| Event Location: | Karlsruhe |
| Event Type: | national Conference |
| Event Start Date: | 3 June 2025 |
| Event End Date: | 5 June 2025 |
| HGF - Research field: | Aeronautics, Space and Transport |
| HGF - Program: | Space |
| HGF - Program Themes: | Earth Observation |
| DLR - Research area: | Space |
| DLR - Program: | R EO - Earth Observation |
| DLR - Research theme (Project): | R - Artificial Intelligence, R - Optical remote sensing, R - Machine Learning, R - Remote Sensing and Geo Research |
| Location: | Oberpfaffenhofen |
| Institutes and Institutions: | Remote Sensing Technology Institute > EO Data Science |
| Deposited By: | Chiarabini, Luca |
| Deposited On: | 09 Jul 2025 13:53 |
| Last Modified: | 09 Jul 2025 13:53 |