elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Influence of multimodal training data for segmentation

Tarant, Yannick and Ramirez Agudelo, Oscar Hernan and Schreiber, Lena (2024) Influence of multimodal training data for segmentation. In: Signal Processing, Sensor/Information Fusion, and Target Recognition XXXIII 2024, XXXIII (130570). SPIE. Signal Processing, Sensor/Information Fusion, and Target Recognition XXXIII, 2024-04-21 - 2024-04-25, National Harbour, USA. doi: 10.1117/12.3012567. ISBN 978-151067432-5. ISSN 0277-786X.

[img] PDF
13MB

Abstract

In order to protect and secure critical infrastructures, it is important to advance technologies for automated detection and mapping of crucial components like the pipes of fire suppression systems. This work explores the instance segmentation on such pipes in multimodal image media. Instance segmentation has been significantly enhanced by machine learning techniques, but faces complexities when the used RGB training set is limited and lacks diversity. In the means of improving the training efficiency, we study the influence of enhancing the training set with thermal infrared images on the models performance. In our study, we harnessed both RGB and infrared images captured from the same location. We employed Mask R-CNN with transfer learning from the COCO weights and trained multiple neural networks on different training set combinations with RGB images and thermal infrared images. In order to further enlarge each training data set, we implemented different augmentation methods in the training. Subsequently, we conducted fine-tuning and optimization procedures for the Mask R-CNN training and determined the quality of the pipe instance segmentation of the produced models on test sets containing RGB and infrared images. Using the Jaccard score we provide a quantitative measure of pipe segmentation. The results show that the addition of a comparatively low number of infrared images to the training process does not only improve the detection performance on infrared data, but also improves the models performance on RGB data. A SOTA comparison shows that the detection quality of our models is comparable with the detection results of Ultralytics YOLOv8 and Metas SAM, which are both more recently released.

Item URL in elib:https://elib.dlr.de/210263/
Document Type:Conference or Workshop Item (Speech)
Title:Influence of multimodal training data for segmentation
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Tarant, YannickUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Ramirez Agudelo, Oscar HernanUNSPECIFIEDhttps://orcid.org/0000-0002-9379-5409UNSPECIFIED
Schreiber, LenaUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:7 June 2024
Journal or Publication Title:Signal Processing, Sensor/Information Fusion, and Target Recognition XXXIII 2024
Refereed publication:No
Open Access:Yes
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:No
Volume:XXXIII
DOI:10.1117/12.3012567
Publisher:SPIE
Series Name:Signal Processing, Sensor/Information Fusion, and Target Recognition
ISSN:0277-786X
ISBN:978-151067432-5
Status:Published
Keywords:Critical infrastructures, Automated detection, Instance segmentation, Machine learning, Multimodal image data, Thermal infrared images, Mask R-CNN, Training efficiency
Event Title:Signal Processing, Sensor/Information Fusion, and Target Recognition XXXIII
Event Location:National Harbour, USA
Event Type:international Conference
Event Start Date:21 April 2024
Event End Date:25 April 2024
Organizer:SPIE
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Earth Observation
DLR - Research area:Raumfahrt
DLR - Program:R EO - Earth Observation
DLR - Research theme (Project):R - Synergy project Automated Model Generation
Location: Rhein-Sieg-Kreis
Institutes and Institutions:Institute for the Protection of Terrestrial Infrastructures > Detection Systems
Institute for the Protection of Terrestrial Infrastructures
Deposited By: Tarant, Yannick
Deposited On:13 Dec 2024 14:12
Last Modified:18 Jun 2025 09:27

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.