elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

An Approach for Fusing Two Training-Datasets with Partially Overlapping Classes

Niemeijer, Joshua and Srinivas, Gurucharan and Leich, Andreas and Battistella, Federico (2023) An Approach for Fusing Two Training-Datasets with Partially Overlapping Classes. In: 17th IEEE International Conference on Semantic Computing, ICSC 2023. 17th International Conference on Semantic Computing (ICSC), 2023-02-01 - 2023-02-03, Laguna Hills, CA, USA. doi: 10.1109/icsc56153.2023.00017. ISBN 978-166548263-9.

[img] PDF - Only accessible within DLR
2MB

Abstract

Supervised deep learning techniques in image processing require training data, typically consisting of manually labeled ground truth annotations. Since manual labeling is costly, using as many existing training datasets as possible is necessary. This paper introduces a novel approach for combining training datasets into a new one. The naive approach to this is a plain concatenation of the existing datasets. However, this approach fails with partially overlapping datasets when certain annotated instances specific to one dataset also appear in the other dataset without their annotations. Therefore, we present a novel method for combining existing training datasets using a pseudo-labeling technique with uncertainty quantification. The effectiveness of our method is evaluated by fusing two datasets consisting of partially overlapping traffic sign annotations in street view images. The results demonstrate that the pseudo-labeling errors weigh less than those introduced by the naive fusion. Furthermore, our work provides evidence for practitioners to use a pseudolabeling-based fusion technique with uncertainty quantificationrather than naively combining training datasets into a new one.

Item URL in elib:https://elib.dlr.de/198542/
Document Type:Conference or Workshop Item (Speech)
Title:An Approach for Fusing Two Training-Datasets with Partially Overlapping Classes
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Niemeijer, JoshuaUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Srinivas, GurucharanUNSPECIFIEDhttps://orcid.org/0000-0001-5151-9150UNSPECIFIED
Leich, AndreasUNSPECIFIEDhttps://orcid.org/0000-0001-5242-2051148334099
Battistella, FedericoUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:2023
Journal or Publication Title:17th IEEE International Conference on Semantic Computing, ICSC 2023
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:Yes
DOI:10.1109/icsc56153.2023.00017
ISBN:978-166548263-9
Status:Published
Keywords:Dataset Fusion, Traffic Sign detection, Deep Learning, Uncertainty Quantification, Pseudo-Labeling
Event Title:17th International Conference on Semantic Computing (ICSC)
Event Location:Laguna Hills, CA, USA
Event Type:international Conference
Event Start Date:1 February 2023
Event End Date:3 February 2023
Organizer:IEEE
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Transport
HGF - Program Themes:Road Transport
DLR - Research area:Transport
DLR - Program:V ST Straßenverkehr
DLR - Research theme (Project):V - KoKoVI - Koordinierter kooperativer Verkehr mit verteilter, lernender Intelligenz
Location: Berlin-Adlershof , Braunschweig
Institutes and Institutions:Institute of Transportation Systems
Institute of Transportation Systems > Cooperative Systems, BS
Institute of Transportation Systems > Information Gathering and Modelling, BA
Deposited By: Niemeijer, Joshua
Deposited On:08 Dec 2023 14:43
Last Modified:25 Aug 2025 15:10

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.