elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Impressum | Datenschutz | Barrierefreiheit | Kontakt | English
Schriftgröße: [-] Text [+]

Optimizing Synthetic and Real Training Data Distributions for Deep Learning in Image Recognition

Niemeijer, Joshua (2025) Optimizing Synthetic and Real Training Data Distributions for Deep Learning in Image Recognition. Dissertation, Universität zu Lübeck.

[img] PDF - Nur DLR-intern zugänglich
6MB

Kurzfassung

The recent advances in deep learning have enabled a large variety of applications. Among these are, for example, the environment perception of robots, including self-driving cars, and medical image analysis, which helps identify medical conditions or planning treatment. To build deep learning systems that generalize well, large quantities of relevant human-labeled data must be available for training. This requirement introduces several challenges. Annotations are costly due to the large amounts of data that need to be labeled and the complex nature of the annotation process. This is made more complex by the fact that relevant data needs to be recorded before data can be labeled. Depending on the field of application, this can be challenging. The challenge arises because relevant data is seldom available, which introduces the need to capture large quantities of data to find rare but critical cases. The work investigates a more efficient use of manual annotation through intelligent data selection for labeling, utilizing active learning (AL). In this context, semi-supervised learning (SSL), which aims to replace manual annotation, is utilized. The thesis investigates the use of synthetic data to replace the acquisition of data itself. The work presents strategies to guide the generation process towards creating seldom but critical data. Finally, it is shown how to utilize these insights to create models that generalize well toward unseen distributions with minimal human intervention. For each of these methodologies, the thesis contributes novel approaches and analyses. It is shown that the choice of active learning approaches is highly dependent on the type of distribution the selection is performed on and the annotation budget. Next, the work shows how AL and semi-supervised learning are effectively integrated. This insight shows how to develop best practices for the application of AL and SSL. For the use of SSL in adapting networks to novel data domains, this work provides an extensive review of this dynamic field and derives novel low-complexity methods from it. These methods prove useful in their application to the environment perception of autonomous vehicles and the medical domain, as well as for adapting from synthetic to real data. The work provides novel methods for the targeted creation of synthetic data. Building on the creation of synthetic data and the research on SSL, the thesis presents an approach for generalizing to unseen domains. Overall, this thesis provides solutions for minimizing the cost and human effort involved in annotating and acquiring relevant data. The solutions provide efficient adaptation and generalization to new domains and distributions.

elib-URL des Eintrags:https://elib.dlr.de/221315/
Dokumentart:Hochschulschrift (Dissertation)
Titel:Optimizing Synthetic and Real Training Data Distributions for Deep Learning in Image Recognition
Autoren:
AutorenInstitution oder E-Mail-AdresseAutoren-ORCID-iDORCID Put Code
Niemeijer, JoshuaJoshua.Niemeijer (at) dlr.deNICHT SPEZIFIZIERTNICHT SPEZIFIZIERT
Datum:19 November 2025
Open Access:Nein
Seitenanzahl:186
Status:veröffentlicht
Stichwörter:Computer Vision and Pattern Recognition, Artificial Intelligence, Environment Perception, Medical Image Processing
Institution:Universität zu Lübeck
HGF - Forschungsbereich:Luftfahrt, Raumfahrt und Verkehr
HGF - Programm:Verkehr
HGF - Programmthema:Straßenverkehr
DLR - Schwerpunkt:Verkehr
DLR - Forschungsgebiet:V ST Straßenverkehr
DLR - Teilgebiet (Projekt, Vorhaben):V - ACT4Transformation - Automated and Connected Technologies for Mobility Transformation
Standort: Braunschweig
Institute & Einrichtungen:Institut für Verkehrssystemtechnik > Kooperative Straßenfahrzeuge und Systeme
Hinterlegt von: Niemeijer, Joshua
Hinterlegt am:16 Dez 2025 16:18
Letzte Änderung:16 Dez 2025 16:18

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags

Blättern
Suchen
Hilfe & Kontakt
Informationen
OpenAIRE Validator logo electronic library verwendet EPrints 3.3.12
Gestaltung Webseite und Datenbank: Copyright © Deutsches Zentrum für Luft- und Raumfahrt (DLR). Alle Rechte vorbehalten.