Niemeijer, Joshua (2025) Optimizing Synthetic and Real Training Data Distributions for Deep Learning in Image Recognition. Dissertation, Universität zu Lübeck.
PDF (6 MB), accessible only within DLR
Abstract
Recent advances in deep learning have enabled a large variety of applications. Among these are, for example, the environment perception of robots, including self-driving cars, and medical image analysis, which helps identify medical conditions or plan treatment. To build deep learning systems that generalize well, large quantities of relevant human-labeled data must be available for training. This requirement introduces several challenges. Annotations are costly due to the large amounts of data that need to be labeled and the complex nature of the annotation process. This is further complicated by the fact that relevant data must be recorded before it can be labeled. Depending on the field of application, this can be challenging, because relevant data is seldom available and large quantities of data must be captured to find rare but critical cases. The work investigates a more efficient use of manual annotation through intelligent data selection for labeling, utilizing active learning (AL). In this context, semi-supervised learning (SSL), which aims to replace manual annotation, is utilized. The thesis also investigates the use of synthetic data to replace the acquisition of data itself. The work presents strategies to guide the generation process towards creating rare but critical data. Finally, it is shown how to utilize these insights to create models that generalize well toward unseen distributions with minimal human intervention. For each of these methodologies, the thesis contributes novel approaches and analyses. It is shown that the choice of active learning approach is highly dependent on the type of distribution the selection is performed on and on the annotation budget. Next, the work shows how AL and SSL can be integrated effectively. This insight informs the development of best practices for the application of AL and SSL.
For the use of SSL in adapting networks to novel data domains, this work provides an extensive review of this dynamic field and derives novel low-complexity methods from it. These methods prove useful in their application to the environment perception of autonomous vehicles and the medical domain, as well as for adapting from synthetic to real data. The work provides novel methods for the targeted creation of synthetic data. Building on the creation of synthetic data and the research on SSL, the thesis presents an approach for generalizing to unseen domains. Overall, this thesis provides solutions for minimizing the cost and human effort involved in annotating and acquiring relevant data. The solutions provide efficient adaptation and generalization to new domains and distributions.
| elib URL of the record: | https://elib.dlr.de/221315/ |
|---|---|
| Document type: | University thesis (Dissertation) |
| Title: | Optimizing Synthetic and Real Training Data Distributions for Deep Learning in Image Recognition |
| Authors: | Niemeijer, Joshua |
| Date: | 19 November 2025 |
| Open Access: | No |
| Number of pages: | 186 |
| Status: | Published |
| Keywords: | Computer Vision and Pattern Recognition, Artificial Intelligence, Environment Perception, Medical Image Processing |
| Institution: | Universität zu Lübeck |
| HGF - Research field: | Aeronautics, Space and Transport |
| HGF - Program: | Transport |
| HGF - Program topic: | Road Transport |
| DLR - Research focus: | Transport |
| DLR - Research area: | V ST Straßenverkehr (Road Transport) |
| DLR - Subarea (project, undertaking): | V - ACT4Transformation - Automated and Connected Technologies for Mobility Transformation |
| Location: | Braunschweig |
| Institutes and facilities: | Institut für Verkehrssystemtechnik > Kooperative Straßenfahrzeuge und Systeme |
| Deposited by: | Niemeijer, Joshua |
| Deposited on: | 16 Dec 2025 16:18 |
| Last modified: | 16 Dec 2025 16:18 |