elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Impressum | Datenschutz | Barrierefreiheit | Kontakt | English
Schriftgröße: [-] Text [+]

Camera-Supported Uncertainty Awareness for Guided Real-World Reinforcement Learning

Martensen, Lugh (2026) Camera-Supported Uncertainty Awareness for Guided Real-World Reinforcement Learning. Masterarbeit, University of Lübeck.

[img] PDF
14MB

Kurzfassung

Learning from Demonstration (LfD) has become an intuitive and user-friendly method for robot programming. It enables robots to imitate complex behaviors and adapt them to task-specific constraints without requiring detailed models of the environment. However, in scenarios where demonstrations do not fully capture all task dynamics, LfD can become unreliable. For contact-rich tasks, which are common in manufacturing, demonstration-based policies often struggle to generalize due to out-of-distribution states.

Reinforcement Learning (RL) has emerged as a promising solution. By training an RL policy to correct a baseline LfD trajectory based on the robot's state, the resulting behavior adheres to the demonstrations while dynamically adapting to the environment to improve reliability. In recent work, a new approach called Kernelized Guided Reinforcement Learning (KGRL) extends this idea by shaping the RL agent's exploration space according to the probabilistic properties of the demonstrated behavior. Restricting the robot to explore only states that are close to previously observed ones addresses common concerns regarding safety and sample efficiency when applying RL to real-world systems. However, KGRL assumes that the uncertainty encoded in demonstrations sufficiently captures the variability of the task, which is often not the case in practical scenarios.

In this thesis, an extension of KGRL, termed External-Uncertainty KGRL (EU-KGRL), is proposed. The key idea is to incorporate externally provided covariance information into the framework, allowing the exploration space to be shaped by additional sources of uncertainty, such as perception systems, alongside demonstrations. To this end, a markerless, camera-based pose estimation pipeline is implemented, achieving millimeter-level position accuracy. Its accuracy and precision are analyzed to characterize the resulting uncertainty.

The proposed extension is evaluated both in simulation and on a real robotic system for scenarios involving target pose uncertainty. In simulation, EU-KGRL demonstrates the ability to solve tasks that KGRL alone cannot address by enabling structured exploration in task-relevant directions. In real-world experiments, the method is applied to an Ethernet connector insertion task, where the target device pose is estimated using the implemented vision pipeline. The results show that, in certain settings, EU-KGRL can compensate for positional inaccuracies more effectively than KGRL. At the same time, the challenges of deploying EU-KGRL in real-world scenarios are highlighted, as its performance depends strongly on the availability of an accurate uncertainty model.

elib-URL des Eintrags:https://elib.dlr.de/224922/
Dokumentart:Hochschulschrift (Masterarbeit)
Titel:Camera-Supported Uncertainty Awareness for Guided Real-World Reinforcement Learning
Autoren:
AutorenInstitution oder E-Mail-AdresseAutoren-ORCID-iDORCID Put Code
Martensen, Lughlugh.martensen (at) dlr.deNICHT SPEZIFIZIERTNICHT SPEZIFIZIERT
DLR-Supervisor:
BeitragsartDLR-SupervisorInstitution oder E-Mail-AdresseDLR-Supervisor-ORCID-iD
Thesis advisorWillibald, ChristophChristoph.Willibald (at) dlr.dehttps://orcid.org/0000-0003-3579-4130
Thesis advisorPadalkar, AbhishekAbhishek.Padalkar (at) dlr.dehttps://orcid.org/0000-0002-3917-4767
Thesis advisorNottensteiner, Korbiniankorbinian.nottensteiner (at) dlr.dehttps://orcid.org/0000-0002-6016-6235
Thesis advisorSilverio, Joaojoao.silverio (at) dlr.dehttps://orcid.org/0000-0003-1428-8933
Datum:2026
Erschienen in:Camera-Supported Uncertainty Awareness for Guided Real-World Reinforcement Learning
Open Access:Ja
Seitenanzahl:95
Status:veröffentlicht
Stichwörter:Reinforcement Learning
Institution:University of Lübeck
HGF - Forschungsbereich:Luftfahrt, Raumfahrt und Verkehr
HGF - Programm:Raumfahrt
HGF - Programmthema:Robotik
DLR - Schwerpunkt:Raumfahrt
DLR - Forschungsgebiet:R RO - Robotik
DLR - Teilgebiet (Projekt, Vorhaben):R - Synergieprojekt ASPIRO
Standort: Oberpfaffenhofen
Institute & Einrichtungen:Institut für Robotik und Mechatronik (ab 2013)
Hinterlegt von: Silverio, Joao
Hinterlegt am:16 Jun 2026 10:17
Letzte Änderung:16 Jun 2026 10:17

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags

Blättern
Suchen
Hilfe & Kontakt
Informationen
OpenAIRE Validator logo electronic library verwendet EPrints 3.3.12
Gestaltung Webseite und Datenbank: Copyright © Deutsches Zentrum für Luft- und Raumfahrt (DLR). Alle Rechte vorbehalten.