
Guided Reinforcement Learning with Vision Feedback

Özer, Baran (2026) Guided Reinforcement Learning with Vision Feedback. Master's, Technical University of Munich (TUM).

Abstract

This thesis investigates how to improve reinforcement learning (RL) for contact-rich robotic assembly by combining probabilistic trajectory guidance with visual representation learning. Building on Kernelized Guided Reinforcement Learning (KGRL), where Kernelized Movement Primitives (KMP) encode demonstrations as probabilistic trajectory priors and guide policy learning through uncertainty-aware null space actions, we introduce Vision-KGRL, which augments this framework with compact visual features learned via task-specific Variational Autoencoders (VAEs). The approach is evaluated on peg insertion, gear meshing, and nut threading tasks in the NVIDIA Isaac Lab Factory and Forge environments, showing that visual augmentation preserves the faster convergence of KGRL over standard RL, with stronger gains in higher-dimensional action spaces. Systematic ablations across observation modalities (proprioception, force, vision) and action spaces (4D vs 6D) highlight the complementary roles of trajectory guidance and learned visual features, while results further show that KMP-guided policies significantly reduce interaction forces, leading to smoother and more stable behavior. Overall, Vision-KGRL provides a data-efficient and robust solution for contact-rich manipulation, combining learned visual representations with trajectory priors to improve both learning performance and execution safety.

Item URL in elib: https://elib.dlr.de/224243/
Document Type: Thesis (Master's)
Title: Guided Reinforcement Learning with Vision Feedback
Authors:
Author | Institution or Email of Author | Author's ORCID iD | ORCID Put Code
Özer, Baran | baran.oezer (at) dlr.de | UNSPECIFIED | UNSPECIFIED
DLR Supervisors:
Contribution | DLR Supervisor | Institution or E-Mail | DLR Supervisor's ORCID iD
Thesis advisor | Padalkar, Abhishek | Abhishek.Padalkar (at) dlr.de | https://orcid.org/0000-0002-3917-4767
Thesis advisor | Silverio, Joao | joao.silverio (at) dlr.de | https://orcid.org/0000-0003-1428-8933
Date: 2026
Journal or Publication Title: Guided Reinforcement Learning with Vision Feedback
Open Access: Yes
Number of Pages: 81
Status: Published
Keywords: Reinforcement Learning; Imitation Learning; Robot Learning
Institution: Technical University of Munich (TUM)
HGF - Research field: Aeronautics, Space and Transport
HGF - Program: Transport
HGF - Program Themes: Road Transport
DLR - Research area: Transport
DLR - Program: V ST Straßenverkehr
DLR - Research theme (Project): V - ASPIRO - Aerospace production using intelligent robotic systems
Location: Oberpfaffenhofen
Institutes and Institutions: Institute of Robotics and Mechatronics (since 2013) > Cognitive Robotics
Deposited By: Silverio, Joao
Deposited On: 05 May 2026 10:25
Last Modified: 05 May 2026 10:25
