elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Guiding Reinforcement Learning with Shared Control Templates

Padalkar, Abhishek and Quere, Gabriel and Steinmetz, Franz and Raffin, Antonin and Nieuwenhuisen, Matthias and Silvério, João and Stulp, Freek (2023) Guiding Reinforcement Learning with Shared Control Templates. In: 2023 IEEE International Conference on Robotics and Automation, ICRA 2023. IEEE. IEEE International Conference on Robotics and Automation 2023, 2023-05-29 - 2023-06-02, London. doi: 10.1109/ICRA48891.2023.10161058. ISBN 979-835032365-8. ISSN 1050-4729.

[img] PDF
4MB

Official URL: https://ieeexplore.ieee.org/document/10161058

Abstract

Purposeful interaction with objects usually requires certain constraints to be respected, e.g. keeping a bottle upright to avoid spilling. In reinforcement learning, such constraints are typically encoded in the reward function. As a consequence, constraints can only be learned by violating them. This often precludes learning on the physical robot, as it may take many trials to learn the constraints, and the necessity to violate them during the trial-and-error learning may be unsafe. We have serendipitously discovered that constraint representations for shared control -- in particular Shared Control Templates (SCTs) -- are ideally suited for guiding RL. Representing constraints explicitly (rather than implicitly in the reward function) also simplifies the design of the reward function. We evaluate the advantages of the approach (faster learning without constraint violations, even with sparse reward functions) in a simulated pouring task. Furthermore, we demonstrate that these advantages enable the real robot to learn this task in only 65 episodes taking 16 minutes.

Item URL in elib:https://elib.dlr.de/193739/
Document Type:Conference or Workshop Item (Poster)
Title:Guiding Reinforcement Learning with Shared Control Templates
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Padalkar, AbhishekUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Quere, GabrielUNSPECIFIEDhttps://orcid.org/0000-0002-1788-3685UNSPECIFIED
Steinmetz, FranzUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Raffin, AntoninUNSPECIFIEDhttps://orcid.org/0000-0001-6036-6950UNSPECIFIED
Nieuwenhuisen, MatthiasUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Silvério, JoãoUNSPECIFIEDhttps://orcid.org/0000-0003-1428-8933UNSPECIFIED
Stulp, FreekUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:4 July 2023
Journal or Publication Title:2023 IEEE International Conference on Robotics and Automation, ICRA 2023
Refereed publication:Yes
Open Access:Yes
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:No
DOI:10.1109/ICRA48891.2023.10161058
Publisher:IEEE
ISSN:1050-4729
ISBN:979-835032365-8
Status:Published
Keywords:Reinforcement Learning, Constrained Motion, Safe Robot Learning
Event Title:IEEE International Conference on Robotics and Automation 2023
Event Location:London
Event Type:international Conference
Event Start Date:29 May 2023
Event End Date:2 June 2023
Organizer:IEEE
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Robotics
DLR - Research area:Raumfahrt
DLR - Program:R RO - Robotics
DLR - Research theme (Project):R - Autonomous learning robots [RO]
Location: Oberpfaffenhofen
Institutes and Institutions:Institute of Robotics and Mechatronics (since 2013)
Institute of Robotics and Mechatronics (since 2013) > Cognitive Robotics
Deposited By: Padalkar, Abhishek
Deposited On:31 Jan 2023 15:47
Last Modified:24 Apr 2024 20:54

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.