elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Efficiently Learning Robust Torque-Based Locomotion Through Reinforcement With Model-Based Supervision

Yan, Yashuai and Egle, Tobias and Ott, Christian and Lee, Dongheui (2026) Efficiently Learning Robust Torque-Based Locomotion Through Reinforcement With Model-Based Supervision. IEEE Robotics and Automation Letters, 11 (4), pp. 4155-4162. IEEE - Institute of Electrical and Electronics Engineers. doi: 10.1109/LRA.2026.3664534. ISSN 2377-3766.

Full text not available from this repository.

Official URL: https://ieeexplore.ieee.org/document/11395590

Abstract

We propose a control framework that integrates model-based bipedal locomotion with residual reinforcement learning (RL) to achieve robust and adaptive walking in the presence of real-world uncertainties. Our approach leverages a model-based controller--comprising a Divergent Component of Motion (DCM) trajectory planner and a whole-body controller--as a reliable base policy. To address the uncertainties of inaccurate dynamics modeling and sensor noise, we introduce a residual policy trained through RL with domain randomization. Crucially, we employ a model-based oracle policy, which has privileged access to ground-truth dynamics during training, to supervise the residual policy via a novel supervised loss. This supervision enables the policy to efficiently learn corrective behaviors that compensate for unmodeled effects without extensive reward shaping. Our method demonstrates improved robustness and generalization across a range of randomized conditions, offering a scalable solution for sim-to-real transfer in bipedal locomotion.

Item URL in elib:https://elib.dlr.de/224123/
Document Type:Article
Title:Efficiently Learning Robust Torque-Based Locomotion Through Reinforcement With Model-Based Supervision
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Yan, YashuaiTU WienUNSPECIFIEDUNSPECIFIED
Egle, TobiasEgle (at) acin.tuwien.ac.atUNSPECIFIEDUNSPECIFIED
Ott, ChristianChristian.Ott (at) dlr.dehttps://orcid.org/0000-0003-0987-7493UNSPECIFIED
Lee, DongheuiDongheui.Lee (at) dlr.dehttps://orcid.org/0000-0003-1897-7664UNSPECIFIED
Date:13 February 2026
Journal or Publication Title:IEEE Robotics and Automation Letters
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:Yes
Volume:11
DOI:10.1109/LRA.2026.3664534
Page Range:pp. 4155-4162
Publisher:IEEE - Institute of Electrical and Electronics Engineers
ISSN:2377-3766
Status:Published
Keywords:Locomotion
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Robotics
DLR - Research area:Raumfahrt
DLR - Program:R RO - Robotics
DLR - Research theme (Project):R - Walking robot/locomotion [RO], R - Basic Technologies [RO]
Location: Oberpfaffenhofen
Institutes and Institutions:Institute of Robotics and Mechatronics (since 2013)
Institute of Robotics and Mechatronics (since 2013) > Analysis and Control of Advanced Robotic Systems
Deposited By: Strobl, Dr.-Ing. Klaus H.
Deposited On:29 Apr 2026 14:26
Last Modified:29 Apr 2026 14:26

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.