Yan, Yashuai and Egle, Tobias and Ott, Christian and Lee, Dongheui (2026) Efficiently Learning Robust Torque-Based Locomotion Through Reinforcement With Model-Based Supervision. IEEE Robotics and Automation Letters, 11 (4), pp. 4155-4162. IEEE - Institute of Electrical and Electronics Engineers. doi: 10.1109/LRA.2026.3664534. ISSN 2377-3766.
Full text not available from this repository.
Official URL: https://ieeexplore.ieee.org/document/11395590
Abstract
We propose a control framework that integrates model-based bipedal locomotion with residual reinforcement learning (RL) to achieve robust and adaptive walking in the presence of real-world uncertainties. Our approach leverages a model-based controller--comprising a Divergent Component of Motion (DCM) trajectory planner and a whole-body controller--as a reliable base policy. To address the uncertainties of inaccurate dynamics modeling and sensor noise, we introduce a residual policy trained through RL with domain randomization. Crucially, we employ a model-based oracle policy, which has privileged access to ground-truth dynamics during training, to supervise the residual policy via a novel supervised loss. This supervision enables the policy to efficiently learn corrective behaviors that compensate for unmodeled effects without extensive reward shaping. Our method demonstrates improved robustness and generalization across a range of randomized conditions, offering a scalable solution for sim-to-real transfer in bipedal locomotion.
| Item URL in elib: | https://elib.dlr.de/224123/ | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Document Type: | Article | ||||||||||||||||||||
| Title: | Efficiently Learning Robust Torque-Based Locomotion Through Reinforcement With Model-Based Supervision | ||||||||||||||||||||
| Authors: |
| ||||||||||||||||||||
| Date: | 13 February 2026 | ||||||||||||||||||||
| Journal or Publication Title: | IEEE Robotics and Automation Letters | ||||||||||||||||||||
| Refereed publication: | Yes | ||||||||||||||||||||
| Open Access: | No | ||||||||||||||||||||
| Gold Open Access: | No | ||||||||||||||||||||
| In SCOPUS: | Yes | ||||||||||||||||||||
| In ISI Web of Science: | Yes | ||||||||||||||||||||
| Volume: | 11 | ||||||||||||||||||||
| DOI: | 10.1109/LRA.2026.3664534 | ||||||||||||||||||||
| Page Range: | pp. 4155-4162 | ||||||||||||||||||||
| Publisher: | IEEE - Institute of Electrical and Electronics Engineers | ||||||||||||||||||||
| ISSN: | 2377-3766 | ||||||||||||||||||||
| Status: | Published | ||||||||||||||||||||
| Keywords: | Locomotion | ||||||||||||||||||||
| HGF - Research field: | Aeronautics, Space and Transport | ||||||||||||||||||||
| HGF - Program: | Space | ||||||||||||||||||||
| HGF - Program Themes: | Robotics | ||||||||||||||||||||
| DLR - Research area: | Raumfahrt | ||||||||||||||||||||
| DLR - Program: | R RO - Robotics | ||||||||||||||||||||
| DLR - Research theme (Project): | R - Walking robot/locomotion [RO], R - Basic Technologies [RO] | ||||||||||||||||||||
| Location: | Oberpfaffenhofen | ||||||||||||||||||||
| Institutes and Institutions: | Institute of Robotics and Mechatronics (since 2013) Institute of Robotics and Mechatronics (since 2013) > Analysis and Control of Advanced Robotic Systems | ||||||||||||||||||||
| Deposited By: | Strobl, Dr.-Ing. Klaus H. | ||||||||||||||||||||
| Deposited On: | 29 Apr 2026 14:26 | ||||||||||||||||||||
| Last Modified: | 29 Apr 2026 14:26 |
Repository Staff Only: item control page