Horsch, Julius (2025) Optimal Periodic Orbit Stabilization Applied To Elastic Bipedal Locomotion. Master's, Technical University of Munich.
|
PDF
- Only accessible within DLR
23MB |
Abstract
Efficient and robust bipedal locomotion remains a fundamentally challenging control problem due to underactuation, compliance, and contact dynamics. This thesis investigates two different approaches to generating and stabilizing periodic gaits for elastic bipeds. The first approach is purely reinforcement learning based, where both trajectory generation and stabilization are handled jointly. A policy is trained to imitate human motion data and learns robust, energy-efficient gaits through motion imitation and domain randomization. The second approach separates the problem into two stages: trajectory generation, followed by stabilization of the resulting periodic orbit using optimal control, the latter being the focus of this thesis. The stabilization is formulated in transverse coordinates, and an optimal feedback law is computed for the linearized system along the orbit by solving the periodic Riccati equation. To improve the online applicability of the controller, two methods are presented to eliminate the need for coordinate transformations during execution: a Kalman filter-based estimator and a neural network approximation of the value function. A reinforcement learning policy with a structure similar to the optimal controller is also trained for comparison. Both approaches are implemented on the C-Runner platform and compared in terms of robustness and applicability to real-world systems.
| Item URL in elib: | https://elib.dlr.de/216290/ | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Document Type: | Thesis (Master's) | ||||||||
| Title: | Optimal Periodic Orbit Stabilization Applied To Elastic Bipedal Locomotion | ||||||||
| Authors: |
| ||||||||
| DLR Supervisors: |
| ||||||||
| Date: | 1 August 2025 | ||||||||
| Open Access: | No | ||||||||
| Number of Pages: | 74 | ||||||||
| Status: | Published | ||||||||
| Keywords: | locomotion, underactuation, compliance, optimal control, reinforcement learning, C-Runner | ||||||||
| Institution: | Technical University of Munich | ||||||||
| Department: | TUM School of Computation, Information and Technology | ||||||||
| HGF - Research field: | Aeronautics, Space and Transport | ||||||||
| HGF - Program: | Space | ||||||||
| HGF - Program Themes: | Robotics | ||||||||
| DLR - Research area: | Raumfahrt | ||||||||
| DLR - Program: | R RO - Robotics | ||||||||
| DLR - Research theme (Project): | R - Walking robots/locomotion | ||||||||
| Location: | Oberpfaffenhofen | ||||||||
| Institutes and Institutions: | Institute of Robotics and Mechatronics (since 2013) > Analysis and Control of Advanced Robotic Systems | ||||||||
| Deposited By: | Beck, Fabian | ||||||||
| Deposited On: | 08 Sep 2025 07:43 | ||||||||
| Last Modified: | 08 Sep 2025 07:43 |
Repository Staff Only: item control page