Optimal Periodic Orbit Stabilization Applied To Elastic Bipedal Locomotion

Horsch, Julius (2025) Optimal Periodic Orbit Stabilization Applied To Elastic Bipedal Locomotion. Masterarbeit, Technical University of Munich.

PDF - Nur DLR-intern zugänglich
23MB

Kurzfassung

Efficient and robust bipedal locomotion remains a fundamentally challenging control problem due to underactuation, compliance, and contact dynamics. This thesis investigates two different approaches to generating and stabilizing periodic gaits for elastic bipeds.

The first approach is purely reinforcement learning based, where both trajectory generation and stabilization are handled jointly. A policy is trained to imitate human motion data and learns robust, energy-efficient gaits through motion imitation and domain randomization.

The second approach separates the problem into two stages: trajectory generation, followed by stabilization of the resulting periodic orbit using optimal control, the latter being the focus of this thesis. The stabilization is formulated in transverse coordinates, and an optimal feedback law is computed for the linearized system along the orbit by solving the periodic Riccati equation. To improve the online applicability of the controller, two methods are presented to eliminate the need for coordinate transformations during execution: a Kalman filter-based estimator and a neural network approximation of the value function. A reinforcement learning policy with a structure similar to the optimal controller is also trained for comparison.

Both approaches are implemented on the C-Runner platform and compared in terms of robustness and applicability to real-world systems.

elib-URL des Eintrags:

https://elib.dlr.de/216290/

Dokumentart:

Hochschulschrift (Masterarbeit)

Titel:

Optimal Periodic Orbit Stabilization Applied To Elastic Bipedal Locomotion

Autoren:

Autoren	Institution oder E-Mail-Adresse	Autoren-ORCID-iD	ORCID Put Code
Horsch, Julius	julius.horsch (at) dlr.de	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT

DLR-Supervisor:

Beitragsart	DLR-Supervisor	Institution oder E-Mail-Adresse	DLR-Supervisor-ORCID-iD
Thesis advisor	Beck, Fabian	Fabian.Beck (at) dlr.de	https://orcid.org/0000-0003-3239-5505

Datum:

1 August 2025

Open Access:

Nein

Seitenanzahl:

Status:

veröffentlicht

Stichwörter:

locomotion, underactuation, compliance, optimal control, reinforcement learning, C-Runner

Institution:

Technical University of Munich

Abteilung:

TUM School of Computation, Information and Technology

HGF - Forschungsbereich:

Luftfahrt, Raumfahrt und Verkehr

HGF - Programm:

Raumfahrt

HGF - Programmthema:

Robotik

DLR - Schwerpunkt:

Raumfahrt

DLR - Forschungsgebiet:

R RO - Robotik

DLR - Teilgebiet (Projekt, Vorhaben):

R - Laufroboter/Lokomotion

Standort:

Oberpfaffenhofen

Institute & Einrichtungen:

Institut für Robotik und Mechatronik (ab 2013) > Analyse und Regelung komplexer Robotersysteme

Hinterlegt von:

Beck, Fabian

Hinterlegt am:

08 Sep 2025 07:43

Letzte Änderung:

08 Sep 2025 07:43

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags