Policy search in continuous action domains: an overview

Sigaud, Olivier and Stulp, Freek (2019) Policy search in continuous action domains: an overview. Neural Networks, 113, pp. 28-40. Elsevier. doi: 10.1016/j.neunet.2019.01.011. ISSN 0893-6080.

PDF - Preprint version (submitted draft)
758kB

Abstract

Continuous action policy search is currently the focus of intensive research, driven both by the recent success of deep reinforcement learning algorithms and the emergence of competitors based on evolutionary algorithms. In this paper, we present a broad survey of policy search methods, providing a unified perspective on very different approaches, including also Bayesian Optimization and directed exploration methods. The main message of this overview is in the relationship between the families of methods, but we also outline some factors underlying sample efficiency properties of the various approaches.

elib URL of this record:

https://elib.dlr.de/130603/

Document type:

Journal article

Title:

Policy search in continuous action domains: an overview

Authors:

Authors	Institution or email address	Author's ORCID iD	ORCID Put Code
Sigaud, Olivier	Not specified	https://orcid.org/0000-0002-8544-0229	Not specified
Stulp, Freek	Freek.Stulp (at) dlr.de	https://orcid.org/0000-0001-9555-9517	Not specified

Date:

May 2019

Published in:

Neural Networks

Peer-reviewed publication:

Open Access:

Gold Open Access:

No

In SCOPUS:

In ISI Web of Science:

Volume:

113

DOI:

10.1016/j.neunet.2019.01.011

Page range:

pp. 28-40

Publisher:

Elsevier

ISSN:

0893-6080

Status:

Published

Keywords:

Reinforcement Learning, Policy Search, Artificial Intelligence

HGF - Research field:

Aeronautics, Space and Transport

HGF - Program:

Space

HGF - Program topic:

Space System Technology

DLR - Research focus:

Space

DLR - Research area:

R SY - Space System Technology

DLR - Sub-area (project, undertaking):

R - Vorhaben Intelligente Mobilität (old)

Location:

Oberpfaffenhofen

Institutes and institutions:

Institute of Robotics and Mechatronics (since 2013) > Cognitive Robotics

Deposited by:

Stulp, Freek

Deposited on:

18 Nov 2019 08:54

Last modified:

13 Feb 2020 09:34
