Sigaud, Olivier and Stulp, Freek (2019) Policy search in continuous action domains: an overview. Neural Networks, 113, pp. 28-40. Elsevier. doi: 10.1016/j.neunet.2019.01.011. ISSN 0893-6080.
PDF - Preprint version (submitted draft), 758kB
Abstract
Continuous action policy search is currently the focus of intensive research, driven both by the recent success of deep reinforcement learning algorithms and the emergence of competitors based on evolutionary algorithms. In this paper, we present a broad survey of policy search methods, providing a unified perspective on very different approaches, including also Bayesian Optimization and directed exploration methods. The main message of this overview is in the relationship between the families of methods, but we also outline some factors underlying sample efficiency properties of the various approaches.
| elib URL of the record: | https://elib.dlr.de/130603/ |
|---|---|
| Document type: | Journal article |
| Title: | Policy search in continuous action domains: an overview |
| Authors: | Sigaud, Olivier; Stulp, Freek |
| Date: | May 2019 |
| Published in: | Neural Networks |
| Refereed publication: | Yes |
| Open Access: | Yes |
| Gold Open Access: | No |
| In SCOPUS: | Yes |
| In ISI Web of Science: | Yes |
| Volume: | 113 |
| DOI: | 10.1016/j.neunet.2019.01.011 |
| Page range: | pp. 28-40 |
| Publisher: | Elsevier |
| ISSN: | 0893-6080 |
| Status: | Published |
| Keywords: | Reinforcement Learning, Policy Search, Artificial Intelligence |
| HGF - Research field: | Aeronautics, Space and Transport |
| HGF - Program: | Space |
| HGF - Program theme: | Space System Technology |
| DLR - Research area: | Space |
| DLR - Research field: | R SY - Space System Technology |
| DLR - Research theme (project): | R - Vorhaben Intelligente Mobilität (old) |
| Location: | Oberpfaffenhofen |
| Institutes and institutions: | Institute of Robotics and Mechatronics (since 2013) > Cognitive Robotics |
| Deposited by: | Stulp, Freek |
| Deposited on: | 18 Nov 2019 08:54 |
| Last modified: | 13 Feb 2020 09:34 |