Sigaud, Olivier and Stulp, Freek (2019) Policy search in continuous action domains: an overview. Neural Networks, 113, pp. 28-40. Elsevier. doi: 10.1016/j.neunet.2019.01.011. ISSN 0893-6080.
![]() |
PDF
- Preprint version (submitted draft)
758kB |
Abstract
Continuous action policy search is currently the focus of intensive research, driven both by the recent success of deep reinforcement learning algorithms and the emergence of competitors based on evolutionary algorithms. In this paper, we present a broad survey of policy search methods, providing a unified perspective on very different approaches, including also Bayesian Optimization and directed exploration methods. The main message of this overview is in the relationship between the families of methods, but we also outline some factors underlying sample efficiency properties of the various approaches.
Item URL in elib: | https://elib.dlr.de/130603/ | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Document Type: | Article | ||||||||||||
Title: | Policy search in continuous action domains: an overview | ||||||||||||
Authors: |
| ||||||||||||
Date: | May 2019 | ||||||||||||
Journal or Publication Title: | Neural Networks | ||||||||||||
Refereed publication: | Yes | ||||||||||||
Open Access: | Yes | ||||||||||||
Gold Open Access: | No | ||||||||||||
In SCOPUS: | Yes | ||||||||||||
In ISI Web of Science: | Yes | ||||||||||||
Volume: | 113 | ||||||||||||
DOI: | 10.1016/j.neunet.2019.01.011 | ||||||||||||
Page Range: | pp. 28-40 | ||||||||||||
Publisher: | Elsevier | ||||||||||||
ISSN: | 0893-6080 | ||||||||||||
Status: | Published | ||||||||||||
Keywords: | Reinforcement Learning, Policy Search, Artificial Intelligence | ||||||||||||
HGF - Research field: | Aeronautics, Space and Transport | ||||||||||||
HGF - Program: | Space | ||||||||||||
HGF - Program Themes: | Space System Technology | ||||||||||||
DLR - Research area: | Raumfahrt | ||||||||||||
DLR - Program: | R SY - Space System Technology | ||||||||||||
DLR - Research theme (Project): | R - Vorhaben Intelligente Mobilität (old) | ||||||||||||
Location: | Oberpfaffenhofen | ||||||||||||
Institutes and Institutions: | Institute of Robotics and Mechatronics (since 2013) > Cognitive Robotics | ||||||||||||
Deposited By: | Stulp, Freek | ||||||||||||
Deposited On: | 18 Nov 2019 08:54 | ||||||||||||
Last Modified: | 13 Feb 2020 09:34 |
Repository Staff Only: item control page