Sefrin, Oliver and Wölk, Sabine Esther (2025) A hybrid learning agent for episodic learning tasks with unknown target distance. Quantum Machine Intelligence, 7 (1), p. 52. Springer Nature. doi: 10.1007/s42484-025-00269-1. ISSN 2524-4906.
|
PDF
- Published version
994kB |
Official URL: https://link.springer.com/article/10.1007/s42484-025-00269-1
Abstract
The "hybrid agent for quantum-accessible reinforcement learning," as defined in (Hamann and Wölk New J Phys 24:033044 2022), provides a proven quasi-quadratic speedup and is experimentally tested. However, the standard version can only be applied to episodic learning tasks with fixed episode length. In many real-world applications, the information about the necessary number of steps within an episode to reach a defined target is not available in advance and especially before reaching the target for the first time. Furthermore, in such scenarios, classical agents have the advantage of observing at which step they reach the target. How to best deal with an unknown target distance in classical and quantum reinforcement learning and whether the hybrid agent can provide an advantage in such learning scenarios is unknown so far. In this work, we introduce a hybrid agent with a stochastic episode length selection strategy to alleviate the need for knowledge about the necessary episode length. Through simulations, we test the adapted hybrid agent’s performance versus classical counterparts with and without similar episode selection strategies. Our simulations demonstrate a speedup in certain scenarios due to our developed episode length selection strategy for classical learning agents as well as an additional speedup for our resulting hybrid learning agent.
| Item URL in elib: | https://elib.dlr.de/213691/ | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Document Type: | Article | ||||||||||||
| Title: | A hybrid learning agent for episodic learning tasks with unknown target distance | ||||||||||||
| Authors: |
| ||||||||||||
| Date: | 11 April 2025 | ||||||||||||
| Journal or Publication Title: | Quantum Machine Intelligence | ||||||||||||
| Refereed publication: | Yes | ||||||||||||
| Open Access: | Yes | ||||||||||||
| Gold Open Access: | No | ||||||||||||
| In SCOPUS: | Yes | ||||||||||||
| In ISI Web of Science: | Yes | ||||||||||||
| Volume: | 7 | ||||||||||||
| DOI: | 10.1007/s42484-025-00269-1 | ||||||||||||
| Page Range: | p. 52 | ||||||||||||
| Publisher: | Springer Nature | ||||||||||||
| ISSN: | 2524-4906 | ||||||||||||
| Status: | Published | ||||||||||||
| Keywords: | Quantum reinforcement learning, Amplitude amplification, Hybrid algorithm, Navigation problem | ||||||||||||
| HGF - Research field: | other | ||||||||||||
| HGF - Program: | other | ||||||||||||
| HGF - Program Themes: | other | ||||||||||||
| DLR - Research area: | Quantum Computing Initiative | ||||||||||||
| DLR - Program: | QC SW - Software | ||||||||||||
| DLR - Research theme (Project): | QC - Qlearning | ||||||||||||
| Location: | Ulm | ||||||||||||
| Institutes and Institutions: | Institute of Quantum Technologies > Quantum Information and Communication | ||||||||||||
| Deposited By: | Sefrin, Oliver | ||||||||||||
| Deposited On: | 26 May 2025 21:51 | ||||||||||||
| Last Modified: | 26 May 2025 21:51 |
Repository Staff Only: item control page