elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

A hybrid learning agent for episodic learning tasks with unknown target distance

Sefrin, Oliver and Wölk, Sabine Esther (2025) A hybrid learning agent for episodic learning tasks with unknown target distance. Quantum Machine Intelligence, 7 (1), p. 52. Springer Nature. doi: 10.1007/s42484-025-00269-1. ISSN 2524-4906.

[img] PDF - Published version
994kB

Official URL: https://link.springer.com/article/10.1007/s42484-025-00269-1

Abstract

The "hybrid agent for quantum-accessible reinforcement learning," as defined in (Hamann and Wölk New J Phys 24:033044 2022), provides a proven quasi-quadratic speedup and is experimentally tested. However, the standard version can only be applied to episodic learning tasks with fixed episode length. In many real-world applications, the information about the necessary number of steps within an episode to reach a defined target is not available in advance and especially before reaching the target for the first time. Furthermore, in such scenarios, classical agents have the advantage of observing at which step they reach the target. How to best deal with an unknown target distance in classical and quantum reinforcement learning and whether the hybrid agent can provide an advantage in such learning scenarios is unknown so far. In this work, we introduce a hybrid agent with a stochastic episode length selection strategy to alleviate the need for knowledge about the necessary episode length. Through simulations, we test the adapted hybrid agent’s performance versus classical counterparts with and without similar episode selection strategies. Our simulations demonstrate a speedup in certain scenarios due to our developed episode length selection strategy for classical learning agents as well as an additional speedup for our resulting hybrid learning agent.

Item URL in elib:https://elib.dlr.de/213691/
Document Type:Article
Title:A hybrid learning agent for episodic learning tasks with unknown target distance
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Sefrin, OliverUNSPECIFIEDhttps://orcid.org/0000-0002-1111-7787184797597
Wölk, Sabine EstherUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:11 April 2025
Journal or Publication Title:Quantum Machine Intelligence
Refereed publication:Yes
Open Access:Yes
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:Yes
Volume:7
DOI:10.1007/s42484-025-00269-1
Page Range:p. 52
Publisher:Springer Nature
ISSN:2524-4906
Status:Published
Keywords:Quantum reinforcement learning, Amplitude amplification, Hybrid algorithm, Navigation problem
HGF - Research field:other
HGF - Program:other
HGF - Program Themes:other
DLR - Research area:Quantum Computing Initiative
DLR - Program:QC SW - Software
DLR - Research theme (Project):QC - Qlearning
Location: Ulm
Institutes and Institutions:Institute of Quantum Technologies > Quantum Information and Communication
Deposited By: Sefrin, Oliver
Deposited On:26 May 2025 21:51
Last Modified:26 May 2025 21:51

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.