Performance analysis of a hybrid agent for quantum-accessible reinforcement learning

Hamann, Arne und Wölk, Sabine Esther (2022) Performance analysis of a hybrid agent for quantum-accessible reinforcement learning. New Journal of Physics, 24, 033044. Institute of Physics (IOP) Publishing. doi: 10.1088/1367-2630/ac5b56. ISSN 1367-2630.

PDF - Verlagsversion (veröffentlichte Fassung)
2MB

Kurzfassung

In the last decade quantum machine learning has provided fascinating and fundamental improvements to supervised, unsupervised and reinforcement learning (RL). In RL, a so-called agent is challenged to solve a task given by some environment. The agent learns to solve the task by exploring the environment and exploiting the rewards it gets from the environment. For some classical task environments, an analogue quantum environment can be constructed which allows to find rewards quadratically faster by applying quantum algorithms. In this paper, we analytically analyze the behavior of a hybrid agent which combines this quadratic speedup in exploration with the policy update of a classical agent. This leads to a faster learning of the hybrid agent compared to the classical agent. We demonstrate that if the classical agent needs on average ⟨J⟩ rewards and ⟨T⟩_cl epochs to learn how to solve the task, the hybrid agent will take <T>_q \leqslant {\alpha }_{s}{\alpha }_{o}\sqrt{< T>_c * < J> epochs on average. Here, \alpha_s and \alpha_o denote constants depending on details of the quantum search and are independent of the problem size. Additionally, we prove that if the environment allows for maximally \alph_o*k_max sequential coherent interactions, e.g. due to noise effects, an improvement given by ⟨T⟩q ≈ \alpha_o ⟨T⟩_cl/(4*k_max) is still possible

elib-URL des Eintrags:

https://elib.dlr.de/194355/

Dokumentart:

Zeitschriftenbeitrag

Titel:

Performance analysis of a hybrid agent for quantum-accessible reinforcement learning

Autoren:

Autoren	Institution oder E-Mail-Adresse	Autoren-ORCID-iD	ORCID Put Code
Hamann, Arne	Universität Innsbruck	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Wölk, Sabine Esther	Sabine.Woelk (at) dlr.de	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT

Datum:

2022

Erschienen in:

New Journal of Physics

Referierte Publikation:

Open Access:

Gold Open Access:

In SCOPUS:

In ISI Web of Science:

Band:

DOI:

10.1088/1367-2630/ac5b56

Seitenbereich:

033044

Verlag:

Institute of Physics (IOP) Publishing

ISSN:

1367-2630

Status:

veröffentlicht

Stichwörter:

Quantum computing, reinforcement learning

HGF - Forschungsbereich:

Luftfahrt, Raumfahrt und Verkehr

HGF - Programm:

Raumfahrt

HGF - Programmthema:

Kommunikation, Navigation, Quantentechnologien

DLR - Schwerpunkt:

Raumfahrt

DLR - Forschungsgebiet:

R KNQ - Kommunikation, Navigation, Quantentechnologie

DLR - Teilgebiet (Projekt, Vorhaben):

R - Bestärkendes Lernen mit Quantenalgorithmen

Standort:

Ulm

Institute & Einrichtungen:

Institut für Quantentechnologien > Theoretische Quantenphysik

Hinterlegt von:

Wölk, Sabine Esther

Hinterlegt am:

18 Mär 2023 12:16

Letzte Änderung:

18 Mär 2023 12:16

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags