Sefrin, Oliver und Wölk, Sabine Esther (2022) A Quantum Enhanced Learning Algorithm for Maze Problems. DPG-Frühjahrstagung der Sektion Atome, Moleküle, Quantenoptik und Photonik (SAMOP), 2022-03-14 - 2022-03-18, Erlangen, Deutschland. (nicht veröffentlicht)
PDF
2MB |
Kurzfassung
In reinforcement learning, a so-called agent should learn to optimally solve a given task by performing actions within an environment. As an example, we consider the grid-world, a two-dimensional maze for which the shortest way from an initial position to a given goal has to be found. The agent receives rewards for helpful actions which enables him to learn optimal solutions. For large action spaces, a mapping of actions to a quantum setting can be beneficial in finding rewarded actions faster and thus in speeding up the learning process. A hybrid agent which alternates between classical and quantum behavior has been developed previously for deterministic and strictly epochal environments. Here, strictly epochal means that an epoch consists of a fixed number of actions, after which the environment is reset to its initial state. We present and analyze strategies which aim at resolving the hybrid agent’s current restriction of searching for action sequences with a fixed length. This is a first step towards applying the hybrid agent on environments with a generally unknown optimal action sequence length such as in the grid-world problem.
elib-URL des Eintrags: | https://elib.dlr.de/202514/ | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Dokumentart: | Konferenzbeitrag (Poster) | ||||||||||||
Titel: | A Quantum Enhanced Learning Algorithm for Maze Problems | ||||||||||||
Autoren: |
| ||||||||||||
Datum: | März 2022 | ||||||||||||
Referierte Publikation: | Nein | ||||||||||||
Open Access: | Ja | ||||||||||||
Gold Open Access: | Nein | ||||||||||||
In SCOPUS: | Nein | ||||||||||||
In ISI Web of Science: | Nein | ||||||||||||
Status: | nicht veröffentlicht | ||||||||||||
Stichwörter: | Reinforcement Learning; Hybrid Quantum Algorithm | ||||||||||||
Veranstaltungstitel: | DPG-Frühjahrstagung der Sektion Atome, Moleküle, Quantenoptik und Photonik (SAMOP) | ||||||||||||
Veranstaltungsort: | Erlangen, Deutschland | ||||||||||||
Veranstaltungsart: | nationale Konferenz | ||||||||||||
Veranstaltungsbeginn: | 14 März 2022 | ||||||||||||
Veranstaltungsende: | 18 März 2022 | ||||||||||||
Veranstalter : | Deutsche Physikalische Gesellschaft (DPG) | ||||||||||||
HGF - Forschungsbereich: | keine Zuordnung | ||||||||||||
HGF - Programm: | keine Zuordnung | ||||||||||||
HGF - Programmthema: | keine Zuordnung | ||||||||||||
DLR - Schwerpunkt: | Quantencomputing-Initiative | ||||||||||||
DLR - Forschungsgebiet: | QC SW - Software | ||||||||||||
DLR - Teilgebiet (Projekt, Vorhaben): | QC - Qlearning | ||||||||||||
Standort: | Ulm | ||||||||||||
Institute & Einrichtungen: | Institut für Quantentechnologien > Theoretische Quantenphysik | ||||||||||||
Hinterlegt von: | Sefrin, Oliver | ||||||||||||
Hinterlegt am: | 15 Jul 2024 17:22 | ||||||||||||
Letzte Änderung: | 16 Jul 2024 09:43 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags