Pisheh Var, Mahrad and Fairbank, Michael and Samothrakis, Spyridon (2023) Finding Eulerian tours in mazes using amemory-augmented fixed policy function. In: Computing Conference, 2023-06-22 - 2023-06-23, London.
Pisheh Var, Mahrad and Fairbank, Michael and Samothrakis, Spyridon (2023) Finding Eulerian tours in mazes using amemory-augmented fixed policy function. In: Computing Conference, 2023-06-22 - 2023-06-23, London.
Pisheh Var, Mahrad and Fairbank, Michael and Samothrakis, Spyridon (2023) Finding Eulerian tours in mazes using amemory-augmented fixed policy function. In: Computing Conference, 2023-06-22 - 2023-06-23, London.
Abstract
This paper describes a simple memory augmentation technique that employs tabular Q-learning to solve binary cell structured mazes with exits generated randomly at the start of each solution attempt. A standard tabular Q-learning can solve any maze with continuous learning; however, if the learning is stopped and the policy is frozen, the agent will not adapt to solve newly generated exits. To avoid using Recurrent Neural Networks RNNs to solve memory-required tasks, we designed and implemented a simple external memory to remember the agent’s cell visit history. This memory also expands the state information to hold more information, assisting tabular Q-learning in distinguishing its path from entering and exiting a maze corridor. Experiments on five maze problems of varying complexity are presented. The maze has two and four predefined exits; the exit will be randomly assigned at the start of each solution attempt. The results show that tabular Q-learning with a frozen policy can outperform standard deep-learning algorithms without incorporating RNNs into the model structure
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 24 May 2023 13:39 |
Last Modified: | 01 Sep 2024 01:00 |
URI: | http://repository.essex.ac.uk/id/eprint/35671 |
Available files
Filename: Solving_Mazes_with_Randomized_Exits_usingMemory_Augmented_Tabular_Q_Learning (5).pdf