Pisheh Var, Mahrad (2024) Minimalistic Adaptive Dynamic-Programming Agents for Memory-Driven Exploration. Doctoral thesis, University Of Essex.
Pisheh Var, Mahrad (2024) Minimalistic Adaptive Dynamic-Programming Agents for Memory-Driven Exploration. Doctoral thesis, University Of Essex.
Pisheh Var, Mahrad (2024) Minimalistic Adaptive Dynamic-Programming Agents for Memory-Driven Exploration. Doctoral thesis, University Of Essex.
Abstract
Adaptive Dynamic Programming (ADP) and Reinforcement Learning (RL) are pivotal frameworks in machine learning, each presenting unique benefits and hurdles. This thesis examines the performance and adaptability of ADP agents using Backpropagation Through Time (BPTT) in continuous spaces. A Memory-Based Backpropagation Through Time (MBPTT) is reviewed, enhancing the conventional BPTT approach by integrating memory mechanisms to refine decision-making in partially observable environments. Drawing upon foundational and recent developments in RL and ADP, this study explores the capability of BPTT agents across various environmental settings. It critically assesses different algorithms and memory models, including Long Short-Term Memory (LSTM) and Gated Recurrent Units (GRU), in simulating a ``functionally sentient'' organism seeking food. The research makes two main contributions. Firstly, it empirically shows that even the simplest forms of memory-augmented agents can effectively navigate through a maze, performing better than existing techniques. This highlights the practical use of memory-based algorithms in spatial tasks. Secondly, the study investigates the performance of Backpropagation Through Time (BPTT) in bicycle navigation. It introduces a simulated organism that successfully combines BPTT with memory functions, demonstrating efficiency in environmental mapping and food search tasks. This work provides a solid foundation for future research in integrated learning systems. In conclusion, this thesis reconciles the theoretical distinctions between memory and adaptive dynamic programming. Combining theoretical understanding with practical applications contributes to the ongoing effort to create more resilient, efficient, and adaptive agents in the rapidly advancing field of machine learning.
Item Type: | Thesis (Doctoral) |
---|---|
Uncontrolled Keywords: | Adaptive Dynamic Programming, Backpropagation Through Time, Continuous Spaces, Discrete Spaces, Environment Exploration, Food-Seeking Behavior, Gated Recurrent Unit, Long Short-Term Memory, Maze Navigation, Memory-Augmented Agents, Optimal Policies, Partially Observable Environments, Reinforcement Learning, Simulated Organism, Spatial Intelligence |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
Depositing User: | Mahrad Pisheh Var |
Date Deposited: | 19 Jun 2024 15:22 |
Last Modified: | 19 Jun 2024 15:22 |
URI: | http://repository.essex.ac.uk/id/eprint/38575 |
Available files
Filename: Minimalistic Adaptive Dynamic-Programming Agents for Memory-Driven Exploration Thesis.pdf