Luna Ramirez, Wulfrano Arturo (2019) Plan Acquisition Through Intentional Learning in BDI Multi-Agent Systems. PhD thesis, University of Essex.
Luna Ramirez, Wulfrano Arturo (2019) Plan Acquisition Through Intentional Learning in BDI Multi-Agent Systems. PhD thesis, University of Essex.
Luna Ramirez, Wulfrano Arturo (2019) Plan Acquisition Through Intentional Learning in BDI Multi-Agent Systems. PhD thesis, University of Essex.
Abstract
Multi-Agent Systems (MAS), a technique emanating from Distributed Artificial Intelligence, is a suitable technique to study complex systems. They make it possible to represent and simulate both elements and interrelations of systems in a variety of domains. The most commonly used approach to develop the individual components (agents) within MAS is reactive agency. However, other architectures, like cognitive agents, enable richer behaviours and interactions to be captured and modelled. The well-known Belief-Desire-Intentions architecture (BDI) is a robust approach to develop cognitive agents and it can emulate aspects of autonomous behaviour and is thus a promising tool to simulate social systems. Machine Learning has been applied to improve the behaviour of agents both individually or collectively. However, the original BDI model of agency, is lacking learning as part of its core functionalities. To cope with learning, the BDI agency has been extended by Intentional Learning (IL) operating at three levels: belief adjustment, plan selection, and plan acquisition. The latter makes it possible to increase the agent’s catalogue of skills by generating new procedural knowledge to be used onwards. The main contributions of this thesis are: a) the development of IL in a fully-fledged BDI framework at the plan acquisition level, b) extending IL from the single-agent case to the collective perspective; and c) a novel framework that melts reactive and BDI agents through integrating both MAS and Agent-Based Modelling approaches, it allows the configuration of diverse domains and environments. Learning is demonstrated in a test-bed environment to acquire a set of plans that drive the agent to exhibit behaviours such as target-searching and left-handed wall-following. Learning in both decision strata, single and collective, is tested in a more challenging and socially relevant environment: the Disaster-Rescue problem.
Item Type: | Thesis (PhD) |
---|---|
Uncontrolled Keywords: | Multi-Agent Systems, Machine Learning, Agent-Based Modelling, Disaster-Rescue Simulations, Jason, NetLogo |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
Depositing User: | Wulfrano Luna Ramirez |
Date Deposited: | 14 Aug 2019 08:07 |
Last Modified: | 14 Aug 2019 08:10 |
URI: | http://repository.essex.ac.uk/id/eprint/25083 |
Available files
Filename: thesisLRWA_1400277.pdf