Lucas, Simon M (2010) Estimating learning rates in evolution and TDL: Results on a simple grid-world problem. In: 2010 IEEE Symposium on Computational Intelligence and Games (CIG), 2010-08-18 - 2010-08-21.
Lucas, Simon M (2010) Estimating learning rates in evolution and TDL: Results on a simple grid-world problem. In: 2010 IEEE Symposium on Computational Intelligence and Games (CIG), 2010-08-18 - 2010-08-21.
Lucas, Simon M (2010) Estimating learning rates in evolution and TDL: Results on a simple grid-world problem. In: 2010 IEEE Symposium on Computational Intelligence and Games (CIG), 2010-08-18 - 2010-08-21.
Abstract
When learning to play a game or perform some task, it is important to learn as quickly and effectively as possible by making best use of the available information. Interesting insights can be gained by studying the learning process from an information theory perspective, and analysing the learning speed in terms of the maximum number of bits that could be learned per game/task, or per action. Previous work has applied this analysis to co-evolution and to temporal difference learning (TDL) for a simple board game with a fixed number of moves. This paper analyses a grid-world problem and calculates the upper bounds on the information rates for evolution and for TDL. The results show an interesting relationship between the upper bounds of the learning rates and the actual information acquisition rates that are achieved in practice. Also, which method works best is highly dependent on the choice of function approximator. © 2010 IEEE.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Additional Information: | Published proceedings: Proceedings of the 2010 IEEE Conference on Computational Intelligence and Games, CIG2010 |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 19 Oct 2012 14:43 |
Last Modified: | 24 Oct 2024 20:44 |
URI: | http://repository.essex.ac.uk/id/eprint/4059 |