Mengel, Friederike and Rivas, Javier (2012) An Axiomatization of Learning Rules when Counterfactuals are not Observed. The B.E. Journal of Theoretical Economics, 12 (1). pp. 1-19. DOI https://doi.org/10.1515/1935-1704.1828
Mengel, Friederike and Rivas, Javier (2012) An Axiomatization of Learning Rules when Counterfactuals are not Observed. The B.E. Journal of Theoretical Economics, 12 (1). pp. 1-19. DOI https://doi.org/10.1515/1935-1704.1828
Mengel, Friederike and Rivas, Javier (2012) An Axiomatization of Learning Rules when Counterfactuals are not Observed. The B.E. Journal of Theoretical Economics, 12 (1). pp. 1-19. DOI https://doi.org/10.1515/1935-1704.1828
Abstract
In this paper we study learning procedures when counterfactuals (payoffs of not chosen actions) are not observed. The decision maker reasons in two steps: First, she updates her propensities for choosing each action after every payoff experience, where propensities can be interpreted as preferences. Then, she transforms these propensities into choice probabilities. We introduce a set of axioms on how propensities are updated and on how these propensities are translated into choices and study the decision marker's behavior when such axioms are in place. Our characterization includes the linear reinforcement learning rule from Roth and Erev (1995).
Item Type: | Article |
---|---|
Uncontrolled Keywords: | learning without counterfactuals; partial information; reinforcement learning |
Subjects: | H Social Sciences > HB Economic Theory |
Divisions: | Faculty of Social Sciences Faculty of Social Sciences > Economics, Department of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 12 May 2013 16:36 |
Last Modified: | 05 Dec 2024 16:54 |
URI: | http://repository.essex.ac.uk/id/eprint/5816 |