Convergence of reinforcement learning to Nash equilibrium: A search-market experiment

Authored by E Darmon, R Waldeck

Date Published: 2005-09-01

DOI: 10.1016/j.physa.2005.02.074

Sponsors: French National Center for Scientific Research

Platforms: Java

Model Documentation: Other Narrative Mathematical description

Model Code URLs: Model code not found

Abstract

Since the introduction of Reinforcement Learning (RL) in Game Theory, a growing literature is concerned with the theoretical convergence of RL-driven outcomes towards Nash equilibrium. In this paper, we apply this issue to a search-theoretic framework (posted-price market) where sellers are confronted with a population of imperfectly informed buyers and take one decision per period (posted prices) with no direct interactions between sellers. We focus on three different scenarios with varying buyers' characteristics. For each of these scenarios, we quantitatively and qualitatively test whether the learned variable (price strategy) converges to the Nash equilibrium. We also study the impact of the temperature parameter (defining. the exploitation/exploration trade off) on these results. (c) 2005 Elsevier B.V. All rights reserved.
Tags
Agent-based modeling reinforcement learning Nash Equilibrium search market