Artigo Acesso aberto Revisado por pares

Existence of the uniform value in zero-sum repeated games with a more informed controller

2014; American Institute of Mathematical Sciences; Volume: 1; Issue: 3 Linguagem: Inglês

10.3934/jdg.2014.1.411

ISSN

2164-6074

Autores

Fabien Gensbittel, Miquel Oliu‐Barton, Xavier Venel,

Tópico(s)

Reinforcement Learning in Robotics

Resumo

We prove that in a two-player zero-sum repeated game where one of the players, say player $1$, is more informed than his opponent and controls the evolution of information on the state, the uniform value exists.This result extends previous results on Markov decision processes with partial observation (Rosenberg, Solan, Vieille[15]), and repeated games with an informed controller (Renault [14]). Our formal definition of a more informed player is more general than the inclusion of signals, allowing therefore for imperfect monitoring of actions. We construct an auxiliary stochastic game whose state space is the set of second order beliefs of player $2$ (beliefs about beliefs of player $1$ on the state variable of the original game) with perfect monitoring and we prove it has a value by using a result of Renault [14]. A key element in this work is to prove that player $1$ can use strategies of the auxiliary game in the original game in our general framework, from whichwe deduce that the value of the auxiliary game is also the value of our original game by using classical arguments.

Referência(s)