Exploitation-Oriented Learning with Deep Learning – Introducing Profit Sharing to a Deep Q-Network

Artigo Acesso aberto Revisado por pares

Exploitation-Oriented Learning with Deep Learning – Introducing Profit Sharing to a Deep Q-Network –

2017; Fuji Technology Press Ltd.; Volume: 21; Issue: 5 Linguagem: Inglês

10.20965/jaciii.2017.p0849

ISSN

1343-0130

Autores

Kazuteru Miyazaki,

Tópico(s)

Digital Games and Media

Resumo

Currently, deep learning is attracting significant interest. Combining deep Q-networks (DQNs) and Q-learning has produced excellent results for several Atari 2600 games. In this paper, we propose an exploitation-oriented learning (XoL) method that incorporates deep learning to reduce the number of trial-and-error searches. We focus on a profit sharing (PS) method that is an XoL method, and combine it with a DQN to propose a DQNwithPS method. This method is compared with a DQN in Atari 2600 games. We demonstrate that the proposed DQNwithPS method can learn stably with fewer trial-and-error searches than required by only a DQN.

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

Exploitation-Oriented Learning with Deep Learning – Introducing Profit Sharing to a Deep Q-Network –