Exploitation-Oriented Learning with Deep Learning – Introducing Profit Sharing to a Deep Q-Network –
2017; Fuji Technology Press Ltd.; Volume: 21; Issue: 5; Language: English
DOI: 10.20965/jaciii.2017.p0849
ISSN: 1343-0130
Topic(s): Digital Games and Media
Abstract: Currently, deep learning is attracting significant interest. Deep Q-networks (DQNs), which combine deep learning with Q-learning, have produced excellent results for several Atari 2600 games. In this paper, we propose an exploitation-oriented learning (XoL) method that incorporates deep learning to reduce the number of trial-and-error searches. We focus on profit sharing (PS), which is an XoL method, and combine it with a DQN to propose the DQNwithPS method. This method is compared with a DQN on Atari 2600 games. We demonstrate that the proposed DQNwithPS method can learn stably with fewer trial-and-error searches than a DQN alone requires.
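As a rough illustration of the profit sharing idea named in the abstract, the sketch below shows tabular credit assignment: when an episode ends with a reward, that reward is shared backward over the (state, action) pairs visited, with geometrically shrinking credit. It is not the paper's DQNwithPS method, whose details the abstract does not give. The function name `profit_sharing_update`, the decay ratio of 0.5, and the toy episode are all assumptions made for illustration.

```python
from collections import defaultdict


def profit_sharing_update(weights, episode, reward, decay=0.5):
    """Share `reward` backward over the (state, action) pairs of one episode.

    Hypothetical helper: the pair closest to the reward receives the full
    reward, and each earlier pair receives `decay` times the credit of the
    pair that followed it.
    """
    credit = reward
    for state, action in reversed(episode):
        weights[(state, action)] += credit
        credit *= decay
    return weights


# Toy episode (assumed): three steps, reward received after the last one.
weights = defaultdict(float)
episode = [("s0", "right"), ("s1", "right"), ("s2", "up")]
profit_sharing_update(weights, episode, reward=1.0, decay=0.5)

for pair in episode:
    print(pair, weights[pair])
```

Because credit is assigned only when a reward is actually obtained, an update of this kind reinforces action sequences that are known to have succeeded, which is the sense in which such methods are exploitation-oriented.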