Learning to Prune Dominated Action Sequences in Online Black-Box Planning

Artigo Acesso aberto

Learning to Prune Dominated Action Sequences in Online Black-Box Planning

2017; Association for the Advancement of Artificial Intelligence; Volume: 31; Issue: 1 Linguagem: Inglês

10.1609/aaai.v31i1.10663

ISSN

2374-3468

Autores

Yuu Jinnai, Alex Fukunaga,

Tópico(s)

Machine Learning and Algorithms

Resumo

Black-box domains where the successor states generated by applying an action are generated by a completely opaque simulator pose a challenge for domain-independent planning. The main computational bottleneck in search-based planning for such domains is the number of calls to the black-box simulation. We propose a method for significantly reducing the number of calls to the simulator by the search algorithm by detecting and pruning sequences of actions which are dominated by others. We apply our pruning method to Iterated Width and breadth-first search in domain-independent black-box planning for Atari 2600 games in the Arcade Learning Environment (ALE), adding our pruning method significantly improves upon the baseline algorithms.

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

Learning to Prune Dominated Action Sequences in Online Black-Box Planning