Learning to Prune Dominated Action Sequences in Online Black-Box Planning
2017; Association for the Advancement of Artificial Intelligence; Volume: 31; Issue: 1; Language: English
10.1609/aaai.v31i1.10663
ISSN: 2374-3468
Topic(s): Machine Learning and Algorithms
Abstract: Black-box domains, in which the successor states that result from applying an action are generated by a completely opaque simulator, pose a challenge for domain-independent planning. The main computational bottleneck in search-based planning for such domains is the number of calls to the black-box simulator. We propose a method for significantly reducing the number of simulator calls made by the search algorithm by detecting and pruning sequences of actions that are dominated by others. We apply our pruning method to Iterated Width and breadth-first search in domain-independent black-box planning for Atari 2600 games in the Arcade Learning Environment (ALE). Adding our pruning method significantly improves upon the baseline algorithms.
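The abstract does not spell out how dominated sequences are detected, so the following is only a toy illustration of the general idea, not the paper's method. It assumes a hypothetical `CountingSimulator` with four actions, one of which (`RIGHT_ALIAS`) is a redundant copy of another, and treats a single action as a length-1 sequence. An action is marked as dominated if, on every sampled state, it produces the same successor as an earlier action; breadth-first search then skips it and so makes fewer simulator calls. All names here are invented for the example, and the sample-based check is a heuristic that could prune wrongly in a real domain.

```python
from collections import deque


class CountingSimulator:
    """Hypothetical opaque simulator: the planner only sees step()."""

    ACTIONS = ("LEFT", "RIGHT", "NOOP", "RIGHT_ALIAS")

    def __init__(self):
        self.calls = 0  # number of simulator calls made so far

    def step(self, state, action):
        self.calls += 1
        if action == "LEFT":
            return state - 1
        if action in ("RIGHT", "RIGHT_ALIAS"):
            return state + 1
        return state  # NOOP leaves the state unchanged


def learn_dominated(sim, sample_states):
    """Return actions that matched an earlier action on every sampled state."""
    sample_states = list(sample_states)
    dominated = set()
    for i, action in enumerate(sim.ACTIONS):
        for earlier in sim.ACTIONS[:i]:
            if earlier in dominated:
                continue
            if all(sim.step(s, action) == sim.step(s, earlier)
                   for s in sample_states):
                dominated.add(action)
                break
    return dominated


def bfs(sim, start, goal, max_depth, pruned=frozenset()):
    """Breadth-first search that skips pruned actions; returns goal depth."""
    frontier = deque([(start, 0)])
    seen = {start}
    while frontier:
        state, depth = frontier.popleft()
        if state == goal:
            return depth
        if depth == max_depth:
            continue
        for action in sim.ACTIONS:
            if action in pruned:
                continue  # no simulator call is spent on a pruned action
            successor = sim.step(state, action)
            if successor not in seen:
                seen.add(successor)
                frontier.append((successor, depth + 1))
    return None


if __name__ == "__main__":
    baseline = CountingSimulator()
    bfs(baseline, start=0, goal=3, max_depth=5)

    learner = CountingSimulator()
    dominated = learn_dominated(learner, range(-3, 4))

    with_pruning = CountingSimulator()
    bfs(with_pruning, start=0, goal=3, max_depth=5, pruned=dominated)

    print("dominated actions:", dominated)
    print("simulator calls without pruning:", baseline.calls)
    print("simulator calls with pruning:", with_pruning.calls)
```

In this sketch the calls spent while learning are counted on a separate simulator instance; in an online setting that cost would have to be weighed against the calls saved during search.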