Neural Network Heuristic Functions for Classical Planning: Bootstrapping and Comparison to Other Methods.

International Conference on Automated Planning and Scheduling(2022)

引用 4|浏览50
暂无评分
摘要
How can we train neural network (NN) heuristic functions for classicalplanning, using only states as the NN input? Prior work addressed thisquestion by (a) per-instance imitation learning and/or (b) per-domainlearning. The former limits the approach to instances small enough fortraining data generation, the latter to domains where the necessaryknowledge generalizes across instances. Here we explore three methodsfor (a) that make training data generation scalable throughbootstrapping and approximate value iteration. In particular, weintroduce a new bootstrapping variant that estimates search effortinstead of goal distance, which as we show converges to the perfectheuristic under idealized circumstances. We empirically compare thesemethods to (a) and (b), aligning three different NN heuristic functionlearning architectures for cross-comparison in an experiment ofunprecedented breadth in this context. Key lessons are that ourmethods and imitation learning are highly complementary; thatper-instance learning often yields stronger heuristics than per-domainlearning; and the LAMA planner is still dominant but our methodsoutperform it in one benchmark domain.
更多
查看译文
关键词
Classical Planning,Heuristic Search,Learning Heuristic Functions
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要