
Why does AlphaZero perform better than vanilla MCTS? [closed]

Closed. This question is not about programming or software development. It is not currently accepting answers.

This question does not appear to be about a specific programming problem, a software algorithm, or software tools primarily used by programmers. If you believe the question would be on-topic on another Stack Exchange site, you can leave a comment to explain where the question may be able to be answered.

Closed 1 hour ago.

I understand that the main difference between AlphaZero and classic Monte Carlo tree search (MCTS) is that the playout (simulation) step is replaced with a neural network prediction, and that the network is itself trained on the output of the MCTS. How does this additional complexity improve performance?
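To make the distinction concrete, here is a minimal sketch of the two leaf-evaluation strategies on a toy take-away game (take 1 to 3 stones; whoever takes the last stone wins). The game, the function names, and `stub_value_net` are all assumptions made up for illustration. In particular, `stub_value_net` is a hand-written stand-in rather than a trained network, and the policy output that AlphaZero also uses to guide selection is left out.

```python
import random


def legal_moves(stones):
    """Moves available in the toy game: remove 1, 2, or 3 stones."""
    return [m for m in (1, 2, 3) if m <= stones]


def rollout_value(stones, rng):
    """Classic MCTS leaf evaluation: play uniformly random moves to the end.

    Returns +1 if the player to move at the leaf ends up winning, else -1.
    """
    player = 0  # 0 = the player to move at the leaf
    while stones > 0:
        stones -= rng.choice(legal_moves(stones))
        if stones == 0:
            return 1 if player == 0 else -1
        player = 1 - player
    return -1  # no stones left: the player to move has already lost


def stub_value_net(stones):
    """Hypothetical stand-in for a trained value network (NOT a real model).

    It hard-codes the known solution of this toy game: the player to move
    loses exactly when the stone count is a multiple of 4.
    """
    return -1.0 if stones % 4 == 0 else 1.0


def network_value(stones, value_net=stub_value_net):
    """AlphaZero-style leaf evaluation: one call to a value function."""
    return value_net(stones)


if __name__ == "__main__":
    rng = random.Random(0)
    samples = [rollout_value(4, rng) for _ in range(1000)]
    print("mean of 1000 random rollouts from 4 stones:", sum(samples) / len(samples))
    print("single value-function call from 4 stones:", network_value(4))
```

A single rollout yields one noisy ±1 sample whose accuracy depends on how well random play reflects strong play, while the value function returns an estimate in one call. How good that estimate is in a real system depends entirely on how well the network was trained.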

My guess is that classic MCTS would perform no worse than AlphaZero's hybrid approach on a system with unlimited memory. Since memory is constrained in the real world, the neural network is a workaround.
