Posted on Tue 27 October 2020

MuZero talk - ICAPS 2020

I gave a detailed talk about MuZero at ICAPS 2020, at the workshop "Bridging the Gap Between AI Planning and Reinforcement Learning".

Besides giving an overview of the algorithm, I went into more detail about reanalyse, the technique that lets MuZero use model-based search to repeatedly learn more from the same episode data.
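As a rough illustration of the reanalyse idea, here is a minimal toy sketch in Python. It is not MuZero's actual implementation: `Step`, `make_toy_search` and `reanalyse` are hypothetical names made up for this example, the "search" is a simple stand-in function rather than a tree search with a learned model, and using the search value directly as the value target is a simplification. The only point it shows is that the stored observations, actions and rewards stay fixed, while the training targets are recomputed by running search again with a newer network.

```python
import math
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical interface: a search maps an observation to (policy, value).
SearchFn = Callable[[float], Tuple[List[float], float]]


@dataclass
class Step:
    observation: float          # what the agent saw (a single number in this toy)
    action: int                 # action actually taken when the episode was played
    reward: float               # reward actually received
    policy_target: List[float]  # training target produced by search
    value_target: float         # training target produced by search


def make_toy_search(sharpness: float) -> SearchFn:
    """Stand-in for 'search with the current network' (assumption, not MCTS).

    A larger sharpness mimics a better-trained network that is more
    confident about which of the two actions is good.
    """
    def search(observation: float) -> Tuple[List[float], float]:
        logits = [-sharpness * observation, sharpness * observation]
        top = max(logits)
        exps = [math.exp(x - top) for x in logits]  # numerically stable softmax
        total = sum(exps)
        return [e / total for e in exps], math.tanh(sharpness * observation)
    return search


def reanalyse(episode: List[Step], search: SearchFn) -> List[Step]:
    """Return a copy of the episode with targets recomputed by `search`.

    Observations, actions and rewards are kept exactly as recorded;
    only the policy and value targets are refreshed.
    """
    fresh = []
    for step in episode:
        policy, value = search(step.observation)
        fresh.append(Step(step.observation, step.action, step.reward, policy, value))
    return fresh


# Episode generated early in training, with targets from an untrained "network".
old_search = make_toy_search(sharpness=0.0)
episode = [Step(obs, 1, 1.0, *old_search(obs)) for obs in (0.5, 1.0)]

# Later, the same stored episode is searched again with an improved "network".
updated = reanalyse(episode, make_toy_search(sharpness=2.0))
```

Because `reanalyse` returns new `Step` objects, the same stored episode can be passed through it again whenever the network improves, which is what lets one batch of environment data yield fresh training targets many times.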

I hope you find the talk useful! I've also uploaded my slides for easy reference.

Tags: ai, programming, rl, go, atari, muzero, alphazero

© Julian Schrittwieser.