I was asked some questions about the development of AlphaGo and figured my answers might be of wider interest, so here we go:
How low-level does the code need to be in order to get the maximum performance both in training and, possibly, live game?
To maximize playing strength it is necessary to balance maximum utilization of the accelerators (GPUs or TPUs) and focusing on exploring and evaluating the most promising line of play. Unfortunately there are several conflicting tendencies:
- The more board positions we can evaluate the better our estimates of how good each ...