(4) AlphaGo Zero is the program described in this paper. It learns from self-play reinforcement learning, starting from random initial weights, ...