MuZero is model-based and it's learning the model. This is stated in the paper (page 2, first statement) and mentioned in Agent57 paper as ...
確定! 回上一頁