By contrast, a model-based algorithm uses the transition function (and the reward function) to estimate the optimal ...
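
As a rough illustration of what "using the transition function and the reward function" can look like, here is a minimal sketch of value iteration on a tiny MDP. Value iteration is chosen here only as one standard model-based method; the text above does not name a specific algorithm. The two-state MDP, the names `P`, `R`, `GAMMA`, `q_value`, and `value_iteration`, and all numbers are hypothetical and exist only for this example.

```python
# Hypothetical 2-state, 2-action MDP; every number here is an illustrative assumption.
# P[s][a] is the transition function: a list of (probability, next_state) pairs.
# R[s][a] is the reward function: the immediate reward for taking action a in state s.
P = {
    0: {0: [(1.0, 0)], 1: [(0.8, 1), (0.2, 0)]},
    1: {0: [(1.0, 0)], 1: [(1.0, 1)]},
}
R = {
    0: {0: 0.0, 1: 0.0},
    1: {0: 0.0, 1: 1.0},
}
GAMMA = 0.9  # discount factor (assumed)


def q_value(V, s, a):
    """One-step lookahead that reads the model (P and R) directly."""
    return R[s][a] + GAMMA * sum(p * V[s2] for p, s2 in P[s][a])


def value_iteration(tol=1e-8, max_iters=10_000):
    """Repeat Bellman optimality backups until the values stop changing."""
    V = {s: 0.0 for s in P}
    for _ in range(max_iters):
        new_V = {s: max(q_value(V, s, a) for a in P[s]) for s in P}
        delta = max(abs(new_V[s] - V[s]) for s in P)
        V = new_V
        if delta < tol:
            break
    # Greedy policy with respect to the converged value estimates.
    policy = {s: max(P[s], key=lambda a: q_value(V, s, a)) for s in P}
    return V, policy


V, policy = value_iteration()
print(V, policy)
```

Note that the loop never samples from an environment: every update is computed from `P` and `R` themselves, which is the sense in which the method is model-based.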