We refer to this proposed curiosity formulation as Intrinsic Curiosity Module (ICM). ... a sparse terminal reward of +1 if it finds the vest and ...
確定! 回上一頁