With this reward function, the policy synthesis procedure is "constrained" by the given specification. These constraints guide the MDP ...
確定! 回上一頁