Choose and return an action given the action probability distribution. Reward functions. tensorlayer.rein.discount_episode_rewards(rewards=None, gamma=0.99 ...
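
The two ideas named in this excerpt, sampling an action from a probability distribution and discounting a sequence of episode rewards with a factor `gamma`, can be illustrated with a small NumPy sketch. This is a generic illustration, not TensorLayer's implementation: the function names `choose_action` and `discounted_returns` are hypothetical, and the library's own `discount_episode_rewards` may take further parameters (its signature is truncated above) and may behave differently.

```python
import numpy as np


def choose_action(probs, rng=None):
    """Sample an action index according to the given probabilities (hypothetical helper)."""
    rng = np.random.default_rng() if rng is None else rng
    probs = np.asarray(probs, dtype=np.float64)
    probs = probs / probs.sum()  # renormalise to guard against rounding drift
    return int(rng.choice(len(probs), p=probs))


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step t (hypothetical helper)."""
    rewards = np.asarray(rewards, dtype=np.float64)
    out = np.zeros_like(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return already computed for the next step.
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        out[t] = running
    return out


if __name__ == "__main__":
    print(choose_action([0.1, 0.7, 0.2]))
    print(discounted_returns([0.0, 0.0, 1.0], gamma=0.5))
```

The backward loop keeps the computation linear in the episode length. Note that this sketch never resets the running sum partway through an episode; whether the library function does so is not stated in the excerpt.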