... -F.cross_entropy(model_outs['actor'], actions.long(), reduction='none')
ratio = torch.exp(new_log_probs - log_probs)
clip_ratio = torch.clamp(ratio, ...
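
The fragment above is cut off on both sides, so the following is only a minimal sketch of what it appears to compute, written in plain Python for a single sample so it runs without PyTorch. It relies on the fact that the negative cross-entropy of the taken action equals that action's log-probability under the softmax policy. The clip range `eps`, the `advantage` argument, the final `min`, and the helper names `log_softmax` and `clipped_surrogate` are assumptions taken from the usual PPO clipped objective, not from the original code.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def clipped_surrogate(new_logits, action, old_log_prob, advantage, eps=0.2):
    """Per-sample PPO-style clipped surrogate (hypothetical helper).

    eps and advantage are assumed; the source is truncated before them.
    """
    # -cross_entropy(logits, action) is the log-probability of `action`.
    new_log_prob = log_softmax(new_logits)[action]
    # Probability ratio between the new and old policies.
    ratio = math.exp(new_log_prob - old_log_prob)
    # Keep the ratio inside [1 - eps, 1 + eps].
    clipped = min(max(ratio, 1.0 - eps), 1.0 + eps)
    # Pessimistic (lower) bound of the two surrogate terms.
    return min(ratio * advantage, clipped * advantage)
```

In a batched PyTorch version the same steps would map onto `F.cross_entropy(..., reduction='none')`, `torch.exp`, and `torch.clamp`, as in the fragment.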