...
batch_size = len(batch)
states_v = torch.tensor(states).to(device)
... next_distr[range(batch_size), next_actions]
dones = dones.astype(bool)
...
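The indexing step in the excerpt can be hard to read in isolation, so here is a minimal sketch of what it appears to do: pick one distribution per batch sample, keyed by that sample's action. The shapes, the values, and the names `n_actions`, `n_atoms` and `next_best_distr` are assumptions made for illustration; they do not come from the excerpt.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
batch_size, n_actions, n_atoms = 3, 2, 4

# Stand-in array: next_distr[i, a] holds n_atoms values for action a of sample i.
next_distr = np.arange(batch_size * n_actions * n_atoms, dtype=np.float32)
next_distr = next_distr.reshape(batch_size, n_actions, n_atoms)

# One action index per sample (assumed values).
next_actions = np.array([1, 0, 1])

# Pairing row i with next_actions[i] selects one row of atoms per sample,
# giving an array of shape (batch_size, n_atoms).
next_best_distr = next_distr[range(batch_size), next_actions]

# Episode-end flags stored as 0/1 become a boolean mask. The builtin bool
# is used because the np.bool alias is missing in some NumPy releases.
dones = np.array([0, 1, 0], dtype=np.uint8).astype(bool)

print(next_best_distr.shape)
print(dones)
```

Passing `range(batch_size)` as the first index and an integer array as the second makes NumPy pair them element by element, which is why the result has one row per sample rather than a full cross product.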