def __init__(self, obs_size: int, n_actions: int, hidden_size: int = 128): ... we choose an action at random, and otherwise we choose the “greedy” action ...
確定! 回上一頁