This lets the agent choose the best action in each state. In this example, our agent has 4 actions (up, down, left, right) and 5 possible states (Start, Blank, ...
確定! 回上一頁