For example, a commonly used meta-reinforcement learning benchmark uses different running velocities for a simulated robot as different tasks.
確定! 回上一頁