More specifically, CartPole-v0 is solved if the pole can be balanced for 200 steps ...

    env = gym.make('CartPole-v1')
    print(env.observation_space)  # four ...
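The "balanced for 200 steps" criterion can be sketched as a small check on episode lengths. This is only an illustration under stated assumptions: the helper names `episode_solved` and `fraction_solved` and the constant `MAX_STEPS` are hypothetical and are not part of gym, and the code works on plain step counts rather than on a live environment.

```python
from typing import Sequence

# Step limit quoted in the text for CartPole-v0 (assumed here as the default).
MAX_STEPS = 200


def episode_solved(steps_balanced: int, max_steps: int = MAX_STEPS) -> bool:
    """Return True if the pole stayed up for the full step limit."""
    return steps_balanced >= max_steps


def fraction_solved(episode_lengths: Sequence[int], max_steps: int = MAX_STEPS) -> float:
    """Share of episodes that reached the step limit (0.0 when there are none)."""
    if not episode_lengths:
        return 0.0
    solved = sum(episode_solved(n, max_steps) for n in episode_lengths)
    return solved / len(episode_lengths)
```

In practice the step counts would come from running episodes in the environment and counting steps until each episode ends; they are passed in directly here so the sketch does not depend on any particular gym version.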