Ptt 大爆卦 | CartPole v0 vs v1 - 前往 https://mropengate.blogspot.com/2016/12/q-learning-openai-gym-cart-pole-system.html

你即將離開本站

並前往https://mropengate.blogspot.com/2016/12/q-learning-openai-gym-cart-pole-system.html

Q-learning 與類神經網路：用OpenAI gym 模擬木棒台車平衡 ...

... ACE) : 產生一個比較好的reward 訊號，使用Temporal Difference (TD) 方法。 ... https://gym.openai.com/envs/CartPole-v0 ...

確定！回上一頁

查詢「CartPole v0 vs v1」的人也找了：

Gym environment

Pip install gym

Gym reward threshold