Ptt 大爆卦 | CartPole DQN - Go to https://ithelp.ithome.com.tw/articles/10251599

Day 25 / DL x RL / Hello Reinforcement Learning: CartPole

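Deep Q-network (DQN) was introduced in the previous post on the Atari paper. In short, it uses a neural network to predict Q(s, a). The input is the state, which can be continuous-valued, and the number of outputs equals the number of actions, ...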
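
The input/output shape described above can be illustrated with a minimal sketch. It is not the article's implementation: it uses plain NumPy with untrained random weights, and the hidden width `HIDDEN` and the helper names `init_q_network`, `q_values`, and `greedy_action` are hypothetical choices made for illustration. The only facts it relies on are that a CartPole state has 4 continuous values and that there are 2 discrete actions.

```python
import numpy as np

STATE_DIM = 4   # CartPole state: cart position, cart velocity, pole angle, pole angular velocity
N_ACTIONS = 2   # CartPole actions: push left (0), push right (1)
HIDDEN = 16     # hypothetical hidden-layer width, chosen only for this sketch


def init_q_network(rng):
    """Create random (untrained) weights for a small two-layer network."""
    return {
        "W1": rng.normal(0.0, 0.1, size=(STATE_DIM, HIDDEN)),
        "b1": np.zeros(HIDDEN),
        "W2": rng.normal(0.0, 0.1, size=(HIDDEN, N_ACTIONS)),
        "b2": np.zeros(N_ACTIONS),
    }


def q_values(params, state):
    """Forward pass: a continuous state goes in, one Q(s, a) estimate per action comes out."""
    hidden = np.maximum(0.0, state @ params["W1"] + params["b1"])  # ReLU activation
    return hidden @ params["W2"] + params["b2"]


def greedy_action(params, state):
    """Pick the action whose predicted Q-value is largest."""
    return int(np.argmax(q_values(params, state)))


rng = np.random.default_rng(0)
params = init_q_network(rng)
state = np.array([0.01, -0.02, 0.03, 0.04])  # an arbitrary example state
q = q_values(params, state)                  # array of length N_ACTIONS
action = greedy_action(params, state)        # 0 or 1
```

Because the network emits all action values in a single forward pass, choosing the greedy action only needs an `argmax` over the output vector. Training the weights (replay buffer, target values, gradient updates) is outside the scope of this sketch.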
