Ptt 大爆卦 | Critic - 前往 http://incompleteideas.net/book/ebook/node66.html

你即將離開本站

並前往http://incompleteideas.net/book/ebook/node66.html

6.6 Actor-Critic Methods

Actor-critic methods are the natural extension of the idea of reinforcement comparison methods (Section 2.8) to TD learning and to the full reinforcement ...

確定！回上一頁

查詢「Critic」的人也找了：