這章分享價值函數,內容會圍繞有A2C後,怎直接做價值評估。藉由推導估計與迭代方法,我們會得到著名已久的Q-learning。. “Value Function Methods” is published by ...
確定! 回上一頁