Q -Learning Numerical Examples ... and initial state as room B. ... First we set matrix Q as a zero matrix. ... I put again the instant reward matrix R that represents ...
確定! 回上一頁