{si,ai}k i=0 ,sk+1. ) . The saddle-point problem (8) is similar to the one-step Lagrangian (6): the dual critic, V , and weighted k-step actor, α(s0) ∏.
確定! 回上一頁