Controlling systems with dynamic behaviour is a challenging problem. This paper uses a policy gradient method to balance a cart-pole inverted pendulum. The purpose of ...