Deep Deterministic Policy Gradient是延續著Actor-Critic的觀念而來,是融合了Actor-Critic與DQN的experience replay而演化而來的演算法, ...
確定! 回上一頁