We compare subjects' behavior in the three different games to two continuous reinforcement learning models, where one is a policy gradient model with a ...