As the architecture in the paper suggests, you basically want to push each of the hidden states (which are themselves time distributed) into ...
確定! 回上一頁