Training was performed on 8 Nvidia V100 GPUs (16 GB) using a batch size of 32, layer normaliza-tion for both the encoder and the decoder (Xu ...
確定! 回上一頁