The batch size is 512 and the maximum length of a BERT input sequence is 64. Note that in the original BERT model, the maximum length is 512.
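To make the two settings concrete, here is a minimal sketch of what a fixed maximum length implies for a minibatch: every token sequence is truncated or right-padded to 64 entries, so one batch has shape 512 × 64. The helper `pad_or_truncate`, the padding id `0`, and the toy token ids are assumptions for illustration only and are not taken from the original text.

```python
# Values stated in the text.
batch_size, max_len = 512, 64


def pad_or_truncate(token_ids, max_len, pad_id=0):
    """Hypothetical helper: cut or right-pad token_ids to exactly max_len entries.

    pad_id=0 is an assumed padding index; a real vocabulary may use another value.
    """
    clipped = token_ids[:max_len]
    return clipped + [pad_id] * (max_len - len(clipped))


# Toy batch of made-up token ids with lengths between 1 and 100.
toy_batch = [list(range(1, (i % 100) + 2)) for i in range(batch_size)]

# After padding/truncation, every sequence has the same length.
padded = [pad_or_truncate(seq, max_len) for seq in toy_batch]
print(len(padded), len(padded[0]))
```

With the original BERT setting of 512 instead of 64, each row would be eight times longer, which is why a shorter maximum length is a common choice for a small-scale demonstration.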