Base Model. 1. Pretrained BERT bert-base-multilingual-cased by default. 2. Encoder/Mapping: Single linear layer 768 -> 768. 3. Learning rate: 0.0001.
確定! 回上一頁