an optimizer with weight decay fixed that can be used to fine-tuned models, and ... replace AdamW with Adafactor optimizer = Adafactor( model.parameters(), ...
確定! 回上一頁