AdamW (PyTorch) ... replace AdamW with Adafactor: optimizer = Adafactor(model.parameters(), lr=1e-3, eps=(1e-30, 1e-3), ... AdamWeightDecay (TensorFlow) ...