Use PyTorch LayerNorm and improve weight init ... -12,7 +12,7 @@ import torch.nn.functional as F ... nn.init.xavier_uniform(p.data).
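The diff body is elided here, so the following is only a minimal sketch of what the commit title describes, not the commit's actual code. It assumes the change swaps a hand-written layer norm for the built-in `nn.LayerNorm` and applies Xavier-uniform initialization to weight matrices. `TinyBlock`, `init_weights`, and the layer sizes are hypothetical names and values chosen for illustration. The sketch calls `nn.init.xavier_uniform_` (trailing underscore), since the non-underscore `nn.init.xavier_uniform` seen in the hunk is deprecated in current PyTorch releases.

```python
import torch
import torch.nn as nn


class TinyBlock(nn.Module):
    """Hypothetical feed-forward block, used only to illustrate the idea."""

    def __init__(self, d_model: int = 16, d_ff: int = 32):
        super().__init__()
        self.ff1 = nn.Linear(d_model, d_ff)
        self.ff2 = nn.Linear(d_ff, d_model)
        # Built-in layer normalization over the last (feature) dimension.
        self.norm = nn.LayerNorm(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual connection followed by layer normalization.
        return self.norm(x + self.ff2(torch.relu(self.ff1(x))))


def init_weights(model: nn.Module) -> None:
    # Xavier init needs a fan-in and a fan-out, so only tensors with more
    # than one dimension are touched; biases and the LayerNorm scale/shift
    # keep their default values.
    for p in model.parameters():
        if p.dim() > 1:
            nn.init.xavier_uniform_(p)


torch.manual_seed(0)
model = TinyBlock()
init_weights(model)
out = model(torch.randn(4, 10, 16))  # (batch, sequence, features)
```

Filtering on `p.dim() > 1` is one common way to avoid calling Xavier initialization on 1-D parameters, where fan-in and fan-out are undefined.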