AdamW () An Adam variant proposed in “Decoupled Weight Decay Regularization” SparseAdam() A version of Adam suitable for sparse tensors Adamax() A variant of ...
確定! 回上一頁