See Notes for more details regarding sparse gradients. Variables: weight (Tensor) – the learnable weights of the module of shape (num_embeddings, embedding_dim) ...
確定! 回上一頁