... that can cause the softmax we will apply next to saturate. Note The torch.bmm() function performs a batch matrix-matrix product that simplifies.
確定! 回上一頁