vocab_size) ) — Prediction scores of the language modeling head (scores for each vocabulary token before SoftMax). hidden_states ( tuple(torch.FloatTensor) , ...
確定! 回上一頁