The output of the teacher model for any input x_i ∈ X is a vector of class probabilities P_T, computed for each class using the softmax ...
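As a rough illustration of how a softmax turns a model's raw class scores into a probability vector such as P_T, here is a minimal sketch. The source text is truncated, so everything here is an assumption: the function name, the example logits, and the optional temperature parameter (common in distillation work, but not confirmed by the excerpt) are hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Map raw class scores to probabilities that sum to 1.

    `temperature` is an assumed, optional parameter: values above 1
    flatten the distribution; 1.0 gives the ordinary softmax.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    # Subtract the maximum before exponentiating for numerical stability;
    # this does not change the resulting probabilities.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical teacher scores for one input x_i over three classes.
teacher_logits = [2.0, 1.0, 0.1]
p_t = softmax(teacher_logits)
print(p_t)
```

The entries of p_t are non-negative, sum to 1, and preserve the ordering of the input scores, which is what allows them to be read as per-class probabilities.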