It allows the model to learn a bidirectional representation of the sentence. Next sentence prediction (NSP): the model concatenates two masked ...
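As a rough illustration of the NSP idea, here is a minimal sketch of how sentence pairs could be assembled for such an objective. It assumes the common BERT-style setup in which about half of the pairs are truly consecutive and the other half use a randomly chosen sentence; the 50/50 split, the `[CLS]`/`[SEP]` layout, and the helper names `make_nsp_pairs` and `format_pair` are assumptions for illustration, not details taken from the excerpt. Masking is left out to keep the example short.

```python
import random


def make_nsp_pairs(sentences, rng=None):
    """Build (sentence_a, sentence_b, is_next) examples from an ordered list.

    Hypothetical helper: roughly half of the pairs keep the true next
    sentence, the rest substitute a random other sentence.
    """
    if len(sentences) < 3:
        raise ValueError("need at least 3 sentences to sample negatives")
    rng = rng or random.Random(0)
    pairs = []
    for i in range(len(sentences) - 1):
        if rng.random() < 0.5:
            # Positive example: sentence_b really follows sentence_a.
            pairs.append((sentences[i], sentences[i + 1], True))
        else:
            # Negative example: any sentence except a itself and its true successor.
            candidates = [j for j in range(len(sentences)) if j not in (i, i + 1)]
            j = rng.choice(candidates)
            pairs.append((sentences[i], sentences[j], False))
    return pairs


def format_pair(sentence_a, sentence_b):
    """Concatenate two sentences into one input string (assumed BERT-style layout)."""
    return f"[CLS] {sentence_a} [SEP] {sentence_b} [SEP]"


if __name__ == "__main__":
    doc = ["The cat sat down.", "It purred.", "Rain fell outside.", "The day ended."]
    for a, b, is_next in make_nsp_pairs(doc):
        print(is_next, format_pair(a, b))
```

The classifier would then be trained to predict the `is_next` label from the concatenated input.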