Training maximizes the dot-product score of sentence pairs that are trans- lations of each other at the expense of sampled negatives. We contrast using random ...
確定! 回上一頁