With an objective to learn word embeddings instead of modeling the word distribution, the NCE loss can be simplified to use negative sampling.
確定! 回上一頁