next token (~word) prediction = autoregressive language model · full name = Retrieval-Enhanced Transformer (RETRO) · introduced in DeepMind's ...
確定! 回上一頁