Faster Transformer Decoding: N-gram Masked Self-Attention. Ciprian Chelba · Mia Chen · Ankur Bapna · Noam Shazeer. ArXiv, Google Research (2020).