Masking plays an important role in the transformer. It serves two purposes. In the encoder and decoder, it zeroes attention outputs wherever there is just padding ...
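The padding case can be illustrated with a minimal sketch in plain Python. It is not taken from the source. It assumes that padding is marked with a token id of 0 and that masking is applied to the attention scores before the softmax, which is one common way to end up with zero weight on padded positions. The names `padding_mask`, `masked_softmax`, and `pad_id` are hypothetical.

```python
import math


def padding_mask(token_ids, pad_id=0):
    """Return True for real tokens and False for padding (pad_id is an assumption)."""
    return [tok != pad_id for tok in token_ids]


def masked_softmax(scores, mask):
    """Softmax over one row of attention scores, ignoring masked-out positions."""
    # Masked positions become -inf, so exp() assigns them exactly zero weight.
    masked = [s if keep else float("-inf") for s, keep in zip(scores, mask)]
    peak = max(masked)
    if peak == float("-inf"):
        # Every position is padding: return all zeros instead of dividing by zero.
        return [0.0] * len(scores)
    # Subtract the largest score for numerical stability.
    exps = [math.exp(s - peak) for s in masked]
    total = sum(exps)
    return [e / total for e in exps]


mask = padding_mask([5, 7, 0, 0])                    # two real tokens, two pads
weights = masked_softmax([1.0, 1.0, 3.0, 2.0], mask)
```

Here the two padded positions carry the highest raw scores, yet they receive no attention weight once the mask is applied, and the weight is split between the two real tokens.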