BigBird, a sparse attention mechanism. ... tfm.nlp.layers.BigBirdAttention ... This layer follows the paper "Big Bird: Transformers for Longer Sequences" ...
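As a rough illustration of what "sparse attention" means here: the Big Bird paper combines three patterns, a sliding window around each token, a few global tokens, and a few randomly chosen keys per query. The sketch below builds such a mask at the token level in plain Python. It is not the `tfm.nlp.layers.BigBirdAttention` implementation and does not use its API; the function name `bigbird_style_mask` and all of its parameters are hypothetical, chosen only for this example.

```python
import random


def bigbird_style_mask(seq_len, window=3, num_global=2, num_random=2, seed=0):
    """Return a seq_len x seq_len list of bools; True means query i may attend to key j.

    Hypothetical helper for illustration only (token-level, not the library's code).
    """
    rng = random.Random(seed)
    half = window // 2

    # Sliding window: each token attends to neighbours within `half` positions.
    mask = [[abs(i - j) <= half for j in range(seq_len)] for i in range(seq_len)]

    # Global tokens: the first `num_global` tokens attend to every position
    # and are attended to by every position.
    for g in range(min(num_global, seq_len)):
        for k in range(seq_len):
            mask[g][k] = True
            mask[k][g] = True

    # Random attention: each query also attends to a few randomly sampled keys.
    for i in range(seq_len):
        for j in rng.sample(range(seq_len), min(num_random, seq_len)):
            mask[i][j] = True

    return mask


mask = bigbird_style_mask(16)
# Fraction of allowed (query, key) pairs; full attention would give 1.0.
density = sum(sum(row) for row in mask) / (16 * 16)
```

With fixed `window`, `num_global`, and `num_random`, the number of allowed pairs per query stays roughly constant as `seq_len` grows, which is why this kind of pattern scales to longer sequences than full attention.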