Most of the models in Hugging Face Transformers are some version of BERT and thus not autoregressive; the only exceptions are decoder-only ...