Here we propose a simple network architecture, gMLP, based on MLPs with gating, and show that it can perform as well as Transformers in key ...
確定! 回上一頁