To address this problem, we propose an Attention-based Dropout Layer (ADL), which utilizes the self-attention mechanism to process the feature maps of the ...