Thanks! Sorry I didn't understand that. So 3 problems with sparse: it doesn't work with weight_decay, or with Adam, and it's slower here!
確定! 回上一頁