由 S Ioffe 著作 · 2015 · 被引用 34106 次 — Batch Normalization allows us to use much higher learning rates and be less careful about initialization. It also acts as a regularizer, ...
確定! 回上一頁