当学习率设置的较小,训练收敛较慢,需要更多的epoch才能到达一个较好的局部最小值 ... 学习率设为:0.1,0.01,0.001,0.0001等来观察网络初始阶段epoch的loss情况:.
確定! 回上一頁