摘要: Ref: Effect of batch size on training dynamics Don’t decay the learning rate increase the batch size We can often achieve the benefits of decaying the 阅读全文
posted @ 2021-08-16 16:12 郝壹贰叁 阅读(69) 评论(0) 推荐(0)