Summary: 1. On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima (Optional 2017 Northwestern University). Motivation: SGD and its variants. Theoretical properties:
posted @ 2022-08-09 18:08 by 撬动地球的coder