随笔档案「2016年12月1日」：Multi-armed Bandit Problem与增强学习的联系 ... - Shuzi_rank

2016年12月1日

摘要：选自《Reinforcement Learning: An Introduction》, version 2, 2016, Chapter2 https://webdocs.cs.ualberta.ca/~sutton/book/bookdraft2016sep.pdf 引言中是这样引出Chapte 阅读全文

posted @ 2016-12-01 11:23 Shuzi_rank 阅读(4198) 评论(0) 推荐(0)