2018 年 10月 22 日随笔档案 - 不吃腊肉的猫

2018年10月22日

Reinforcement Learning: An Introduction读书笔记(2)--多臂机

摘要： > 目录 < k-armed bandit problem Incremental Implementation Tracking a Nonstationary Problem Initial Values (*) Upper-Confidence-Bound Action Selection( 阅读全文

posted @ 2018-10-22 14:02 不吃腊肉的猫阅读(559) 评论(0) 推荐(0)

Reinforcement Learning: An Introduction读书笔记(1)--Introduction

摘要： > 目录 < learning & intelligence 的基本思想 RL的定义、特点、四要素与其他learning methods、evolutionary methods的比较例子(井字棋 tic-tac-toe)及早期发展史 > 笔记 < learning & intelligen 阅读全文

posted @ 2018-10-22 14:02 不吃腊肉的猫阅读(598) 评论(0) 推荐(0)

不吃腊肉的猫

公告