2018 年 10月 23 日随笔档案 - 不吃腊肉的猫

2018年10月23日

Reinforcement Learning: An Introduction读书笔记(3)--finite MDPs

摘要： > 目录 < Agent–Environment Interface Goals and Rewards Returns and Episodes Policies and Value Functions Optimal Policies and Optimal Value Functions > 阅读全文

posted @ 2018-10-23 16:00 不吃腊肉的猫阅读(413) 评论(0) 推荐(0)

不吃腊肉的猫

公告