随笔分类 - Machine Learning

Machine Learning - Lecture 16

摘要：Reinforcement Learning (R.L.) ① MDPs (Markov Decision Processes) ② Value Functions ③ Value Iteration ④ Policy Iteration (both ③ and ④ are algorithms f 阅读全文

posted @ 2016-12-12 21:18 DIMSUMBOY 阅读(176) 评论(0) 推荐(0)

DIMSUMBOY

茶餘飯後整件點心

随笔分类 - Machine Learning

公告

DIMSUMBOY

茶餘飯後 整件點心

随笔分类 - Machine Learning

公告

茶餘飯後整件點心