Post category - Machine Learning

Summary: Reinforcement Learning (R.L.) ① MDPs (Markov Decision Processes) ② Value Functions ③ Value Iteration ④ Policy Iteration (both ③ and ④ are algorithms f…
posted @ 2016-12-12 21:18 DIMSUMBOY Views(176) Comments(0) Recommended(0)