January 2019 Essay Archive - 林小奚

Differences and Connections Between MDP and RL

Summary: The general relationship between RL and MDP is that RL is a framework for solving problems that can be expressed as MDPs. DP requires you to fully des...

posted @ 2019-01-23 06:14 林小奚 Views (1053) Comments (0) Recommendations (0)

An Inventory of the Papers I Currently Have on Hand

Summary: 1. "Variational Inference": given the observation x, find the posterior distribution p(z | x, \alpha) of the hidden variable z. The main idea is to use a distribution q(z_{1:m} | v) to approximate this posterior di...

posted @ 2019-01-14 09:39 林小奚 Views (142) Comments (0) Recommendations (0)

POMDP-Related: A Brief Summary of Several Papers from Dr. Wu

Summary: Estimating the use of higher order theory of mind using computational agents 1. a pedestrian who wants to cross a street behaves differently depending...

posted @ 2019-01-14 09:33 林小奚 Views (259) Comments (0) Recommendations (0)

林小奚

Since you and I are scholars, we should simply do our best as scholars; there is no need to dwell on the rest.

