摘要:
Stadie, Bradly C., Sergey Levine, and Pieter Abbeel. "Incentivizing exploration in reinforcement learning with deep predictive models." arXiv preprint 阅读全文
posted @ 2017-08-13 19:00
Shiyu_Huang
阅读(426)
评论(0)
推荐(0)
摘要:
1.Delayed, sparse reward(feedback), Long-term planning Hierarchical Deep Reinforcement Learning, Sub-goal, SAMDP, optoins, Thompson sampling, Boltzman 阅读全文
posted @ 2017-08-13 15:47
Shiyu_Huang
阅读(260)
评论(0)
推荐(0)
摘要:
Zahavy, Tom, Nir Ben-Zrihem, and Shie Mannor. "Graying the black box: Understanding DQNs." International Conference on Machine Learning. 2016. 这篇论文想要做 阅读全文
posted @ 2017-08-13 14:56
Shiyu_Huang
阅读(379)
评论(0)
推荐(0)


浙公网安备 33010602011771号