摘要:
Markov Decision Processes (MDPs) Named after Andrey Markov, known at least as early as 1950s.(cf. Bellman 1957) Discrete time stochastic control process. State: Action: Reward: Markov property: given ... 阅读全文
posted @ 2011-05-05 20:39
justin_s
阅读(480)
评论(0)
推荐(0)

浙公网安备 33010602011771号