Summary:
Bellman equations \[v_{\pi}(s)=\sum_{a,s'}\pi(a|s)p(s'|s,a)\{r(s,a,s')+\gamma v_\pi(s')\} \]\[q_\pi(s,a)=\sum_{s'}p(s'|s,a)\{r(s,a,s')+\gamma \sum_{a'}\pi(a'|s')q…
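As a rough illustration of the state-value equation above, the following sketch applies it repeatedly as an update rule (iterative policy evaluation) until the values stop changing. The two-state MDP, the uniform policy, and every name here (`P`, `PI`, `reward`, `q_from_v`, `evaluate_policy`) are assumptions made up for this example and do not come from the post. The helper `q_from_v` uses the standard identity that the action value is the expected one-step reward plus the discounted value of the next state.

```python
# Hypothetical 2-state, 2-action MDP, invented purely for illustration.
GAMMA = 0.9
STATES = [0, 1]
ACTIONS = [0, 1]

# P[s][a][s2] = p(s2 | s, a): action 0 stays put, action 1 switches state.
P = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]

# PI[s][a] = pi(a | s): a uniform random policy (an assumption).
PI = [[0.5, 0.5], [0.5, 0.5]]


def reward(s, a, s2):
    """r(s, a, s'): 1 for landing in state 1, otherwise 0 (an assumption)."""
    return 1.0 if s2 == 1 else 0.0


def q_from_v(v, s, a):
    """sum_{s'} p(s'|s,a) * (r(s,a,s') + gamma * v(s'))."""
    return sum(
        P[s][a][s2] * (reward(s, a, s2) + GAMMA * v[s2]) for s2 in STATES
    )


def evaluate_policy(tol=1e-10, max_sweeps=10_000):
    """Sweep v(s) <- sum_a pi(a|s) * q(s, a) until the largest change < tol."""
    v = [0.0 for _ in STATES]
    for _ in range(max_sweeps):
        new_v = [
            sum(PI[s][a] * q_from_v(v, s, a) for a in ACTIONS) for s in STATES
        ]
        delta = max(abs(x - y) for x, y in zip(new_v, v))
        v = new_v
        if delta < tol:
            break
    return v


v = evaluate_policy()
q = [[q_from_v(v, s, a) for a in ACTIONS] for s in STATES]
print("v:", v)
print("q:", q)
```

For this toy problem the fixed point can be checked by hand: under the uniform policy both states satisfy v = 0.5 + 0.9 v, so the printed state values should approach 5, and averaging each row of `q` under the policy should recover the corresponding entry of `v`.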
posted @ 2025-05-03 10:17
-Z00-