摘要:
make compromise between learnt policy and minimal cost! π hat is using states π theta is using observations 阅读全文
posted @ 2018-05-27 23:01
ecoflex
阅读(198)
评论(0)
推荐(0)
摘要:
MPC means replan every step Every N step, rebuild the dynamic model 阅读全文
posted @ 2018-05-27 18:15
ecoflex
阅读(247)
评论(0)
推荐(0)

浙公网安备 33010602011771号