随笔档案「2024年2月14日」：Prioritized Experience Replay ... - initial_h

2024年2月14日

摘要：发表时间：2016（ICLR 2016）文章要点：这篇文章提出了很经典的experience replay的方法PER，通过temporal-difference (TD) error来给采样赋权重（Sequences associated with rewards appear to be re 阅读全文

posted @ 2024-02-14 08:29 initial_h 阅读(125) 评论(0) 推荐(0)

initial_h

https://github.com/initial-h

公告