Post archive for August 28, 2021: Reinforcement Learning as One Big Sequen... - initial_h

August 28, 2021

Reinforcement Learning as One Big Sequence Modeling Problem

Summary: **Published:** 2021 **Key points:** This paper treats RL as a sequence modeling problem and fits the entire sequence directly with a transformer (treats states, actions, and rewards as simply a stream of data ...

posted @ 2021-08-28 05:31 initial_h

initial_h

https://github.com/initial-h
