Post archive for May 2022 - juneyiiii

A quadrotor_env usable for reinforcement learning

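Summary: A quadrotor_env usable for reinforcement learning. Third-party environments that follow the Gym API (Gym is a standard API for reinforcement learning, together with a collection of reference environments). GymFC is a flight control tuning framework focused on attitude control. GymFC was first introduced in the manuscript "Reinforcement Learning for UAV Attitude Control", in which the simulator was used to synthesize neuro-flight attitude controllers whose performance exceeds...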

posted @ 2022-05-19 16:06 juneyiiii Views (367) Comments (0) Recommendations (0)

Deep Learning ("Flower Book") study notes: Chapter 6, Deep Feedforward Networks

Summary: Deep feedforward networks. A deep feedforward network, also called a feedforward neural network or a multilayer perceptron (MLP), is the quintessential deep learning model.

posted @ 2022-05-16 20:51 juneyiiii Views (102) Comments (0) Recommendations (0)

Representation and General Value Functions: General Value Functions (GVFs)

Summary: https://sites.ualberta.ca/~pilarski/docs/theses/Sherstan_Craig_D_202009_PhD.pd (link to the original) General value functions (GVFs) make two relaxations to the value f...

posted @ 2022-05-12 12:59 juneyiiii Views (70) Comments (0) Recommendations (0)

Universal Value Function Approximators

Summary: Universal Value Function Approximators. I previously read the hindsight experience replay (HER) paper, whose core idea comes from this paper, Universal Value Function Approximators...

posted @ 2022-05-09 14:26 juneyiiii Views (290) Comments (0) Recommendations (0)
