凯鲁嘎吉 - 博客园

2022年2月14日

Deep Reinforcement Learning Hands-On——Tabular Learning and the Bellman Equation

摘要： Deep Reinforcement Learning Hands-On——Tabular Learning and the Bellman Equation 作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 更多请看：Reinforcement Lea 阅读全文

posted @ 2022-02-14 10:04 凯鲁嘎吉阅读(346) 评论(0) 推荐(0)

2022年2月11日

用Python绘制冬奥会吉祥物冰墩墩

摘要：用Python绘制冬奥会吉祥物冰墩墩作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 想要保存Python中turtle模块的图片为jpg格式，前提需要在https://ghostscript.com/releases/gsdnld.html下载gs9 阅读全文

posted @ 2022-02-11 15:06 凯鲁嘎吉阅读(6186) 评论(0) 推荐(1)

2022年1月10日

Hands-On Reinforcement Learning With Python——Temporal Difference Learning

摘要： Hands-On Reinforcement Learning With Python——Temporal Difference Learning 作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 更多请看：Reinforcement Learning 阅读全文

posted @ 2022-01-10 09:58 凯鲁嘎吉阅读(328) 评论(0) 推荐(0)

2022年1月4日

Windows下OpenAI gym环境的使用

摘要： Windows下OpenAI gym环境的使用作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 1. gym环境搭建用到的关键语句 1.1 准备工作首先创建一个虚拟环境conda create -n RL python=3.8，激活activate 阅读全文

posted @ 2022-01-04 01:06 凯鲁嘎吉阅读(1697) 评论(0) 推荐(1)

2021年11月23日

Meta-RL——Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables

摘要： Meta-RL——Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables 作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 这篇博客是“Ef 阅读全文

posted @ 2021-11-23 13:04 凯鲁嘎吉阅读(1336) 评论(10) 推荐(2)

2021年11月18日

深度聚类算法研究综述(A Survey of Deep Clustering Algorithms)

摘要：深度聚类算法研究综述(A Survey of Deep Clustering Algorithms) 作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 深度聚类的博客写了几篇，也曾总结过专门的一篇博客：深度聚类算法，但并不全面。这篇博客对现有的深度聚类算阅读全文

posted @ 2021-11-18 20:23 凯鲁嘎吉阅读(28219) 评论(9) 推荐(15)

2021年11月16日

RL——Deep Reinforcement Learning amidst Continual/Lifelong Structured Non-Stationarity

摘要： RL——Deep Reinforcement Learning amidst Continual/Lifelong Structured Non-Stationarity 作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 这篇博客简要回顾论文“Deep 阅读全文

posted @ 2021-11-16 17:26 凯鲁嘎吉阅读(820) 评论(0) 推荐(0)

2021年11月12日

多元/多维高斯/正态分布概率密度函数推导 (Derivation of the Multivariate/Multidimensional Normal/Gaussian Density)

摘要：多元/多维高斯/正态分布概率密度函数推导 (Derivation of the Multivariate/Multidimensional Normal/Gaussian Density) 作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 当年在学《概率阅读全文

posted @ 2021-11-12 08:45 凯鲁嘎吉阅读(16237) 评论(0) 推荐(2)

2021年11月10日

Meta-RL——Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices

摘要： Meta-RL——Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices 作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 这篇阅读全文

posted @ 2021-11-10 15:22 凯鲁嘎吉阅读(388) 评论(0) 推荐(0)

2021年11月5日

元学习——Meta-Amortized Variational Inference and Learning

摘要：元学习——Meta-Amortized Variational Inference and Learning 作者：凯鲁嘎吉 - 博客园 http://www.cnblogs.com/kailugaji/ 这篇博客是论文“Meta-Amortized Variational Inference an 阅读全文

posted @ 2021-11-05 16:00 凯鲁嘎吉阅读(1494) 评论(0) 推荐(1)