随笔档案「2018年8月」 - initial_h

375. Guess Number Higher or Lower II (Python)

摘要："375. Guess Number Higher or Lower II" Description We are playing the Guess Game. The game is as follows: I pick a number from 1 to n. You have to gue 阅读全文

posted @ 2018-08-26 10:31 initial_h 阅读(595) 评论(0) 推荐(0)

Gumbel-Softmax Trick和Gumbel分布

摘要：之前看MADDPG论文的时候，作者提到在离散的信息交流环境中，使用了Gumbel-Softmax estimator。于是去搜了一下，发现该技巧应用甚广，如深度学习中的各种GAN、强化学习中的A2C和MADDPG算法等等。只要涉及在离散分布上运用重参数技巧时(re-parameterization) 阅读全文

posted @ 2018-08-13 17:03 initial_h 阅读(64881) 评论(20) 推荐(19)

《Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments》论文解读

摘要："MADDPG原文链接" "OpenAI blog" "DDPG链接" 目录 "一、摘要" "二、效果展示" "三、方法细节" "问题分析" "具体方法" "伪代码" "网络结构" "四、实验结果" "五、总结" "附录" "Proposition 1" 一、摘要文章探索了多智能体(multi a 阅读全文

posted @ 2018-08-06 13:15 initial_h 阅读(19467) 评论(16) 推荐(7)

initial_h

https://github.com/initial-h

08 2018 档案

公告