Archive: August 2018

Summary: "375. Guess Number Higher or Lower II" Description We are playing the Guess Game. The game is as follows: I pick a number from 1 to n. You have to gue... Read more
posted @ 2018-08-26 10:31 initial_h Views (595) Comments (0) Recommendations (0)
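Summary: While reading the MADDPG paper, I noticed that the authors use the Gumbel-Softmax estimator in environments with discrete communication. A quick search showed that the trick is widely used, for example in various GANs in deep learning and in reinforcement learning algorithms such as A2C and MADDPG. Whenever the re-parameterization trick is applied to a discrete distribution... Read more
posted @ 2018-08-13 17:03 initial_h Views (64881) Comments (20) Recommendations (19)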
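Summary: "MADDPG paper link" "OpenAI blog" "DDPG link" Contents "1. Abstract" "2. Results Showcase" "3. Method Details" "Problem Analysis" "Method" "Pseudocode" "Network Architecture" "4. Experimental Results" "5. Conclusion" "Appendix" "Proposition 1" 1. Abstract The paper explores multi-agent (multi a... Read more
posted @ 2018-08-06 13:15 initial_h Views (19467) Comments (16) Recommendations (7)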