[5-Minute Paper] Continuous Control With Deep Reinforcement Learning

Published 2023-08-05 from Jilin



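Paper title: Continuous Control With Deep Reinforcement Learning

What problem does it solve?

This paper brings Deep Q-Learning into the Deterministic Policy Gradient (DPG) algorithm. If you already know DPG, the paper essentially uses DQN techniques to improve how DPG's action-value function (the critic) is learned. It also removes a limitation of DQN: because DQN has to find the action that maximizes the action-value, it can only be applied to discrete action spaces.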
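To make the discrete-action limitation concrete, here is a minimal sketch. The function `q_value`, the toy policy `actor`, and the action grid are all hypothetical stand-ins chosen for illustration, not anything taken from the paper. A DQN-style greedy step has to enumerate a finite set of actions, while a deterministic actor simply outputs a continuous action.

```python
def q_value(state, action):
    """Hypothetical action-value function, peaked at action = 2 * state."""
    return -(action - 2.0 * state) ** 2


def greedy_discrete(state, actions):
    """DQN-style greedy choice: enumerate a finite action set, take the argmax."""
    return max(actions, key=lambda a: q_value(state, a))


def actor(state, theta=2.0):
    """Toy deterministic policy mu(s) = theta * s, standing in for an actor network."""
    return theta * state


state = 0.3
coarse_grid = [-1.0, -0.5, 0.0, 0.5, 1.0]

a_grid = greedy_discrete(state, coarse_grid)  # limited to values on the grid
a_actor = actor(state)                        # any real-valued action

print("grid action:", a_grid, "Q =", q_value(state, a_grid))
print("actor action:", a_actor, "Q =", q_value(state, a_actor))
```

In this toy setting the grid can only approximate the best action, and a finer grid costs more evaluations per step, which grows quickly as the number of action dimensions increases. The actor avoids the enumeration altogether.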

Background

It is essentially a combination of two papers: the DQN paper and the DPG paper.

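What method does it use?

I know DDPG too well and really do not feel like writing much more about it, so I will simply append the pseudocode.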
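As a rough companion to the pseudocode, the sketch below walks through the main pieces of one DDPG update: a replay buffer, the TD target computed from target parameters, a critic step, a deterministic policy gradient step for the actor, and the soft target update. It is a toy under stated assumptions: the scalar state and action, the linear `actor` and `critic`, and all function names are hypothetical and replace the neural networks used in the paper. The values of `GAMMA` and `TAU` follow the settings commonly quoted for DDPG.

```python
import random
from collections import deque

GAMMA = 0.99   # discount factor
TAU = 0.001    # soft-update rate for the target parameters


class ReplayBuffer:
    """Fixed-size store of (s, a, r, s_next, done) transitions."""

    def __init__(self, capacity):
        self.data = deque(maxlen=capacity)

    def add(self, transition):
        self.data.append(transition)

    def sample(self, batch_size):
        return random.sample(list(self.data), batch_size)


def actor(theta, s):
    """Toy deterministic policy mu(s) = theta * s (assumption, not a network)."""
    return theta * s


def critic(w, s, a):
    """Toy linear critic Q(s, a) = w[0]*s + w[1]*a + w[2]."""
    return w[0] * s + w[1] * a + w[2]


def td_target(r, s_next, done, target_theta, target_w):
    """y = r + gamma * Q'(s', mu'(s')), with no bootstrapping on terminal steps."""
    if done:
        return r
    return r + GAMMA * critic(target_w, s_next, actor(target_theta, s_next))


def critic_step(w, batch, target_theta, target_w, lr):
    """One gradient-descent step on the mean squared TD error."""
    grad = [0.0, 0.0, 0.0]
    for s, a, r, s_next, done in batch:
        err = critic(w, s, a) - td_target(r, s_next, done, target_theta, target_w)
        for i, feat in enumerate((s, a, 1.0)):
            grad[i] += 2.0 * err * feat / len(batch)
    return [wi - lr * gi for wi, gi in zip(w, grad)]


def actor_step(theta, w, batch, lr):
    """Deterministic policy gradient: ascend the mean of dQ/da * dmu/dtheta."""
    # For this toy model dQ/da = w[1] and dmu/dtheta = s.
    grad = sum(w[1] * s for s, *_ in batch) / len(batch)
    return theta + lr * grad


def soft_update(target, online, tau=TAU):
    """target <- tau * online + (1 - tau) * target, element by element."""
    return [tau * o + (1.0 - tau) * t for t, o in zip(target, online)]
```

A training loop would repeatedly act with exploration noise, add the transition to the buffer, sample a minibatch, then call `critic_step`, `actor_step`, and `soft_update` in that order. The slow-moving target parameters are what keep the TD target from chasing the critic it is used to train.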

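What results does it achieve?

The experimental results are shown in the figure below:

Publication information? Author information?

This paper was published at ICLR 2016. The first author, Timothy P. Lillicrap, is a Research Scientist at Google DeepMind.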

Research focuses on machine learning and statistics for optimal control and decision making, as well as using these mathematical frameworks to understand how the brain learns. In recent work, I’ve developed new algorithms and approaches for exploiting deep neural networks in the context of reinforcement learning, and new recurrent memory architectures for one-shot learning. Applications of this work include approaches for recognizing images from a single example, visual question answering, deep learning for robotics problems, and playing games such as Go and StarCraft. I’m also fascinated by the development of deep network models that might shed light on how robust feedback control laws are learned and employed by the central nervous system.

Personal homepage: http://contrastiveconvergence.net/~timothylillicrap/index.php

