【5分钟 Paper】Continuous Control With Deep Reinforcement Learning

简介: 【5分钟 Paper】Continuous Control With Deep Reinforcement Learning
  • 论文题目:Continuous Control With Deep Reinforcement Learning

所解决的问题?

  这篇文章将Deep Q-Learning运用到Deterministic Policy Gradient算法中。如果了解DPG的话,那这篇文章就是引入DQN改进了一下DPGstate value function。解决了DQN需要寻找maximizes action-value只能运用于离散动作空间 的局限。

背景

  其实就是这两篇文章的组合:

所采用的方法?

  这个DDPG我太熟悉,我实在不想再写啥了,附录一个伪代码吧:

取得的效果?

  实验结果如下图所示:

所出版信息?作者信息?

  这篇文章是ICLR2016上面的一篇文章。第一作者TimothyP.LillicrapGoogle DeepMindresearch Scientist

  Research focuses on machine learning and statistics for optimal control and decision making, as well as using these mathematical frameworks to understand how the brain learns. In recent work, I’ve developed new algorithms and approaches for exploiting deep neural networks in the context of reinforcement learning, and new recurrent memory architectures for one-shot learning. Applications of this work include approaches for recognizing images from a single example, visual question answering, deep learning for robotics problems, and playing games such as Go and StarCraft. I’m also fascinated by the development of deep network models that might shed light on how robust feedback control laws are learned and employed by the central nervous system.

相关文章
|
机器学习/深度学习 算法
【RLchina第四讲】Model-Based Reinforcement Learning(下)
【RLchina第四讲】Model-Based Reinforcement Learning(下)
158 0
|
机器学习/深度学习 资源调度 算法
【RLchina第四讲】Model-Based Reinforcement Learning(上)
【RLchina第四讲】Model-Based Reinforcement Learning(上)
525 0
|
机器学习/深度学习 人工智能 算法
【5分钟 Paper】Reinforcement Learning with Deep Energy-Based Policies
【5分钟 Paper】Reinforcement Learning with Deep Energy-Based Policies
112 0
|
机器学习/深度学习 编解码 算法
【5分钟 Paper】Dueling Network Architectures for Deep Reinforcement Learning
【5分钟 Paper】Dueling Network Architectures for Deep Reinforcement Learning
105 0
|
机器学习/深度学习 存储 自然语言处理
【论文解读】A review on the attention mechanism of deep learning
注意力已经成为深度学习中最重要的概念之一。本文旨在对近年来提出的最新注意力模型作概述。我们建立了一个较为通用的模型,此外根据四个标准即注意力的柔软性、输入特征的形式、输入表示和输出表示来对当前注意力模型进行分类。最后讨论了注意力在深度学习可解释上的作用。
513 0
|
机器学习/深度学习 传感器 数据挖掘
Review on the Recent Welding Research with Application of CNN-Based Deep Learning
Guo等人16)将CNN应用于线管制造过程中的电阻焊,提出了一种正常焊缝与缺陷焊缝的分类模型,准确率达到99.01%。
96 0
|
机器学习/深度学习 算法 数据挖掘
A Generative Adversarial Network-based Deep Learning Method for Low-quality Defect ImageReconstructi
本文提出了一种基于生成对抗网络 (GAN) 的 DL 方法,用于低质量缺陷图像识别。 GAN用于重建低质量缺陷图像,并建立VGG16网络识别重建图像。
134 0
|
机器学习/深度学习 人工智能 自然语言处理
18 Issues in Current Deep Reinforcement Learning from ZhiHu
深度强化学习的18个关键问题   from: https://zhuanlan.zhihu.com/p/32153603     85 人赞了该文章 深度强化学习的问题在哪里?未来怎么走?哪些方面可以突破? 这两天我阅读了两篇篇猛文A Brief Survey of Deep Reinforcement Learning 和 Deep Reinforcement Learning: An Overview ,作者排山倒海的引用了200多篇文献,阐述强化学习未来的方向。
2063 0
|
人工智能 Go
The Differences between AI, Machine Learning, and Deep Learning
In the past two years, the growth of artificial intelligence and machine learning has been immense. Machine learning, as a type of artificial intellig
2482 0