RL Implementations

내 맘대로 만드는 RL 구현체

Deep Q Network

Playing Atari with Deep Reinforcement Learning. NIPS 2013.

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.

Deep Deterministic Policy Gradient

Continuous control with deep reinforcement learning. ICRL 2016.

We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic problems such as cartpole swing-up, dexterous manipulation, legged locomotion and car driving. Our algorithm is able to find policies whose performance is competitive with those found by a planning algorithm with full access to the dynamics of the domain and its derivatives. We further demonstrate that for many of the tasks the algorithm can learn policies end-to-end: directly from raw pixel inputs.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
__pycache__		__pycache__
DDPG.ipynb		DDPG.ipynb
DQN.ipynb		DQN.ipynb
README.md		README.md
ddpg.py		ddpg.py
dqn.py		dqn.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL Implementations

Deep Q Network

Deep Deterministic Policy Gradient

About

Releases

Packages

Languages

Jaewoopudding/rl_implementation

Folders and files

Latest commit

History

Repository files navigation

RL Implementations

Deep Q Network

Deep Deterministic Policy Gradient

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages