OpenAI Lab

NOTICE: Please use the next version, SLM-Lab.

An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.

OpenAI Lab is created to do Reinforcement Learning (RL) like science - theorize, experiment. It provides an easy interface to OpenAI Gym and Keras, with an automated experimentation and evaluation framework.

Features

Unified RL environment and agent interface using OpenAI Gym, Tensorflow, Keras, so you can focus on developing the algorithms.
Core RL algorithms implementations, with reusable modular components for developing deep RL algorithms.
An experimentation framework for running hundreds of trials of hyperparameter optimizations, with logs, plots and analytics for testing new RL algorithms. Experimental settings are stored in standardized JSONs for reproducibility and comparisons.
Automated analytics of the experiments for evaluating the RL agents and environments, and to help pick the best solution.
The Fitness Matrix, a table of the best scores of RL algorithms v.s. the environments; useful for research.

With OpenAI Lab, we could focus on researching the essential elements of reinforcement learning such as the algorithm, policy, memory, and parameter tuning. It allows us to build agents efficiently using existing components with the implementations from research ideas. We could then test the research hypotheses systematically by running experiments.

Read more about the research problems the Lab addresses in Motivations. Ultimately, the Lab is a generalized framework for doing reinforcement learning, agnostic of OpenAI Gym and Keras. E.g. Pytorch-based implementations are on the roadmap.

Implemented Algorithms

A list of the core RL algorithms implemented/planned.

To see their scores against OpenAI gym environments, go to Fitness Matrix.

algorithm	implementation	eval score (pending)
DQN	DQN	-
Double DQN	DoubleDQN	-
Dueling DQN	-	-
Sarsa	DeepSarsa	-
Off-Policy Sarsa	OffPolicySarsa	-
PER (Prioritized Experience Replay)	PrioritizedExperienceReplay	-
CEM (Cross Entropy Method)	next	-
REINFORCE	-	-
DPG (Deterministic Policy Gradient) off-policy actor-critic	ActorCritic	-
DDPG (Deep-DPG) actor-critic with target networks	DDPG	-
A3C (asynchronous advantage actor-critic)	-	-
Dyna	next	-
TRPO	-	-
Q*(lambda)	-	-
Retrace(lambda)	-	-
Neural Episodic Control (NEC)	-	-
EWC (Elastic Weight Consolidation)	-	-

Run the Lab

Next, see Installation and jump to Quickstart.

Timelapse of OpenAI Lab, solving CartPole-v0.

Name		Name	Last commit message	Last commit date
Latest commit History 1,556 Commits
.github		.github
bin		bin
config		config
data		data
rl		rl
test		test
.gitignore		.gitignore
.snyk		.snyk
Gruntfile.js		Gruntfile.js
LICENSE		LICENSE
README.md		README.md
circle.yml		circle.yml
environment.yml		environment.yml
main.py		main.py
package.json		package.json
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenAI Lab

Features

Implemented Algorithms

Run the Lab

About

Releases 6

Packages

Contributors 3

Languages

License

kengz/openai_lab

Folders and files

Latest commit

History

Repository files navigation

OpenAI Lab

Features

Implemented Algorithms

Run the Lab

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 6

Packages 0

Contributors 3

Languages

Packages