Whats and Whys

What?

AgentNet is a lightweight library for deep reinforcement learning using Theano+Lasagne. It also provides support for attentive and long-term memory networks.

The main objectives are:

Easy prototyping for reinforcement learning architectures
Support for arbitrary recurrent, attentive and long-term memory networks
Ease of experiment conduction and reproducibility
Full integration with Lasagne and Theano
[next step - full integration with TensorFlow]

Why?

[long story short: create a platform to play with *QN, attentive and LTM architectures without spending months reading code]

[short story long:

The last several years have marked the rediscovery of neural networks applied to Reinforcement Learning domain. The idea has first been introduced in early 90's [0] or even earlier, but was mostly forgotten soon afterwards.

Years later, these methods were reborn under Deep Learning sauce and popularized by Deepmind [1,2]. Several other researchers have already jumped into the domain with their architectures [3,4] and even dedicated playgrounds [5] to play with them.

The problem is that all these models exist in their own problem setup and implementation bubbles. Simply comparing your new architecture the ones you know requires

10% implementing architecture
20% implementing experiment setup
70% reimplementing all the other network architectures

This process is not only inefficient, but also very unstable, since a single mistake while implementing 'other' architecture can lead to incorrect results.

So here we are, attempting to build yet another bridge between eager researchers [primarily ourselves so far] and deep reinforcement learning.

The key objective is to make it easy to build new architectures and test is against others on a number of problems. The easier it is to reproduce the experiment setup, the simpler it is to architect something new and wonderful, the quicker we get to solutions directly applicable to real world problems.

]

[0] an dusty old journal issue - https://books.google.ru/books?id=teHhVHk3a54C&printsec=frontcover#v=onepage&q&f=false
[1] DQN by DeepMind - http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html
[2] DQN explained - https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf
[4] deep recurrent - http://arxiv.org/abs/1507.06527
[3] attentive DRQN - http://arxiv.org/pdf/1512.01693.pdf
[5] MazeBaze by Facebook - http://arxiv.org/pdf/1511.07401.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Whats and Whys

What?

Why?

Clone this wiki locally