
Added Policy Gradient Tutorial #82

Open · wants to merge 6 commits into base: tutorials
Conversation

shreyas-kowshik (Contributor)

**Description**
Implementation of vanilla Monte Carlo Policy Gradients on the CartPole-v0 environment, added as a tutorial.

**Tests**
Run the script.
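For context, a minimal sketch of the Monte Carlo policy gradient (REINFORCE) loss this tutorial implements; the names (`policy`, `pg_loss`, the layer sizes) are illustrative, not the exact tutorial code:

```julia
using Flux, Statistics

# Illustrative policy network for CartPole-v0: 4 observations in, 2 action probabilities out.
policy = Chain(Dense(4, 128, relu), Dense(128, 2), softmax)

# REINFORCE loss: negative log-probability of the actions actually taken,
# weighted by their discounted returns G_t.
function pg_loss(states, actions, returns)
    probs = policy(states)                       # one column of action probabilities per step
    logpi = log.(sum(probs .* actions, dims=1))  # log π(a_t | s_t), with `actions` one-hot
    return mean(-logpi .* returns)
end
```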

return mean(-logpi .* A_t)
end

opt = ADAM(params(policy),η)
Contributor


`opt = ADAM(params(policy), η)` could be `opt = ADAM(η)`, in accordance with the new optimizer API.
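A sketch of the suggested change (assuming the Flux 0.8-style optimiser API; `policy` and `η` as in the tutorial):

```julia
# Old API: the optimiser is bound to the parameters at construction time.
# opt = ADAM(params(policy), η)

# New API: the optimiser only holds its hyperparameters; parameters and
# gradients are supplied later, at update time.
opt = ADAM(η)
```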


G_t = γ*G_t + r

l = l .+ loss(state,act,G_t)
Contributor


Broadcasting is not required here, since the value being added is a scalar.
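That is, since `loss(state, act, G_t)` returns a scalar, plain addition is enough:

```julia
l = l + loss(state, act, G_t)   # or simply: l += loss(state, act, G_t)
```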


l = l .+ loss(state,act,G_t)
Flux.back!(loss(state,act,G_t))
opt()
tejank10 (Contributor) commented Jan 17, 2019


WRT the new Optimizer API, this will become `update!(opt, params(model))`.

shreyas-kowshik (Contributor, Author)

@tejank10

`update!(opt, params(policy))`

does not find a matching method candidate.
Tried using

grads = Tracker.gradient(() -> loss(state,act,G_t), params(policy))

for p in params(policy)
  update!(opt, p, grads[p])
end

but even this throws errors.
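For reference, the pattern documented for the Tracker-based optimiser API at the time looked roughly like the sketch below (an assumption about the Flux version in use, roughly 0.8). One possible cause of the error is that the `update!` in scope is Tracker's two-argument method rather than the optimiser one, so qualifying the call may help:

```julia
# Sketch only: compute gradients once, then apply the optimiser per parameter.
ps = params(policy)
gs = Tracker.gradient(() -> loss(state, act, G_t), ps)

for p in ps
    # Qualified call: Flux.Optimise.update!(opt, p, grad) applies the optimiser step,
    # whereas Tracker's plain update!(x, Δ) has no (opt, p, grad) method.
    Flux.Optimise.update!(opt, p, gs[p])
end
```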

DhairyaLGandhi (Member) commented Jan 31, 2019

Please add a Project.toml and Manifest.toml as well, so it is easier to standardize the environment.
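For example, one way to generate the two files from the tutorial's directory (the exact dependency list is whatever the script imports; `Gym` below is only a placeholder for the environment package):

```julia
using Pkg
Pkg.activate(".")     # creates/uses ./Project.toml for this tutorial
Pkg.add("Flux")       # add each package the script imports
# Pkg.add("Gym")      # placeholder: whichever package provides CartPole-v0
# Pkg writes both Project.toml and Manifest.toml into the activated directory.
```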

shreyas-kowshik (Contributor, Author)

@dhairyagandhi96 Added the files

@@ -0,0 +1,195 @@
# ***Generative Adversarial Network Tutorial***
Member


Can we just use normal headings for these rather than the extra formatting / html tags?

MikeInnes (Member)

@tejank10 are you happy with the changes made here, or is there more to do?

shreyas-kowshik (Contributor, Author)

@MikeInnes Thanks for the reply. I apologize for making a few errors before. The DCGAN code should not have been included in this PR. There is a separate PR for that. I have corrected it by removing the GAN code. The changes you mentioned for the GAN part will be updated in the respective PR.

shreyas-kowshik (Contributor, Author) commented Mar 26, 2019

@tejank10 I have made the requested changes. Sorry for delaying this for so long; I got into other work and did not fix the errors that were coming up. The changes are complete now. I have also added functions to normalize the discounted rewards, which should aid in training the network.
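For reference, normalising the discounted returns typically looks something like the sketch below (illustrative only, not necessarily the exact functions added in this PR):

```julia
using Statistics

# Discounted returns G_t = r_t + γ * G_{t+1} for one episode,
# normalised to zero mean and unit variance for more stable training.
function normalised_returns(rewards, γ)
    G = zeros(length(rewards))
    running = 0.0
    for t in length(rewards):-1:1
        running = rewards[t] + γ * running
        G[t] = running
    end
    return (G .- mean(G)) ./ (std(G) + 1e-8)
end
```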

MikeInnes (Member)

It will also need to be in its own folder, and have a simple README. Otherwise this is looking good I think, but it'd be good to hear from @tejank10.

shreyas-kowshik (Contributor, Author)

@MikeInnes Sorry for the delayed response. I have made the changes. Is the README sufficient for now or is there something more to be added?

Labels: none yet · Projects: none yet · 4 participants