Multiple Trials for Reinforcement Learning Suggestion #416

jinan-zhou · 2019-03-02T07:41:21Z

The reinforcement learning suggestion now supports spawning multiple trials in each iteration. Users can enable this by specifying the num_trials suggestion parameter in the StudyJob YAML file.

Please note that multiple-trial support is intended only to extend exploration, not to enable asynchronous updates. The suggestion uses the average metric across all trials in an iteration to calculate the policy gradient, which gives a more reliable evaluation of the LSTM cell's internal state than a single trial does. The suggestion will not generate new candidates until all of the previous ones have finished.
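The averaging rule described above can be sketched roughly as follows. This is only an illustration and not the code in this PR: the function name `averaged_reward` and the convention that an unfinished trial reports `None` are assumptions made for the example.

```python
from statistics import mean
from typing import Optional, Sequence


def averaged_reward(trial_metrics: Sequence[Optional[float]]) -> Optional[float]:
    """Average the objective metric over all trials of one iteration.

    Hypothetical helper: an unfinished trial is represented by None.
    Returns None while any trial is still pending, so the caller knows
    not to update the policy or generate new candidates yet.
    """
    if not trial_metrics or any(m is None for m in trial_metrics):
        return None
    return mean(trial_metrics)


# All three trials finished: the reward is their average.
finished = averaged_reward([0.5, 0.75, 1.0])

# One trial still running: no reward yet, so no policy update.
pending = averaged_reward([0.5, None, 1.0])
```

Returning `None` for an incomplete iteration keeps the update synchronous, which matches the "exploration only, not asynchronous update" behaviour stated above.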

Fixes #396

jinan-zhou · 2019-03-02T07:52:39Z

/assign @hougangliu @YujiOshima

hougangliu · 2019-03-03T23:49:41Z

pkg/suggestion/nasrl_service.py

@@ -33,11 +35,13 @@ def __init__(self, request, logger):
        self.search_space = None
        self.opt_direction = None
        self.objective_name = None
+        self.num_trials = 1


api.GetSuggestionsRequest.request_number already defines the number of trials, so you do not need to add a new num_trials field in suggestionParameters.
Thanks!

Thank you for pointing this out! We will change it.

Fixed. I removed num_trials from the suggestion parameters and used requestNumber instead.
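As a rough sketch of what this fix amounts to (not the actual change in pkg/suggestion/nasrl_service.py), the number of trials can be read from the request rather than from a suggestion parameter. `FakeSuggestionsRequest` and `trials_per_iteration` are hypothetical names for illustration; only the `request_number` field comes from the review comment above, and the fallback of 1 mirrors the `self.num_trials = 1` default in the diff.

```python
from dataclasses import dataclass


@dataclass
class FakeSuggestionsRequest:
    """Hypothetical stand-in for api.GetSuggestionsRequest.

    Only the request_number field is modeled here.
    """
    request_number: int = 0


def trials_per_iteration(request) -> int:
    """Return how many trials to spawn for this request.

    Falls back to a single trial when request_number is missing or
    not positive (an assumed default for this sketch).
    """
    n = getattr(request, "request_number", 0)
    return n if n and n > 0 else 1


num_trials = trials_per_iteration(FakeSuggestionsRequest(request_number=3))
default_trials = trials_per_iteration(FakeSuggestionsRequest())
```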

jinan-zhou · 2019-03-05T19:08:33Z

/retest

hougangliu · 2019-03-06T00:00:38Z

/lgtm
/approve

k8s-ci-robot · 2019-03-06T00:01:05Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hougangliu

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [hougangliu]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

andreyvelich · 2019-03-06T00:46:06Z

/retest

supoort multiple trials

e6234d1

k8s-ci-robot requested review from libbyandhelen and texasmichelle March 2, 2019 07:41

k8s-ci-robot added the size/L label Mar 2, 2019

adjust To Do

deba168

k8s-ci-robot assigned hougangliu and YujiOshima Mar 2, 2019

jinan-zhou changed the title ~~Support Multiple Trials~~ Multiple Trials for Reinforcement Learning Suggestion Mar 2, 2019

language improvement in README.md

ef3ff96

hougangliu reviewed Mar 3, 2019

fix several problems

d328904

DeeperMind added 2 commits March 5, 2019 13:46

fix a potential problem

2f415f3

handle the GetEvaluationResult() return None problem

8986a7f

k8s-ci-robot added the lgtm label Mar 6, 2019

k8s-ci-robot added the approved label Mar 6, 2019

k8s-ci-robot merged commit feee2f9 into kubeflow:master Mar 6, 2019
