Able to print gradients in event_handler #3085

typhoonzero · 2017-07-27T10:41:09Z · edited by wangkuiyi

Makes it possible to print gradients in event_handler when training with the v2 API.

Related: #3040
Fixes: #3211

def event_handler(event):
    if isinstance(event, paddle.event.EndIteration):
        # Every 10 batches, dump each parameter and its gradient.
        if event.batch_id % 10 == 0:
            for p in parameters:
                print "parameters:", parameters.get(p)
                grad = parameters.get_grad(p)
                print "gradients:", grad
                # Signed value of the gradient entry with the largest magnitude.
                print "gradients max abs:", max(grad.min(), grad.max(), key=abs)
        if event.batch_id % 100 == 0:
            print "Pass %d, Batch %d, Cost %f" % (
                event.pass_id, event.batch_id, event.cost)
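The `max(grad.min(), grad.max(), key=abs)` expression in the handler picks whichever extreme of the gradient has the larger magnitude and keeps its sign. A minimal NumPy sketch of that idiom is given here; the helper name `signed_max_abs` and the sample array are illustrative only and not part of the PR, and the sketch assumes `get_grad` returns a NumPy array.

```python
import numpy as np


def signed_max_abs(grad):
    """Return the entry of `grad` with the largest magnitude, sign preserved."""
    # Only the two extremes can have the largest absolute value,
    # so comparing min and max by abs() is enough.
    return max(grad.min(), grad.max(), key=abs)


# Illustrative gradient values, not real training output.
sample_grad = np.array([[0.5, -2.0], [1.5, 0.25]])
largest = signed_max_abs(sample_grad)
print("gradients max abs: %s" % largest)
```

Keeping the sign can help when debugging, since it shows whether the dominant gradient entry is pushing a weight up or down.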

Yancey1989 · 2017-07-27T15:44:15Z

python/paddle/v2/trainer.py

@@ -161,14 +161,14 @@ def train(self, reader, num_passes=1, event_handler=None, feeding=None):
                    self.__parameter_updater__.update(each_param)
                cost_sum = out_args.sum()
                cost = cost_sum / len(data_batch)
-                self.__parameter_updater__.finishBatch(cost)


Just a little confused: why do we move the batch_evaluator to the event_handler?

typhoonzero · Jul 28, 2017

The event_handler needs to be called before the finishBatch operations so that we can read the internal state before it is cleared.

Yancey1989 · Jul 28, 2017

Got it, thanks!
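The ordering point in this thread can be illustrated with a toy model. This is only a sketch: `ToyUpdater`, `run_batch`, and `record` are hypothetical names, not the Paddle API, and the sketch assumes (following the reply above) that the end-of-batch step discards per-batch state such as gradients.

```python
class ToyUpdater(object):
    """Hypothetical stand-in for a parameter updater; not the Paddle API."""

    def __init__(self):
        self.grads = {}

    def update(self, name, grad):
        # Record the gradient computed for this batch.
        self.grads[name] = grad

    def finish_batch(self):
        # Assumed behavior: end-of-batch cleanup discards per-batch gradients.
        self.grads.clear()


def run_batch(updater, handler, handler_first):
    updater.update("w", [0.1, -0.3])
    if handler_first:
        handler(updater)  # the handler can still see the gradients
        updater.finish_batch()
    else:
        updater.finish_batch()
        handler(updater)  # the gradients are already gone


seen = []


def record(updater):
    # Copy the dict so later clearing does not change what was observed.
    seen.append(dict(updater.grads))


run_batch(ToyUpdater(), record, handler_first=True)
run_batch(ToyUpdater(), record, handler_first=False)
print(seen)
```

In this toy model only the handler-first ordering lets the handler observe the gradient, which is the reason given for moving the finishBatch call.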

jacquesqiao · 2017-07-28T00:03:01Z

python/paddle/v2/parameters.py

                assert isinstance(val, api.Vector)
                val = val.copyToNumpyArray()
                return val
                # else continue

            raise RuntimeError("Unexpected branch")

+    def __getitem__(self, key):


Rename to get_param?

typhoonzero · Jul 28, 2017

__getitem__ is called when evaluating param[k]; it is operator overloading, so it needs to stay the same as before.
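For readers unfamiliar with the subscript hook, here is a small sketch of how `__getitem__` backs the `obj[key]` syntax and how a named accessor can delegate to it. `ToyParameters` and its methods are hypothetical and do not reproduce the real `parameters.py`.

```python
class ToyParameters(object):
    """Hypothetical container; names here are illustrative, not Paddle's."""

    def __init__(self):
        self._values = {}

    def set(self, name, value):
        self._values[name] = value

    def __getitem__(self, key):
        # Python calls this method for the expression `params[key]`.
        return self._values[key]

    def get(self, name):
        # A named accessor can simply delegate to the subscript operator.
        return self[name]


params = ToyParameters()
params.set("fc.w", [1.0, 2.0])
print(params["fc.w"])
```

Renaming `__getitem__` would break every caller that uses the subscript form, which is why the method name has to stay fixed.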

lcy-seso · 2017-08-27T01:41:32Z

Even though it copies the parameters from the C++ side, this is very helpful for debugging the training process and tuning the model. Thank you very much.

able to print gradients in event_handler

f150d6f

typhoonzero requested review from reyoung and lcy-seso July 27, 2017 10:41

Yancey1989 reviewed Jul 27, 2017

jacquesqiao reviewed Jul 28, 2017

wangkuiyi approved these changes Aug 3, 2017

typhoonzero merged commit 01e9e44 into PaddlePaddle:develop Aug 11, 2017

typhoonzero deleted the print_grad branch August 11, 2017 06:54

This was referenced Aug 27, 2017

add a show_paremeter_status method to v2 parameter #3699

Closed

get_grad returns zero matrix when use_gpu=True #3714

Closed
