Feature/evaluator #5331
Conversation
doc/design/evaluator.md
Outdated
### Evaluator Design
Currently, every operation is expressed in the graph. we divide the evaluator process into three steps.
1. Initialize the metric state necessary and add it into the block.
metric state necessary -> metric state
Done
doc/design/evaluator.md
Outdated
1. Initialize the metric state necessary and add it into the block.
2. Calculate the statistic of the metric state in every mini-batch. The single operator is only responsible for calculating necessary statistics for one mini-batch. For example, accuracy operator only calculate a minibatch data if run once.\
remove ""
Done.
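Step 2 above says the accuracy operator only computes statistics for one mini-batch per run. A minimal NumPy sketch of that per-batch statistic (the function name is illustrative, not the actual Paddle operator):

```python
import numpy as np

def batch_accuracy_stat(predictions, labels):
    """Per-mini-batch statistic: (correct count, sample count).

    Returning raw counts rather than a ratio lets the counts from many
    mini-batches be merged later (step 3 of the design).
    """
    pred_ids = np.argmax(predictions, axis=1)  # predicted class per sample
    correct = int(np.sum(pred_ids == labels))
    total = len(labels)
    return correct, total
```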
doc/design/evaluator.md
Outdated
"""
pass

def _clear_state(self):
Maybe this should not be a part of Python.
doc/design/evaluator.md
Outdated
"""
pass

def _append_evalutor_op(self):
Perhaps we just need to specify what the interface looks like to the user.
I think a very important general rule is: don't make a decision unless we absolutely have to. Deferring the decision about which private methods to use for as long as possible gives us more information to make a good one.
2. Calculate the statistic of the metric state in every mini-batch. The single operator is only responsible for calculating necessary statistics for one mini-batch. For example, accuracy operator only calculate a minibatch data if run once.

3. Merge the mini-batch statistics to form the evaluation result for multiple mini-batches. When it comes to distributed training/Multi-GPU training, aggregate the value from different devices.
Looking at the code below, it seems we need a detailed description of how to implement evaluator operators in C++, how to save state in C++, and how to update that state on the Python side.
Sure, I will add the details.
Currently, we have two options.
Option 1: like TensorFlow, compose low-level operators to compute the metric. If performance turns out to be a real bottleneck, rewrite them as a new C++ operator.
Option 2: use a C++ operator to calculate each mini-batch metric and maintain some state on the Python side.
I'm not sure which is better yet. I implemented option 2; what do you think?
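Option 2 can be sketched roughly as follows: a per-batch computation (standing in for the C++ operator) plus Python-side state that accumulates across batches. All names here are illustrative, not the actual Paddle API:

```python
class AccuracyEvaluator:
    """Option 2 sketch: the per-batch kernel would live in C++;
    the accumulated metric state lives on the Python side."""

    def __init__(self):
        self.correct = 0   # Python-side metric state
        self.total = 0

    def update(self, batch_correct, batch_total):
        # In the real design these counts would come from running the
        # C++ accuracy operator on one mini-batch.
        self.correct += batch_correct
        self.total += batch_total

    def eval(self):
        return self.correct / self.total if self.total else 0.0
```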
Done.
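For step 3, merging per-device (or per-mini-batch) statistics reduces to summing the raw counts before dividing. A hedged sketch, where the (correct, total) tuples are illustrative stand-ins for actual device results:

```python
def merge_accuracy(device_stats):
    """Aggregate (correct, total) pairs from several devices or
    mini-batches into a single accuracy value."""
    correct = sum(c for c, _ in device_stats)
    total = sum(t for _, t in device_stats)
    return correct / total if total else 0.0
```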
else:
    reset_program = program
for k, var in self._states.iteritems():
    zeros = helper.create_tmp_variable(dtype=var.data_type)
`helper` can only create vars and ops in `main_program` and `startup_program`. However, here we need to create them in the `reset_program`, so it's not correct to use `helper` here.
Done.
# """
# raise NotImplementedError()

def reset(self, executor, program=None):
Do not expose `reset_program` to the upper level. It should be created, run, and destroyed inside the `reset` function.
Done.
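The point of the comment above is that `reset()` should own the temporary program's whole lifetime. A framework-agnostic sketch of that encapsulation, where `reset_program` is a plain dict standing in for the real fluid program object:

```python
class Evaluator:
    """Sketch of the reviewed interface: reset() builds and runs its
    own temporary reset_program instead of leaking one to the caller."""

    def __init__(self):
        self._states = {"correct": 0, "total": 0}  # metric state

    def reset(self, executor=None, reset_program=None):
        # If no program is supplied, create a throwaway one here, run
        # it, and let it be garbage-collected afterwards; the caller
        # never has to manage it.
        if reset_program is None:
            reset_program = {name: 0 for name in self._states}
        self._states.update(reset_program)
        return dict(self._states)
```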
"""
raise NotImplementedError()

def reset(self, executor, program=None):
reset_program=None
Done
return acc_out

def eval(self, executor, program=None):
eval_program=None
Done.
@@ -31,6 +32,8 @@
main_program=main_program,
startup_program=startup_program)

accuracy = evaluator.Accuracy(input=y_predict, label=y)
evaluator.accuracy()
removed
Approved just because @Canpio approved.
Fixes #3808.
The previous evaluator didn't separate compile time and runtime, and we also need to create some metric states for each evaluator.
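The compile-time/runtime separation the PR description mentions can be sketched as: state variables are declared once when the graph is built, and a fresh mutable copy is materialized for each run. All names below are hypothetical illustrations, not the Paddle API:

```python
class CompiledMetric:
    """Compile time: declare the metric state once (analogous to adding
    state variables to a startup block). Runtime: each run gets its own
    fresh, mutable copy of that declared state."""

    def __init__(self):
        # compile-time: the state is only declared, not yet computed
        self.state_names = ["correct", "total"]

    def make_runtime_state(self):
        # runtime: materialize a fresh state, independent per run
        return {name: 0 for name in self.state_names}
```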