Hi, thanks for your great work
I have one question about the weight updating protocol.
The gradients of the local network (including all auxiliary tasks) are applied in the process function in trainer.py. However, I notice that the sync function (which copies the weights of the global network to the local one) is run BEFORE apply_gradient is run (lines 354 and 409 respectively).
Following the code behind these two functions, you are copying the shared weights to the local network (so the global variables and local variables are identical at that point), then calculating gradients on the local variables, then applying those gradients to the global variables. That does not sound logical, does it?
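For reference, here is a minimal PyTorch-style sketch of the pattern I am describing (not the repo's actual TensorFlow code; the network shapes, names, and dummy data are made up):

```python
import torch
import torch.nn as nn

# Hypothetical stand-ins for the shared (global) and worker (local) networks.
global_net = nn.Linear(4, 2)
local_net = nn.Linear(4, 2)
optimizer = torch.optim.SGD(global_net.parameters(), lr=0.01)

# 1. sync: copy the global (shared) weights into the local network,
#    so both sets of variables are identical before the rollout.
local_net.load_state_dict(global_net.state_dict())

# 2. compute gradients on the *local* network from a (dummy) rollout.
x, y = torch.randn(8, 4), torch.randn(8, 2)
loss = nn.functional.mse_loss(local_net(x), y)
loss.backward()

# 3. apply_gradient: apply the locally computed gradients to the
#    *global* parameters via the shared optimizer.
optimizer.zero_grad()
for g_param, l_param in zip(global_net.parameters(), local_net.parameters()):
    g_param.grad = l_param.grad.clone()
optimizer.step()
```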
Please correct me if I am wrong.