-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Metric aggregation #3321
Metric aggregation #3321
Conversation
Hello @justusschock! Thanks for updating this PR. There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻 Comment last updated at 2020-09-13 15:02:08 UTC |
@SkafteNicki Do we need a default reduce op for base classes? I don't think so, since currently it's all gathered and no op is done during ddp sync |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Really nice changes:
I completely agree that we do not need the reduce_op
argument anymore for the base class.
@SkafteNicki I fixed the tests to be compliant to our current behaviour. All changes to aggregation will be done in following PRs :) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
a bit much with all the black formatting :)
just two feedbacks:
- reduce op docstring is left over
- stacking only works if all the dims are the same, is this assumption correct?
Co-authored-by: Adrian Wälchli <aedu.waelchli@gmail.com>
we have one test that are failing: |
Codecov Report
@@ Coverage Diff @@
## master #3321 +/- ##
========================================
- Coverage 85% 80% -5%
========================================
Files 104 118 +14
Lines 8149 9750 +1601
========================================
+ Hits 6950 7844 +894
- Misses 1199 1906 +707 |
it seems to be appearing just in this PR, it is not presented in master... it shall be patched so we do not break master... |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
let's fix the failing test...
This pull request is now in conflict... :( |
Should lightning support variable sizes (in batch dimension) in gather all? |
@Borda I fixed the failing test, can I merge this? |
Allows aggregation on the same device as well as across multiple devices in DDP
Fixes #3245
Before submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.
Did you have fun?
Make sure you had fun coding 🙃