-
Notifications
You must be signed in to change notification settings - Fork 5.6k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add mp_all_reduce asynchronize overlap. (#55662)
* [WIP] Add mp_all_reduce asynchronize overlap. * Fix some problems. * Fix dw compute bug, and use a temporary solution to achieve overlap. * Use fused_linear_param_grad_add to compute dw. * Reformat ColumnParallel _overlap_linear. Use environment flags to control following behaviors: 1. export Flags_mp_aysnc_allreduce=True to turn on mp async all_reduce 2. export Flags_skip_mp_c_identity=True to skip two c_identity operators in dygraph mode. 3. export Flags_fused_linear_param_grad_add to enable fused_linear_param_grad_add in ColumnParallel backward with mp async all_reduce. * Polish code. * Remove useless communication API. * Fix some problems in mp_async_all_reduce and skip_c_identity. * Add test cases. * Remove environment variable Flags_fused_linear_param_grad_add in test case. * Reset error threshold. * Reset threshold in test case. * Add useful log. Remove useless test cases.
- Loading branch information
1 parent
a8981be
commit 6b1dfb5
Showing
2 changed files
with
214 additions
and
27 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters