Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2-node AllReduce improvements #118

Merged
merged 64 commits into from
Jul 7, 2023
Merged

2-node AllReduce improvements #118

merged 64 commits into from
Jul 7, 2023

Conversation

chhwang
Copy link
Contributor

@chhwang chhwang commented Jul 5, 2023

  • Added get() interfaces to SmChannel
  • Improved 2-node (8 gpus/node) AllReduce: algbw 139GB/s for 1GB (kernel 3) and 99GB/s for 48MB (kernel 4)
  • Fixed a FIFO perf bug
  • Several fixes & validations in mscclpp-test

@chhwang chhwang changed the title Binyli/allreduce multinodes 2-node AllReduce improvements Jul 5, 2023
@chhwang chhwang marked this pull request as ready for review July 5, 2023 05:49
Copy link
Contributor

@saeedmaleki saeedmaleki left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

just minor changes.

include/mscclpp/fifo.hpp Show resolved Hide resolved
include/mscclpp/sm_channel.hpp Outdated Show resolved Hide resolved
include/mscclpp/sm_channel.hpp Outdated Show resolved Hide resolved
include/mscclpp/sm_channel.hpp Outdated Show resolved Hide resolved
include/mscclpp/sm_channel.hpp Outdated Show resolved Hide resolved
test/mscclpp-test/allreduce_test.cu Show resolved Hide resolved
test/mscclpp-test/allreduce_test.cu Outdated Show resolved Hide resolved
@Binyang2014 Binyang2014 force-pushed the binyli/allreduce-multinodes branch from 0ac9833 to 61449a8 Compare July 6, 2023 05:32
Copy link
Contributor

@saeedmaleki saeedmaleki left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

a bit worried about the register usage of copy instructions. Did it matter for performance?

include/mscclpp/sm_channel.hpp Outdated Show resolved Hide resolved
include/mscclpp/sm_channel.hpp Show resolved Hide resolved
@chhwang chhwang enabled auto-merge (squash) July 7, 2023 05:47
@chhwang chhwang merged commit bb7b85a into main Jul 7, 2023
@chhwang chhwang deleted the binyli/allreduce-multinodes branch July 7, 2023 07:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants