Skip to content

Releases: ROCm/rccl

rccl-2.7.8 for ROCm 4.1.0

23 Mar 01:18
Compare
Choose a tag to compare

Added

  • Experimental support for clique-based kernels (opt in with RCCL_ENABLE_CLIQUE=1)
  • Clique-based kernels may offer better performance for smaller input sizes
  • Clique-based kernels are currently only enabled for AllReduce under a certain byte limit (controlled via RCCL_CLIQUE_ALLREDUCE_BYTE_LIMIT)

Optimizations

  • Performance improvements for Rome-based systems

Known issues

  • Clique-based kernels are currently experimental and have not been fully tested on all topologies. By default, clique-based kernels are disabled if the detected topology is not supported (override with RCCL_FORCE_ENABLE_CLIQUE)
  • Clique-based kernels may hang if there are differences between environment variables set across ranks.
  • Clique-based kernels may fail if the input / output device pointers are not the base device pointers returned by hipMalloc.

Known Issues

  • None

rccl-2.7.9 for ROCm 4.0.0

18 Dec 15:22
Compare
Choose a tag to compare

New Features

  • No new features

Known Issues

  • None

rccl-2.7.9 for ROCm 3.10.0

30 Nov 17:02
Compare
Choose a tag to compare

New Features

  • Added experimental support for clique-based kernels (opt in with RCCL_ENABLE_CLIQUE=1)
    • Clique-based kernels may offer better performance for smaller input sizes
    • Clique-based kernels are currently only enabled for AllReduce under a certain byte limit (controlled via RCCL_CLIQUE_ALL_REDUCE_BYTE_LIMIT)
  • Performance improvements for Rome-based systems

Known Issues

  • Clique-based kernels are currently experimental and have not been fully tested on all topologies. By default clique-based kernels are disabled if the topology is not supported (override with RCCL_FORCE_ENABLE_CLIQUE)
  • Clique-based kernels may hang if there are differences between environment variables set across ranks

rccl-2.7.8 for ROCm 3.9.0

27 Oct 20:13
44fcde7
Compare
Choose a tag to compare

New Features

  • Modifications to topology based on XGMI links
  • Adding support for alltoallv RCCL kernel

Known Issues

  • None

rccl-2.7.6 for ROCm 3.8.0

18 Sep 21:32
391bbf3
Compare
Choose a tag to compare

New Features

  • Support for static library builds

Known Issues

  • None

rccl-2.7.6 for ROCm 3.7.0

15 Aug 04:26
d1e20b4
Compare
Choose a tag to compare

New Features

  • Updated to RCCL API version of 2.7.6
  • Added gather, scatter and all to all collectives

Known Issues

  • None

rccl-2.7.0 for ROCm 3.6.0

11 Jul 00:38
2a45147
Compare
Choose a tag to compare

New Features

  • Updated to RCCL API version of 2.6.4

Known Issues

  • None

RCCL-2.7.0 for ROCm 3.5.0

01 Jun 19:52
250d820
Compare
Choose a tag to compare

New Features

  • Switched to hip-clang as default compiler and deprecated hcc build
  • Re-enable target gfx803.
  • Fixing issues and build time improvements for hip-clang.
  • Compatibility with NCCL 2.6
  • Network interface improvements with API v3
  • Network topology detection
  • Improved CPU type detection
  • Infiniband adaptive routing support

Known Issues

None

HIP-CLANG 3.2

13 Apr 15:47
fa52d4f
Compare
Choose a tag to compare
Merge pull request #187 from aaronenyeshi/fix-hip-vdi-hsa-ext

Fix HIP-Clang build with HSA headers

RCCL 3.3

31 Mar 15:24
ebc823e
Compare
Choose a tag to compare

RCCL 3.3