Releases: ROCm/rccl
Releases · ROCm/rccl
rccl-2.7.8 for ROCm 4.1.0
Added
- Experimental support for clique-based kernels (opt in with RCCL_ENABLE_CLIQUE=1)
- Clique-based kernels may offer better performance for smaller input sizes
- Clique-based kernels are currently only enabled for AllReduce under a certain byte limit (controlled via RCCL_CLIQUE_ALLREDUCE_BYTE_LIMIT)
Optimizations
- Performance improvements for Rome-based systems
Known issues
- Clique-based kernels are currently experimental and have not been fully tested on all topologies. By default, clique-based kernels are disabled if the detected topology is not supported (override with RCCL_FORCE_ENABLE_CLIQUE)
- Clique-based kernels may hang if there are differences between environment variables set across ranks.
- Clique-based kernels may fail if the input / output device pointers are not the base device pointers returned by hipMalloc.
Known Issues
- None
rccl-2.7.9 for ROCm 4.0.0
New Features
- No new features
Known Issues
- None
rccl-2.7.9 for ROCm 3.10.0
New Features
- Added experimental support for clique-based kernels (opt in with RCCL_ENABLE_CLIQUE=1)
- Clique-based kernels may offer better performance for smaller input sizes
- Clique-based kernels are currently only enabled for AllReduce under a certain byte limit (controlled via RCCL_CLIQUE_ALL_REDUCE_BYTE_LIMIT)
- Performance improvements for Rome-based systems
Known Issues
- Clique-based kernels are currently experimental and have not been fully tested on all topologies. By default clique-based kernels are disabled if the topology is not supported (override with RCCL_FORCE_ENABLE_CLIQUE)
- Clique-based kernels may hang if there are differences between environment variables set across ranks
rccl-2.7.8 for ROCm 3.9.0
New Features
- Modifications to topology based on XGMI links
- Adding support for alltoallv RCCL kernel
Known Issues
- None
rccl-2.7.6 for ROCm 3.8.0
New Features
- Support for static library builds
Known Issues
- None
rccl-2.7.6 for ROCm 3.7.0
New Features
- Updated to RCCL API version of 2.7.6
- Added gather, scatter and all to all collectives
Known Issues
- None
rccl-2.7.0 for ROCm 3.6.0
New Features
- Updated to RCCL API version of 2.6.4
Known Issues
- None
RCCL-2.7.0 for ROCm 3.5.0
New Features
- Switched to hip-clang as default compiler and deprecated hcc build
- Re-enable target gfx803.
- Fixing issues and build time improvements for hip-clang.
- Compatibility with NCCL 2.6
- Network interface improvements with API v3
- Network topology detection
- Improved CPU type detection
- Infiniband adaptive routing support
Known Issues
None
HIP-CLANG 3.2
Merge pull request #187 from aaronenyeshi/fix-hip-vdi-hsa-ext Fix HIP-Clang build with HSA headers
RCCL 3.3
RCCL 3.3