Intel(R) oneAPI Collective Communications Library (oneCCL) 2021.13
What's New:
- Optimizations to limit the memory consumed by oneCCL
- Optimizations to limit the number of file descriptors maintained opened by oneCCL.
- Align the support for in-place for the Allgatherv and Reduce-scatter collectives to follow the same behavior as NCCL.
- In particular, the Allgatherv collective is in place when:
- send_buff == recv_buff + rank_offset, where rank_offset = sum (recv_counts[i]), for all I<rank.
- Reduce-scatter is in-place when recv_buff == send_buff + rank *recv_count.
- When using the environment variable CCL_WORKER_AFFINITY, oneCCL enforces the requirement that the length of the list should be equal to the number of workers.
- Bug fixes.