Including test settings, bandwidth result, etc
src/ # Blink+ implementation
include/ # Blnk+ header file
test_bandwidth/ # Bandwidth benchmark test, include bandwidth test for nccl and bandwidth test for Blink+
visualization/ # Visualize experiment result
graphs/ # .xml files that Blink+ use
- Build nccl package v2.7.8
Blink+ relies on NCCL v2.7.8. We have prepared a NCCL package v2.7.8 with some bug fix link
git clone https://github.com/UCBerkeley-Spring2022-CS267-project/blinkplus-nccl-base.git
cd blinkplus-nccl-base
and then follow the INSTALL.md file here link to install NCCL
- Build blink+ library and benchmark
mkdir build
cd build
cmake .. -DCMAKE_CUDA_COMPILER=/usr/local/cuda-11.5/bin/nvcc -DCMAKE_BUILD_TYPE=Release
make -j 10
sudo make install
- Export graph path
export BLINKPLUS_GRAPH_FILE_CHAIN_01=`pwd`/graphs/user01/chain01.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_021=`pwd`/graphs/user01/chain021.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_031=`pwd`/graphs/user01/chain031.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_0321=`pwd`/graphs/user01/chain0321.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_12=`pwd`/graphs/user12/chain12.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_102=`pwd`/graphs/user12/chain102.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_132=`pwd`/graphs/user12/chain132.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_013=`pwd`/graphs/user03/chain013.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_023=`pwd`/graphs/user03/chain023.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_203=`pwd`/graphs/user23/chain203.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_213=`pwd`/graphs/user23/chain213.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_2103=`pwd`/graphs/user23/chain2103.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_032=`pwd`/graphs/user02/chain032.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_012=`pwd`/graphs/user02/chain012.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_103=`pwd`/graphs/user13/chain103.xml
export BLINKPLUS_GRAPH_FILE_CHAIN_123=`pwd`/graphs/user13/chain123.xml
cd test_bandwidth
bash blinkplus_optimal_broadcast_timing.sh
bash blinkplus_optimal_allreduce_timing.sh
bash nccl_allreduce_timing.sh
bash nccl_broadcast_timing.sh
Run the notebook
visualization/test_bandwidth.ipynb
This project is part of the final project for Xiao Song, Yefan Zhou, and Yibai Meng's Spring 2022 CS267 course at UC Berkeley.
Thanks for the help from our GSI Guanhua Wang's project idea and help.
Thanks for the help from NCCL's author Sylvain Jeaugey on his help on answering our github issue questions.