New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

rmda user-manual #190

Closed

ferris-cx wants to merge 2 commits into koordinator-sh:main from ferris-cx:rdma

ferris-cx commented Nov 28, 2024

Ⅰ. Describe what this PR does
Since Gpus in AI scenarios require RDMA computing nics for high-speed NCCL communication, end-to-end support for rdma devices must be added, including device discovery, device registration, node resource update, scheduling, and allocation.
Ⅱ. Does this pull request fix one issue?
No
Ⅲ. Describe how to verify it
Ⅳ. Special notes for reviews
V. Checklist
I have written necessary docs and comments
I have added necessary unit tests and integration tests
All checks passed in make test

ZiMengSheng requested review from ZiMengSheng and saintube and removed request for ZiMengSheng

November 28, 2024 12:52

ZiMengSheng force-pushed the rdma branch 2 times, most recently from f1248db to d8e1f44 Compare

December 4, 2024 08:45


          gpu & rdma joint allocation best practice

dfcacdb

Signed-off-by: iostream2008@163.com <iostream2008@163.com>
Signed-off-by: wangjianyu <wangjianyu.wjy@alibaba-inc.com>

ZiMengSheng force-pushed the rdma branch from d8e1f44 to dfcacdb Compare

December 4, 2024 08:46

ZiMengSheng reviewed

View reviewed changes

docs/best-practices/gpu-rdma-joint-allocation.md

		## A test report on affinity scheduling of rdma nic and GPU on k8s and high speed communication of RDMA computing network

		### Introduction

Contributor

ZiMengSheng Dec 4, 2024

这里应该主要做一下问题描述，以及 Koordinator 是怎么解决该问题的.
这里 Koordinator 已经支持了 GPU & RDMA 联合分配这个功能，不能再说缺乏这个功能了

docs/best-practices/gpu-rdma-joint-allocation.md

		@@ -0,0 +1,1219 @@
		## A test report on affinity scheduling of rdma nic and GPU on k8s and high speed communication of RDMA computing network

Contributor

ZiMengSheng Dec 4, 2024

题目太长了

Author

ferris-cx Dec 17, 2024

oK。我改下标题，尽量简短

docs/best-practices/gpu-rdma-joint-allocation.md


		#### Prerequisite

		<div>The basic K8S cluster environment for GPUs has been installed. The Nvidia driver and containerd have been installed on each GPU node, and the Mellanox NIC driver has been installed on the server.</div>

Contributor

ZiMengSheng Dec 4, 2024

I suggesst the overall structure adjusted as follows

Introduction: problem description and koordinator solution introduction
Experiment Setting:
1. Test Scanarios
2. Cluster and Nodes
Initinalize the nodes
Deploy Koordinator and Multus
Deploy Test Application and Check its allocation result
NCCL Testing

docs/best-practices/gpu-rdma-joint-allocation.md

+                }'
+                ```
+              Plan: Nad configuration file name of NIC ens3f0np0 on node2: sriov-attach-k8s-node2-ens3f0np0-kubeflow-conf.yaml.

Contributor

ZiMengSheng Dec 4, 2024

NAD 的安装应该放在 multus 安装部分？

Author

ferris-cx Dec 17, 2024 •

edited

Loading

应该说，NAD跟multus里引用到，但具体的编排内容还要视pod.yaml而定。所以建议还是跟pod.yaml前面编辑并安装比较合适

docs/best-practices/gpu-rdma-joint-allocation.md

+* GPU1+1* RDMA communication between two Pods 2G data volume communication scenario
+              ```shell
+              mpirun --allow-run-as-root -H 10.244.1.10:1,10.244.2.21:1 -mca plm_rsh_args "-p 20024" -x NCCL_IB_DISABLE=0 -x NCCL_DEBUG=INFO -x NCCL_SOCKET_IFNAME=eth0 -x NCCL_IB_HCA==mlx5_2 -x UCX_NET_DEVICES=eth0 -x NCCL_NET_GDR_READ=1 ./build/all_reduce_perf -b 2M -e 2G -f 2 -g 1 -n 100 -w 5

Contributor

ZiMengSheng Dec 4, 2024

please give a brief intruduction about mpirun


          Merge branch 'koordinator-sh:main' into rdma

548528e

Contributor

ZiMengSheng commented Dec 17, 2024

/close

ZiMengSheng closed this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet