This repository has been archived by the owner on Jun 6, 2024. It is now read-only.

Commit

Add distributed training examples of PyTorch
vvfreesoul committed Sep 10, 2020
1 parent f585648 commit 853d112
Showing 2 changed files with 2 additions and 1 deletion.
1 change: 1 addition & 0 deletions docs/manual/cluster-user/how-to-run-distributed-job.md
@@ -16,6 +16,7 @@ cifar10-single-mul-DDP-nccl | ✓| ✓ |nccl|-| [cifar10-single-mul-DDP-nccl.yam
cifar10-single-mul-DDP-gloo-Apex-mixed | ✓| ✓ | gloo|✓ | [cifar10-single-mul-DDP-gloo-Apex-mixed.yaml](https://github.com/microsoft/pai/tree/master/examples/Distributed-example/cifar10-single-mul-DDP-gloo-Apex-mixed.yaml)|
cifar10-single-mul-DDP-nccl-Apex-mixed | ✓| ✓ | nccl| ✓ | [cifar10-single-mul-DDP-gloo-Apex-mixed.yaml](https://github.com/microsoft/pai/tree/master/examples/Distributed-example/cifar10-single-mul-DDP-gloo-Apex-mixed.yaml)|
imagenet-single-mul-DDP-gloo | ✓| ✓| gloo|-| [imagenet-single-mul-DDP-gloo.yaml](https://github.com/microsoft/pai/tree/master/examples/Distributed-example/Lite-imagenet-single-mul-DDP-gloo.yaml)|

## DataParallel
The single-node program is simple: the program executed in PAI is exactly the same as the one run on a local machine. Note that in PAI you can request a Worker, and within a Worker you can request an Instance. Within a worker, you can request the GPUs you need. We provide an [example](../../../examples/Distributed-example/cifar10-single-node-gpus-cpu-DP.py) of DP.
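
As a rough illustration of what DataParallel does on a single node, the sketch below mimics its scatter / apply / gather pattern in plain Python. It is a conceptual model only, not the linked PAI example: the names `scatter` and `data_parallel_apply` are hypothetical, and the "devices" are simulated sequentially. In PyTorch, the equivalent step is wrapping a model in `torch.nn.DataParallel`, which splits each input batch along its first dimension across the visible GPUs.

```python
from math import ceil
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def scatter(batch: Sequence[T], num_devices: int) -> List[List[T]]:
    """Split a batch into at most `num_devices` contiguous chunks.

    Hypothetical helper: the chunk size is ceil(len(batch) / num_devices),
    so the last chunk may be smaller than the others.
    """
    if num_devices < 1:
        raise ValueError("num_devices must be >= 1")
    if not batch:
        return []
    chunk_size = ceil(len(batch) / num_devices)
    return [list(batch[i:i + chunk_size]) for i in range(0, len(batch), chunk_size)]


def data_parallel_apply(
    fn: Callable[[T], R], batch: Sequence[T], num_devices: int
) -> List[R]:
    """Apply `fn` to every sample, one chunk per simulated device."""
    chunks = scatter(batch, num_devices)
    # A real implementation would run each chunk on its own GPU replica
    # concurrently; this sketch processes the chunks one after another.
    per_device_outputs = [[fn(sample) for sample in chunk] for chunk in chunks]
    # Gather: concatenate per-device outputs back in the original order.
    return [out for outputs in per_device_outputs for out in outputs]


if __name__ == "__main__":
    samples = list(range(10))
    print(scatter(samples, 4))
    print(data_parallel_apply(lambda x: x * x, samples, 4))
```

Because the results are gathered back in order, the caller sees the same output as a single-device run, which is why the same program can run unchanged locally and in a PAI worker with several GPUs.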

2 changes: 1 addition & 1 deletion mkdocs.yml
@@ -7,7 +7,7 @@ nav:
- How to Manage Data: manual/cluster-user/how-to-manage-data.md
- How to Debug Jobs: manual/cluster-user/how-to-debug-jobs.md
- How to Use Advanced Job Settings: manual/cluster-user/how-to-use-advanced-job-settings.md
- - How to Run Distributed Job: /manual/cluster-user/how-to-run-distributed-job.md
+ - How to Run Distributed Job: manual/cluster-user/how-to-run-distributed-job.md
- Use Marketplace: manual/cluster-user/use-marketplace.md
- Use VSCode Extension: manual/cluster-user/use-vscode-extension.md
- Use Jupyter Notebook Extension: manual/cluster-user/use-jupyter-notebook-extension.md
