
Does the project support multi-gpu training? #216

Open
Snimm opened this issue Oct 3, 2024 · 1 comment
Snimm commented Oct 3, 2024

Does the project support multi-GPU training?
If yes, how? By default it only uses one GPU, and I am unable to find any parameter that controls this.

@jonathan-laurent
Owner

Although this is not very well tested, AlphaZero.jl will attempt to use Distributed to parallelise some parts of data generation across all available Julia processes. Each process can then be configured to use its own GPU. Evaluation stages are not parallelised this way, though, and may become a bottleneck if too many processes are used.

Once again, this is not very well tested or documented, so it may require some digging. Don't hesitate to report on your experience and contribute some documentation if you manage to make it work for you.
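For reference, a minimal sketch of the general pattern described above (one GPU per Julia worker process via Distributed and CUDA.jl). This is not an AlphaZero.jl API, just a hypothetical illustration of how per-process GPU assignment usually looks; the `nworkers` count and device mapping are assumptions you would adapt to your machine:

```julia
# Hypothetical sketch: start one Julia worker per GPU and pin each
# worker to its own device. Assumes CUDA.jl is installed and the
# machine has at least `nworkers` GPUs.
using Distributed

nworkers = 2          # assumption: number of GPUs available
addprocs(nworkers)    # spawn worker processes (ids 2, 3, ...)

# Load CUDA.jl on the master and on every worker.
@everywhere using CUDA

# Pin worker i to GPU i-1 (CUDA device indices are 0-based;
# myid() is 1 on the master, 2.. on workers).
for (i, w) in enumerate(workers())
    remotecall_wait(CUDA.device!, w, i - 1)
end
```

AlphaZero.jl would then see multiple processes when distributing self-play data generation; whether each stage actually uses its worker's GPU depends on the configuration, which is why experimenting and reporting back is valuable.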
