This repository has been archived by the owner on Aug 26, 2022. It is now read-only.
v2.0.0
Official release of OSLO 2.0.0 🎉🎉
This version of OSLO provides the following features:
- Tensor model parallelism
- Efficient activation checkpointing
- Kernel fusion
We plan to add pipeline model parallelism and ZeRO optimization in upcoming versions.
New feature: Kernel Fusion
{
  "kernel_fusion": {
    "enable": "bool",
    "memory_efficient_fusion": "bool",
    "custom_cuda_kernels": "list"
  }
}
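The schema above gives the expected type for each option rather than concrete values. As a rough sketch, a filled-in configuration might look like the one built below. The values are illustrative assumptions, not recommended settings, and `validate_kernel_fusion` is a hypothetical helper written for this example; it is not part of OSLO's API.

```python
import json

# Expected Python type for each key in the "kernel_fusion" section,
# taken from the schema in these release notes.
KERNEL_FUSION_SCHEMA = {
    "enable": bool,
    "memory_efficient_fusion": bool,
    "custom_cuda_kernels": list,
}


def validate_kernel_fusion(config: dict) -> dict:
    """Hypothetical helper (not part of OSLO): check key names and value types."""
    section = config["kernel_fusion"]
    for key, value in section.items():
        if key not in KERNEL_FUSION_SCHEMA:
            raise KeyError(f"unknown kernel_fusion option: {key}")
        expected = KERNEL_FUSION_SCHEMA[key]
        if not isinstance(value, expected):
            raise TypeError(
                f"{key} must be {expected.__name__}, got {type(value).__name__}"
            )
    return section


# Example values are assumptions chosen for illustration only.
config = {
    "kernel_fusion": {
        "enable": True,
        "memory_efficient_fusion": False,
        "custom_cuda_kernels": [],  # kernel names are not listed in these notes
    }
}

validate_kernel_fusion(config)

# Serialize to the JSON form shown in the schema above.
config_json = json.dumps(config, indent=2)
print(config_json)
```

Checking types before serializing catches mistakes such as passing the string `"true"` where a boolean is expected. How the resulting JSON is handed to OSLO is covered in the tutorial, not here.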
For more information, see the kernel fusion tutorial.