This repository has been archived by the owner on Aug 26, 2022. It is now read-only.
v2.0.0a0
New API
- We paid homage to DeepSpeed. Now it's easier and simpler to use.
import oslo
model = oslo.initialize(model, config="oslo-config.json")
Add new models
- Albert
- Bert
- Bart
- T5
- GPT2
- GPTNeo
- GPTJ
- Electra
- Roberta
Add document
Remove old pipeline parallelism, kernel fusion code
- We'll refurbish them using the latest methods
- Kernel fusion: AOTAutograd
- Pipeline parallelism: Sagemaker PP