Skip to content

v2.0

Latest
Compare
Choose a tag to compare
@sunzeyeah sunzeyeah released this 26 May 01:56

Pipelined implementation of SFT, Reward and RLHF training based on transformers, DeepSpeed and DeepSpeedChat. List of supported models: Pangu, GLM, ChatGLM