Skip to content

Commit

Permalink
require grad_sync to be true now for this ckpt
Browse files Browse the repository at this point in the history
Signed-off-by: Terry Kong <terryk@nvidia.com>
  • Loading branch information
terrykong committed Nov 22, 2024
1 parent 79eed88 commit 7f75e47
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tests/functional/ppo.sh
Original file line number Diff line number Diff line change
Expand Up @@ -82,7 +82,7 @@ python -u ${GPFS}/examples/nlp/gpt/serve_ppo_critic.py \
++model.global_batch_size=1 \
++model.tensor_model_parallel_size=1 \
++model.optim.bucket_cap_mb=200 \
++model.optim.overlap_grad_sync=False \
++model.optim.overlap_grad_sync=True \
++model.optim.contiguous_grad_buffer=True \
++trainer.ppo.pad_sequence_length_to_multiple=32 \
model.reward_standardization.enable=True \
Expand Down

0 comments on commit 7f75e47

Please sign in to comment.