New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add NeMo-Run DPO example #381

Draft

hemildesai wants to merge 1 commit into main from hemil/nemo-run-dpo

Collaborator

hemildesai commented Nov 5, 2024

What does this PR do ?

Adds the same DPO example in https://docs.nvidia.com/nemo-framework/user-guide/latest/modelalignment/dpo.html but using NeMo-Run

Changelog

Please update the CHANGELOG.md under next version with high level changes in this PR.

Usage

The example can be run using python examples/nemo_run/dpo.py

# Add a code snippet demonstrating how to use this

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation? Make sure to also update the NeMo Framework User Guide which contains the tutorials

Checklist when contributing a new algorithm

Does the trainer resume and restore model state all states?
Does the trainer support all parallelism techniques(PP, TP, DP)?
Does the trainer support max_steps=-1 and validation?
Does the trainer only call APIs defined in alignable_interface.py?
Does the trainer have proper logging?

Additional Information

Related to # (issue)


          Add NeMo-Run DPO example

bd09529

Signed-off-by: Hemil Desai <hemild@nvidia.com>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet