Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor reward-modeling likelihood #200

Merged
merged 4 commits into from
Jun 23, 2024
Merged

Conversation

runame
Copy link
Collaborator

@runame runame commented Jun 15, 2024

Suggested refactor of the reward-modeling likelihood, let me know if I missed something. Advantages:

  1. Simplicity and readability. We get rid of self.reward_modeling and self._fitting (actually self._fitting was not even used before, not sure if you had a future use case in mind?).
  2. When a user calls la.likelihood it will always return "reward_modeling" which should be the expected behavior.

(Unrelated: I also removed a superfluous loss_with_var argument.)

@runame runame added the enhancement New feature or request label Jun 15, 2024
@runame runame requested a review from wiseodd June 15, 2024 16:22
@runame runame self-assigned this Jun 15, 2024
Copy link
Collaborator

@wiseodd wiseodd left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks for the refactor, definitely makes things cleaner now!

@runame runame merged commit 98779a8 into grid-search Jun 23, 2024
3 checks passed
@runame runame deleted the reward-modeling-refactor branch June 23, 2024 13:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants