Thank you for your interests. This is the code repo for the paper "CREAM: Consistency Regularized Self-Rewarding Language Models". The codes are under organized and will be released soon!
-
Notifications
You must be signed in to change notification settings - Fork 1
Code for paper "CREAM: Consistency Regularized Self-Rewarding Language Models".
Raibows/CREAM
About
Code for paper "CREAM: Consistency Regularized Self-Rewarding Language Models".
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published