generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
dpo_trainer gather metrics across ranks before logging
#2474
opened Dec 13, 2024 by
zhc7
Loading…
2 of 5 tasks
Add "Language Modeling to Unpaired Preference" in Utilities for converting dataset types
😴 stale
No update from the author, will be closed soon
#2436
opened Dec 4, 2024 by
AMindToThink
Loading…
peft_config & is_loaded_in_4bit check added to Reward_Trainer
#2427
opened Dec 2, 2024 by
shirinyamani
Loading…
5 tasks
🧪 [Experimental] Train LeRobot policy with TRL
#2359
opened Nov 15, 2024 by
qgallouedec
•
Draft
5 tasks
👩🏫 Add SFT notebook for chatbot development
#2321
opened Nov 4, 2024 by
qgallouedec
•
Draft
5 tasks
Asynchronous RLHF: Faster and More Efficient Online DPO
#2278
opened Oct 24, 2024 by
mnoukhov
Loading…
1 of 3 tasks
[SFT VLM] Added support for Molmo models via standalone script
sft_vlm_molmo
#2236
opened Oct 15, 2024 by
sergiopaniego
Loading…
2 of 5 tasks
Remove ds_config scheuduler params to prevent deepseed from creating scheduler for ref_model
#2224
opened Oct 11, 2024 by
Ben-Schneider-code
Loading…
2 of 5 tasks
fixed: OverflowError: out of range integral type conversion attempted
#2206
opened Oct 9, 2024 by
himanshushukla12
Loading…
1 of 5 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.