Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add support for IterableDataset in DPO Trainer
#3559 opened Jun 10, 2025 by h-tonywu Loading…
3 of 5 tasks
Add vllm_gpu_memory_utilization recommendation script
#3554 opened Jun 9, 2025 by toslali-ibm Loading…
5 tasks
🫸 Push model card with checkpoint
#3550 opened Jun 9, 2025 by qgallouedec Loading…
5 tasks
🥳 new rloo
#3533 opened Jun 3, 2025 by shirinyamani Loading…
5 tasks
Push KTAE impl
#3518 opened May 30, 2025 by SamComber Loading…
5 tasks
🎀 New defaults: bf16=True
#3515 opened May 30, 2025 by qgallouedec Loading…
intuit
#3513 opened May 29, 2025 by shirinyamani Loading…
5 tasks
🎀 New defaults: gradient_checkpointing=True
#3510 opened May 29, 2025 by qgallouedec Loading…
5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508 opened May 29, 2025 by shaischaudhry Loading…
3 of 5 tasks
HF Doc Builder style
#3498 opened May 26, 2025 by qgallouedec Draft
[GRPO] Adds SSR priorized replay buffer
#3496 opened May 26, 2025 by edbeeching Loading…
[GKD] Use vllm for the student model
#3475 opened May 21, 2025 by kashif Draft
5 tasks
Add support for CB with native transformers
#3471 opened May 20, 2025 by ArthurZucker Loading…
Allow an user to train from a local dataset
#3470 opened May 19, 2025 by gogo2464 Loading…
1 of 5 tasks
add support for image inputs in GRPO
#3460 opened May 16, 2025 by hellopahe Loading…
[SFT] add warning if dataset's input_ids exceed max_length
#3449 opened May 15, 2025 by HERIUN Loading…
1 of 5 tasks
Fix logging docs
#3447 opened May 14, 2025 by xingyaoww Draft
2 of 5 tasks
🛠️ quantization support for vllm generation
#3428 opened May 8, 2025 by shirinyamani Loading…
5 tasks
Reintroducing step method in ppo_trainer
#3410 opened May 3, 2025 by jskaf34 Loading…
2 of 5 tasks
ProTip! Filter pull requests by the default branch with base:main.