Skip to content

Commit af061cd

Browse files
authored
Merge pull request #312 from sergiopaniego/add-vlm-grpo
πŸ§‘β€πŸ³ Added `Post training an VLM for reasoning with GRPO using TRL` recipe
2 parents aafb3cc + eda3e7c commit af061cd

File tree

3 files changed

+3399
-1
lines changed

3 files changed

+3399
-1
lines changed

β€Žnotebooks/en/_toctree.ymlβ€Ž

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -130,6 +130,8 @@
130130
title: Fine tuning a VLM for Object Detection Grounding using TRL
131131
- local: fine_tuning_vlm_mpo
132132
title: Fine-Tuning a Vision Language Model with TRL using MPO
133+
- local: fine_tuning_vlm_grpo_trl
134+
title: Post training an VLM for reasoning with GRPO using TRL
133135

134136
- title: Search Recipes
135137
isExpanded: false

β€Žnotebooks/en/fine_tuning_vlm_grpo_trl.ipynbβ€Ž

Lines changed: 3396 additions & 0 deletions
Large diffs are not rendered by default.

β€Žnotebooks/en/index.mdβ€Ž

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,11 +7,11 @@ applications and solving various machine learning tasks using open-source tools
77

88
Check out the recently added notebooks:
99

10+
- [Post training an VLM for reasoning with GRPO using TRL](fine_tuning_vlm_grpo_trl)
1011
- [Fine-Tuning a Vision Language Model with TRL using MPO](fine_tuning_vlm_mpo)
1112
- [Fine tuning a VLM for Object Detection Grounding using TRL](fine_tuning_vlm_object_detection_grounding)
1213
- [Hyperparameter Optimization with Optuna and Transformers](optuna_hpo_with_transformers)
1314
- [Fine-tuning T5 for Automatic GitHub Tag Generation with PEFT](finetune_t5_for_search_tag_generation)
14-
- [Documentation Chatbot with Meta Synthetic Data Kit](fine_tune_chatbot_docs_synthetic)
1515

1616
You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).
1717

0 commit comments

Comments
Β (0)