Actions: huggingface/cookbook
90 workflow runs
Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #806: Pull request #312 synchronize by sergiopaniego

Fine-Tuning a Vision Language Model with TRL using MPO recipe
Build PR Documentation #803: Pull request #318 synchronize by sergiopaniego

Fine-Tuning a Vision Language Model with TRL using MPO recipe
Build PR Documentation #802: Pull request #318 opened by sergiopaniego

Fine tuning a VLM for Object Detection Grounding using TRL recipe
Build PR Documentation #801: Pull request #315 synchronize by sergiopaniego

Fine tuning a VLM for Object Detection Grounding using TRL recipe
Build PR Documentation #800: Pull request #315 opened by sergiopaniego

Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #799: Pull request #312 synchronize by sergiopaniego

Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #798: Pull request #312 synchronize by sergiopaniego

Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #797: Pull request #312 synchronize by sergiopaniego

Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #796: Pull request #312 synchronize by sergiopaniego

Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #795: Pull request #312 synchronize by sergiopaniego

Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #794: Pull request #312 synchronize by sergiopaniego

Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #793: Pull request #312 synchronize by sergiopaniego

Post training an VLM for reasoning with GRPO using TRL recipe
Build PR Documentation #792: Pull request #312 opened by sergiopaniego

tokenizer -> processor in trl trainers init
Build PR Documentation #773: Pull request #306 synchronize by sergiopaniego

tokenizer -> processor in trl trainers init
Build PR Documentation #772: Pull request #306 opened by sergiopaniego

HfApiModel to InferenceClientModel
Build PR Documentation #742: Pull request #295 opened by sergiopaniego