Merged
4 changes: 3 additions & 1 deletion notebooks/en/_toctree.yml
@@ -74,7 +74,9 @@
title: Phoenix Observability Dashboard on HF Spaces
- local: search_and_learn
title: Scaling Test-Time Compute for Longer Thinking in LLMs

- local: fine_tuning_llm_grpo_trl
title: Post training an LLM for reasoning with GRPO in TRL

- title: Computer Vision Recipes
isExpanded: false
sections:
891 changes: 891 additions & 0 deletions notebooks/en/fine_tuning_llm_grpo_trl.ipynb

Large diffs are not rendered by default.

3 changes: 1 addition & 2 deletions notebooks/en/index.md
@@ -7,12 +7,11 @@ applications and solving various machine learning tasks using open-source tools

Check out the recently added notebooks:

- [Post training an LLM for reasoning with GRPO in TRL](fine_tuning_llm_grpo_trl)
- [Evaluating AI Search Engines with `judges` - the open-source library for LLM-as-a-judge evaluators](llm_judge_evaluating_ai_search_engines_with_judges_library)
- [Structured Generation from Images or Documents Using Vision Language Models](structured_generation_vision_language_models)
- [Vector Search on Hugging Face with the Hub as Backend](vector_search_with_hub_as_backend)
- [Multi-Agent Order Management System with MongoDB](mongodb_smolagents_multi_micro_agents)
- [Scaling Test-Time Compute for Longer Thinking in LLMs](search_and_learn)
- [Signature-Aware Model Serving from MLflow with Ray Serve](mlflow_ray_serve)

You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).
