Repository search results

Filter by

Advanced
Advanced search

0 files

(118 ms)inksm26/Reinforcement-Fine-Tuning-LLMs-with-GRPO (press backspace or delete to remove)

ksm26/Reinforcement-Fine-Tuning-LLMs-with-GRPO

The course teaches how to fine-tune LLMs using Group Relative Policy Optimization (GRPO)—a reinforcement learning method that improves mo…

reinforcement-learning

machine-learning-algorithms

language-model

reward-design

rft

Jupyter Notebook

Updated
on Jun 13

Star

Sponsor open source projects you depend on

Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projects

ProTip!

Press the

key to activate the search input again and adjust your query.

Sponsor open source projects you depend on

Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projects

ProTip!

Press the

key to activate the search input again and adjust your query.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Filter by

Advanced

ksm26/Reinforcement-Fine-Tuning-LLMs-with-GRPO

Sponsor open source projects you depend on

Sponsor open source projects you depend on

repositories Search Results · repo:ksm26/Reinforcement-Fine-Tuning-LLMs-with-GRPO language:"Jupyter Notebook"

Filter by

Advanced

0 files

ksm26/Reinforcement-Fine-Tuning-LLMs-with-GRPO

Sponsor open source projects you depend on

Sponsor open source projects you depend on