cambridgeltl/inspo

Code for Agentic Policy Optimization via Instruction-Policy Co-Evolution

inspo

Overview

We introduce INSPO, a novel Instruction-Policy Co-Evolution framework that integrates instruction optimization as a dynamic component of the reinforcement learning (RL) loop for training large language models.

Citation

If you find INSPO useful for your research and applications, please give us a star. We will release the scripts in the near future.
