Introducing ToolsGen 🛠️: A modular Python library for synthesizing tool-calling datasets #7857
atasoglu asked this question in Show and tell
Replies: 1 comment
This is quite cool! I like it
I built a tool to solve a problem I kept running into: creating quality datasets for training LLMs to use tools.
ToolsGen takes your JSON tool definitions, automatically generates realistic user requests and the corresponding tool calls, and then evaluates them using an LLM-as-a-judge pipeline. It outputs datasets ready to use with Hugging Face.
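To make the input and output shapes concrete, here is a rough sketch of what a tool definition and one generated sample might look like. This is not the ToolsGen API: the OpenAI-style function schema, the `get_weather` tool, the record layout, and the `check_tool_call` helper are all assumptions made up for illustration, so check the repo for the real formats. The helper only does a cheap structural check that a generated call matches its tool definition, the kind of filter you could run before an LLM judge.

```python
import json

# Hypothetical tool definition in the common OpenAI-style function schema.
# Whether ToolsGen expects exactly this layout is an assumption.
TOOL_SPEC = json.loads("""
{
  "type": "function",
  "function": {
    "name": "get_weather",
    "description": "Get the current weather for a city.",
    "parameters": {
      "type": "object",
      "properties": {
        "city": {"type": "string"},
        "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]}
      },
      "required": ["city"]
    }
  }
}
""")

# Hypothetical shape of one generated sample: a user request plus a tool call.
RECORD = {
    "user_request": "What's the weather like in Istanbul, in celsius?",
    "tool_call": {
        "name": "get_weather",
        "arguments": {"city": "Istanbul", "unit": "celsius"},
    },
}

# Map JSON Schema type names to Python types for a shallow type check.
JSON_TYPES = {
    "string": str,
    "integer": int,
    "number": (int, float),
    "boolean": bool,
    "object": dict,
    "array": list,
}


def _type_ok(value, json_type):
    """Shallow type check; unknown or missing type names are accepted."""
    if json_type not in JSON_TYPES:
        return True
    # bool is a subclass of int in Python, so reject it for non-boolean types.
    if isinstance(value, bool) and json_type != "boolean":
        return False
    return isinstance(value, JSON_TYPES[json_type])


def check_tool_call(tool_spec, tool_call):
    """Return a list of problems; an empty list means the call fits the spec."""
    fn = tool_spec["function"]
    if tool_call["name"] != fn["name"]:
        return [f"unknown tool: {tool_call['name']}"]

    params = fn["parameters"]
    props = params.get("properties", {})
    args = tool_call.get("arguments", {})
    problems = []

    for name in params.get("required", []):
        if name not in args:
            problems.append(f"missing required argument: {name}")

    for name, value in args.items():
        if name not in props:
            problems.append(f"unexpected argument: {name}")
            continue
        json_type = props[name].get("type")
        if not _type_ok(value, json_type):
            problems.append(f"wrong type for {name}: expected {json_type}")
        if "enum" in props[name] and value not in props[name]["enum"]:
            problems.append(f"value for {name} not in enum")

    return problems


if __name__ == "__main__":
    print(check_tool_call(TOOL_SPEC, RECORD["tool_call"]))
```

A check like this only catches structural mistakes (wrong tool name, missing or mistyped arguments). Whether the call actually answers the user request is the part an LLM judge is for.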
What makes it useful:
Still early days (API isn't stable yet), but it's already helping me generate tool-calling datasets much faster.
Check it out: https://github.com/atasoglu/toolsgen
Here is an example dataset generated with this library: https://huggingface.co/datasets/atasoglu/nano-tool-calling-v1
Happy to hear feedback or ideas!