fix: improve MCP tool descriptions and evaluations #2099
|
@onmax is attempting to deploy a commit to the Nuxt Team on Vercel. A member of the Team first needs to authorize it. |
|
I would also like to suggest updating the blog post with this evals solution. What do you think? |
a3b9343 to
0932366
Compare
98af43f to
79ef06f
Compare
|
@onmax Thank you! Give me some time to review this and familiarize myself with Evalite, and yes, depending on that we'll probably modify the blog post. FYI I'm also working on another project related to MCP where this could be useful! 😁 |
|
|
f05b915 to
64933ac
Compare
|
OK @HugoRCD. I was also thinking of opening a PR for Nuxt UI, but I will wait until you decide what the best approach is :) |
64933ac to
d5297d6
Compare
Enhanced tool descriptions with WHEN TO USE guidance, examples, and common paths. Added realistic evaluation scenarios.
d5297d6 to
812bdff
Compare
|
@onmax Do you have a particular config to run the evals because I tried with the current version and there's this error: |
|
yes. totally agreed. but i didn't want to write too much custom code for now :) |
|
Happy to resolve the conflicts @onmax ? |
|
Yes 👍 Should i leave the The CI is currently not happy if I remove them |
1ffa673 to
0073df0
Compare
|
I resolved the conflict btw @atinux |
|
@onmax Not a fan of having to put these |
|
🫂 i feel you... |




Improved MCP tool descriptions with structured guidance and added realistic evaluation scenarios.
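To make the idea concrete, here is a minimal sketch of what a structured tool description and a tool-selection eval scenario could look like. It is not the code from this PR: the `ToolDescription` and `EvalScenario` shapes, the `get_page` tool name, the example path, and the scoring rule are all hypothetical and exist only for illustration, and the sketch uses no Evalite or MCP library calls.

```typescript
// Hypothetical shape for a structured tool description (assumption, not the PR's types).
interface ToolDescription {
  name: string
  summary: string
  whenToUse: string[]
  examples: string[]
  commonPaths: string[]
}

// Render the structured fields into one description string with labeled sections.
function renderDescription(tool: ToolDescription): string {
  const bullets = (items: string[]): string => items.map(item => `- ${item}`).join('\n')
  return [
    tool.summary,
    '',
    'WHEN TO USE:',
    bullets(tool.whenToUse),
    '',
    'EXAMPLES:',
    bullets(tool.examples),
    '',
    'COMMON PATHS:',
    bullets(tool.commonPaths),
  ].join('\n')
}

// Hypothetical tool; the name and path are placeholders, not real identifiers.
const getPage: ToolDescription = {
  name: 'get_page',
  summary: 'Fetch the content of a single documentation page.',
  whenToUse: ['The user asks about a topic and you already know the page path.'],
  examples: ['get_page({ path: "/docs/getting-started" })'],
  commonPaths: ['/docs/getting-started'],
}

// Hypothetical eval scenario: a realistic user question and the tool we expect the model to pick.
interface EvalScenario {
  input: string
  expectedTool: string
}

interface ToolCall {
  toolName: string
}

// Score 1 if the expected tool appears among the calls the model made, otherwise 0.
function scoreToolSelection(scenario: EvalScenario, calls: ToolCall[]): number {
  return calls.some(call => call.toolName === scenario.expectedTool) ? 1 : 0
}

const scenario: EvalScenario = {
  input: 'How do I get started with the project?',
  expectedTool: 'get_page',
}

const rendered = renderDescription(getPage)
const hit = scoreToolSelection(scenario, [{ toolName: 'get_page' }])
const miss = scoreToolSelection(scenario, [{ toolName: 'other_tool' }])
```

A binary score keeps the scenarios easy to read. A real eval may also want to check the arguments passed to the tool, which this sketch deliberately leaves out.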
Impact
Model:
gpt-5.1-codex-mini: maybe we should try other models like Sonnet 4.5, which is more common among developers?
Important
The dev server must currently be running on the same machine.
Changes
Limitations
Several evaluation scenarios are commented out due to MCP prompt limitations:
- @ai-sdk/mcp does not support converting prompts to tools yet
- find_documentation_for_topic, deployment_guide, and migration_help prompts are disabled
Related