How can I add a layer of guardrails to a local LLM’s output? #1360
NierWinter asked this question in Q&A
I want to use NVIDIA NeMo Guardrails to constrain the inference output of a llamacpp instance. Currently I'm running `nemoguardrails server` with a config that points at the local LLM service. My goal is to have the model self-check its output while streaming and return a blocked/error response if it violates any rules. Right now, however, streaming through the server isn't working properly. Could anyone share some ideas or examples? Thanks!
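For reference, this is roughly the kind of streaming call I'm trying to end up with, sketched with the Python `LLMRails.stream_async` API rather than the HTTP server (the config path and prompt here are placeholders, not my actual setup):

```python
import asyncio

from nemoguardrails import LLMRails, RailsConfig


async def main():
    # Load the guardrails config directory (config.yml + prompts.yml); the path is a placeholder.
    config = RailsConfig.from_path("./config")
    rails = LLMRails(config)

    # stream_async yields response chunks as they are generated; if an output rail
    # blocks the response, the stream should end with the refusal/error text instead
    # of the raw model output.
    async for chunk in rails.stream_async(
        messages=[{"role": "user", "content": "Hello, can you help me?"}]
    ):
        print(chunk, end="", flush=True)


asyncio.run(main())
```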
config:
prompts:
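For illustration, the shape I'm aiming for is roughly the following sketch (not my exact files; the engine, model name, `base_url`, and prompt wording are placeholders, and I'm assuming the llamacpp instance exposes an OpenAI-compatible endpoint):

```yaml
# config.yml -- sketch only; engine, model, and URL are placeholders
models:
  - type: main
    engine: openai                 # assuming llamacpp serves an OpenAI-compatible API
    model: local-llama             # placeholder model name
    parameters:
      base_url: http://localhost:8080/v1   # placeholder local server URL

rails:
  output:
    flows:
      - self check output          # built-in output self-check flow

streaming: True
```

```yaml
# prompts.yml -- sketch only; the policy wording is a placeholder
prompts:
  - task: self_check_output
    content: |
      Your task is to check if the bot message below complies with the policy.
      Policy: the bot must not produce harmful, abusive, or otherwise disallowed content.

      Bot message: "{{ bot_response }}"

      Should the message be blocked (Yes or No)?
      Answer:
```

From what I understand, `streaming: True` enables token streaming and the `self check output` flow runs the `self_check_output` prompt against the response, but I'm not sure how the two are supposed to interact when serving through `nemoguardrails server`, which is where things break for me.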