Replies: 3 comments
-
Seems like could be done on Modal Sandbox |
Beta Was this translation helpful? Give feedback.
0 replies
-
Here's a PoC for what's described above! Still some details to iron out namely prompting format, but the foundations are there: #379 @Muhtasham internally this uses a SMT solver so we went the docker route and deployed on Cloud Run. |
Beta Was this translation helpful? Give feedback.
0 replies
-
.. regardless of where such an API runs, I was more wondering about the "openness" requirement. I.e if all data should be public upfront in order to be used for |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Thinking of verifiable reward models: we have a program verifier that would accept Python program strings as well as pre-and post-condition predicates, and return execution traces as well as counterexamples.
We are exploring the possibility of opening up this functionality as a REST API, as a simpler integration path than reimplementing this functionality in Python for the open-r1 repo.
Questions:
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions