Replies: 2 comments
-
We kind of need human-supervised research programs, since not all data on Kaggle is clean. GIGO (garbage in, garbage out) can then creep in not at the "bad idea" level but at the "bad data" level. Coding tasks are easier to self-correct than data science in this regard: SakanaAI/AI-Scientist-v2#2 (comment)
-
To lower friction, this tool seemed useful: https://github.com/SWE-agent/mini-SWE-agent
-
There are a lot of papers similar to Absolute Zero, and I do think that having the same LLM split into two different roles and duel itself is a good idea. However, these approaches are often stuck on a single task type. Also, not all self-play has to be adversarial; one side could suggest fun things to do while the other has to discover something, anything.
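To make the non-adversarial variant concrete, here is a rough sketch of one model playing two cooperating roles. It is only an illustration: `toy_model`, `self_play_round`, and `run` are hypothetical names, the "model" is a stub that invents small addition tasks instead of calling an LLM, and scoring by novelty is my own assumption, not something taken from Absolute Zero or OpenEvolve.

```python
import random
from typing import Callable, Dict, List, Set

# Hypothetical stand-in for an LLM call: (role, prompt) -> text.
Model = Callable[[str, str], str]


def toy_model(role: str, prompt: str) -> str:
    """Toy stub so the sketch runs without any real LLM."""
    if role == "proposer":
        # The proposer suggests a small task (here: an addition problem).
        a, b = random.randint(1, 9), random.randint(1, 9)
        return f"{a}+{b}"
    # The solver works on whatever the proposer suggested.
    a, b = prompt.split("+")
    return str(int(a) + int(b))


def self_play_round(model: Model, seen: Set[str]) -> Dict[str, object]:
    """One cooperative round: the same model is called in two roles."""
    task = model("proposer", "Suggest something fun to try.")
    finding = model("solver", task)
    # Assumption: a finding counts as interesting if it has not been seen yet.
    novel = finding not in seen
    seen.add(finding)
    return {"task": task, "finding": finding, "novel": novel}


def run(model: Model, rounds: int = 5, seed: int = 0) -> List[Dict[str, object]]:
    """Run several rounds and return the log of tasks and findings."""
    random.seed(seed)
    seen: Set[str] = set()
    return [self_play_round(model, seen) for _ in range(rounds)]


if __name__ == "__main__":
    for entry in run(toy_model):
        print(entry)
```

With a real LLM behind `Model`, the proposer prompt could ask for different task types each round, which is one way to avoid getting stuck on a single task type.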
List of stuff with the same Self-Play theme
In the context of OpenEvolve, there are a few things to consider:
Cross-reference to simulated open world training with a focus on "interestingness" jennyzzt/omni#2 (comment)