chore: add final chat step of llm challenge interaction to pipeline #38
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
chore: add final chat step of llm challenge interaction to pipeline
closes #21
Key Changes:
the challenge was terminating before the model could respond to the flag discovery.
Root Cause: flag detection was happening after code execution but before the model could process the execution results in the next conversation turn.
New Flow:
submit_flag()now the model will have a full conversation turn to process the flag discovery:
Added:
Changed:
Removed:
double-checked this by also submitting the flag on behalf of the user account (dynamically unique and linked to that account)
Generated Summary:
This summary was generated with ❤️ by rigging