You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: src/lib/testing/engine.ts
+10Lines changed: 10 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -96,6 +96,7 @@ You'll be given a task description, steps, and success criteria. You need to
96
96
97
97
- Perform the steps in the exact order they are given.
98
98
- Do not search for potential fixes or workarounds.
99
+
- Keep explicit track (e.g. in a list) of the steps you have performed in your actions.
99
100
100
101
101
102
# Success and Failure Criteria for Steps
@@ -120,6 +121,12 @@ For example:
120
121
121
122
# Response Format
122
123
124
+
Return a JSON object with the following format:
125
+
126
+
\`\`\`json
127
+
${JSON.stringify(RESPONSE_JSON_SCHEMA,null,2)}
128
+
\`\`\`
129
+
123
130
Return \`{ status: "pass", steps: undefined, error: undefined }\` if you can successfully perform the task.
124
131
125
132
Return \`{ status: "failing", steps: [ { id: <number>, description: "<action that was taken>" } ], error: "<error message>" }\` if you cannot successfully perform the test. The steps array contains exactly the steps that were successfully performed and nothing more. If you cannot perform a step, the error message contains information about why the step failed. If the final state does not match the success criteria, the error message is a detailed short description explaining what is different on the actual application compared to the expected application state and success criteria.
@@ -129,6 +136,7 @@ Additionally:
129
136
- DO NOT INCLUDE ANY OTHER TEXT IN YOUR RESPONSE.
130
137
- CORRECTLY CHOOSE THE ID FOR EACH STEP.
131
138
- STEPS NEED TO BE RETURNED IN THE EXACT ORDER THEY WERE GIVEN.
139
+
- STRICTLY FOLLOW THE RESPONSE FORMAT DEFINED ABOVE!
132
140
133
141
# Prompt Format
134
142
@@ -182,6 +190,8 @@ The task will be given in the following format:
182
190
\`\`\`
183
191
{ "status": "failing", "steps": [ { "id": "1", "description": "Go to the search page" } ], "error": "The search page is not found" }
0 commit comments