New evals + eval tagging #1015
base: master
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -24,6 +24,7 @@ tags: | |
| - logs | ||
| - kubernetes | ||
| - toolset-limitation | ||
| - loki | ||
|
|
||
| port_forwards: | ||
| - namespace: app-102 | ||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -19,3 +19,4 @@ tags: | |
| - logs | ||
| - transparency | ||
| - medium | ||
| - loki | ||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -9,3 +9,4 @@ tags: | |
| - traces | ||
| - newrelic | ||
| - network | ||
| - embeds | ||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -9,6 +9,7 @@ expected_output: | |
| tags: | ||
| - newrelic | ||
| - traces | ||
| - medium | ||
|
Comment on lines 9 to +12
🛠️ Refactor suggestion | 🟠 Major
🧩 Analysis chain
Verify that the "medium" tag is a declared pytest marker.
The "medium" tag is being added. Based on learnings, only pytest markers declared in pyproject.toml should be used. Run the following script to verify this tag is declared:
🏁 Script executed:
#!/bin/bash
# Description: Verify "medium" is a declared pytest marker
rg -A 1 'markers\s*=' pyproject.toml | grep -i medium || echo "⚠️ 'medium' marker not found in pyproject.toml"
Length of output: 46
Declare the "medium" pytest marker or remove its usage. |
||
|
|
||
| before_test: | | ||
| [ -n "${NEW_RELIC_ACCOUNT_ID:-}" ] && [ -n "${NEW_RELIC_API_KEY:-}" ] && [ -n "${NEW_RELIC_LICENSE_KEY:-}" ] || { for v in NEW_RELIC_ACCOUNT_ID NEW_RELIC_API_KEY NEW_RELIC_LICENSE_KEY; do [ -n "${!v:-}" ] || echo "Missing env var: $v"; done; exit 1; } | ||
|
|
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -7,6 +7,7 @@ expected_output: | |
| tags: | ||
| - prometheus | ||
| - easy | ||
| - metrics | ||
| - no-cicd | ||
|
|
||
| include_tool_calls: true | ||
|
|
||
| Original file line number | Diff line number | Diff line change | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| @@ -1,10 +1,13 @@ | ||||||||||||
| user_prompt: "Give me a graph of `container_memory_working_set_bytes`" | ||||||||||||
| expected_output: | ||||||||||||
| - 'The answer should include something similar to << { "type": "promql", "tool_name": "execute_prometheus_range_query", "random_key": "<alphanumeric_characters>" } >>. Both tool_name and random_key MUST absolutely be present and the same as mentioned here.' | ||||||||||||
|
|
||||||||||||
| tags: | ||||||||||||
| - prometheus | ||||||||||||
| evaluation: | ||||||||||||
| correctness: 1 | ||||||||||||
| - embeds | ||||||||||||
| - metrics | ||||||||||||
| - medium | ||||||||||||
|
|
||||||||||||
|
Comment on lines 4 to +10
🧩 Analysis chain
Verify that all tags are declared pytest markers.
The tags […] Based on learnings. Run the following script to verify these markers are declared:
🏁 Script executed:
#!/bin/bash
# Description: Check if the tags used in this test fixture are declared as pytest markers in pyproject.toml
# Search for pytest marker declarations
rg -A 20 '^\[tool\.pytest\.ini_options\]|^markers\s*=' pyproject.toml
Length of output: 1742
🏁 Script executed:
#!/bin/bash
# Show lines around port_forwards in the YAML file
rg -n 'port_forwards' -C3 tests/llm/fixtures/test_ask_holmes/34_memory_graph/test_case.yaml
Length of output: 152
Fix undeclared pytest markers and namespace |
||||||||||||
| port_forwards: | ||||||||||||
| - namespace: default | ||||||||||||
|
Use a dedicated namespace following the required naming convention.
The namespace […] As per coding guidelines.
Apply this diff to fix the namespace:
- - namespace: default
+ - namespace: app-34_memory_graph
service: robusta-kube-prometheus-st-prometheus
local_port: 9090
remote_port: 9090 |
||||||||||||
| service: robusta-kube-prometheus-st-prometheus | ||||||||||||
|
|
||||||||||||
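One caveat on the namespace suggested in the comment above: Kubernetes namespace names must be valid RFC 1123 DNS labels (lowercase alphanumerics and `-`, at most 63 characters, alphanumeric at both ends), so a name containing underscores such as `app-34_memory_graph` would be rejected by the API server. Whether the project's convention intends a hyphenated form is an assumption. A minimal sketch of the check, with a hypothetical function name, might look like this:

```python
import re

# RFC 1123 DNS label: lowercase alphanumerics or '-', alphanumeric at both ends.
_DNS_LABEL = re.compile(r"[a-z0-9]([-a-z0-9]*[a-z0-9])?")
MAX_LABEL_LENGTH = 63


def is_valid_namespace(name: str) -> bool:
    """Return True if `name` is usable as a Kubernetes namespace name."""
    return len(name) <= MAX_LABEL_LENGTH and _DNS_LABEL.fullmatch(name) is not None


if __name__ == "__main__":
    # The first three names appear in the diff and review comments above;
    # the last is a hypothetical hyphenated variant.
    for candidate in ("default", "app-102", "app-34_memory_graph", "app-34-memory-graph"):
        verdict = "valid" if is_valid_namespace(candidate) else "invalid"
        print(f"{candidate}: {verdict}")
```

Running the validator over `port_forwards` entries before applying a suggested change would catch names the cluster cannot create.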
🧩 Analysis chain
Verify all tags are declared pytest markers.
The test adds four new tags: datadog, metrics, medium, and embeds. While embeds is confirmed as a new marker in this PR, ensure all other tags (datadog, metrics, medium) are properly declared in pyproject.toml.
Based on learnings: Only use pytest markers that are declared in pyproject.toml.
🏁 Script executed:
Length of output: 1836
Add missing pytest markers datadog and embeds
Add both datadog and embeds to the markers list under [tool.pytest.ini_options] in pyproject.toml so all tags used in tests are declared.
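
The recurring request in these comments, that every tag in a test case be a declared marker, can be sketched as a small check over the fixture text. This is a minimal illustration only: it uses a deliberately simple line-based reader for the `tags:` list instead of a YAML library, the function names are hypothetical, and the set of declared markers passed in is an assumption, not the repository's actual list.

```python
def extract_tags(yaml_text: str) -> list[str]:
    """Collect the items of the `tags:` list from test_case.yaml-style text."""
    tags: list[str] = []
    in_tags = False
    for raw in yaml_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # blank lines and comments do not end the list
        if line == "tags:":
            in_tags = True
        elif in_tags and line.startswith("- "):
            tags.append(line[2:].strip().strip("'\""))
        else:
            in_tags = False  # any other key closes the tags block
    return tags


def undeclared_tags(yaml_text: str, declared: set[str]) -> list[str]:
    """Return the tags that are not in the declared marker set, sorted."""
    return sorted(set(extract_tags(yaml_text)) - declared)


# Fragment modelled on the first diff hunk in this PR.
SAMPLE = """
tags:
  - logs
  - kubernetes
  - toolset-limitation
  - loki
port_forwards:
  - namespace: app-102
"""

if __name__ == "__main__":
    # Hypothetical declared markers, for illustration only.
    print(undeclared_tags(SAMPLE, {"logs", "kubernetes"}))
    # prints ['loki', 'toolset-limitation']
```

A real check would more likely load the fixtures with a YAML parser and take the declared set from `pyproject.toml`. The line-based reader here only keeps the sketch free of dependencies.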