
Conversation

@RoiGlinik (Contributor)

No description provided.


coderabbitai bot commented Oct 23, 2025

Walkthrough

Refactors Robusta toolset fetch APIs from change-history to issue-focused _fetch_issues, adds a new resource-scoped tool FetchResourceIssuesMetadata (requires namespace and workload), and extends mocks, fixtures, and toolset loading to support issues metadata retrieval in tests.

Changes

Cohort / File(s) Summary
Robusta toolset changes
holmes/plugins/toolsets/robusta/robusta.py
Replaced _fetch_change_history usage with _fetch_issues, updated _invoke call sites and external fetch path; added FetchResourceIssuesMetadata class (constructor, _fetch_issues override, get_parameterized_one_liner), requires namespace and workload parameters, and registered it in RobustaToolset.tools.
Test fixtures (new)
tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/issues_metadata.json, tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/test_case.yaml, tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/toolsets.yaml
Added fixture JSON with 4 issue records, a server test case asserting an OOMKilled pod reason via resource-scoped issue fetch, and a toolsets YAML enabling kubernetes/core, robusta, and kubernetes/logs.
Mock DAL updates
tests/llm/utils/mock_dal.py
MockSupabaseDal gains issues_metadata constructor parameter and a new get_issues_metadata(...) method that filters by date range, finding_type, cluster (with default), workload (subject_name) and ns (subject_namespace); loads issues_metadata.json during mock load; added imports (List, FindingType, datetime utilities).
Mock toolset loading
tests/llm/utils/mock_toolset.py
Toolset initialization now builds/passes a mock DAL into builtin toolset loading (via load_mock_dal) so toolsets can use mock issues data; adjusted initialization flow to support mock-backed toolsets and Runbook toolset construction with mocks.
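The filtering described above for MockSupabaseDal.get_issues_metadata can be sketched roughly as follows. This is a hedged illustration, not the actual implementation: the field names (creation_date, subject_name, subject_namespace) and the defaults are assumptions inferred from this summary.

```python
from datetime import datetime, timezone
from typing import Optional


def _to_utc(dt_str: str) -> datetime:
    # Parse ISO-8601 (with or without a trailing "Z") and normalize to aware UTC.
    dt = datetime.fromisoformat(dt_str.replace("Z", "+00:00"))
    return dt.replace(tzinfo=timezone.utc) if dt.tzinfo is None else dt.astimezone(timezone.utc)


def filter_issues(
    issues: list,
    start: str,
    end: str,
    workload: Optional[str] = None,
    ns: Optional[str] = None,
    limit: int = 100,
) -> list:
    # Keep issues whose creation_date falls within [start, end] and whose
    # subject fields match the optional workload/namespace filters.
    start_dt, end_dt = _to_utc(start), _to_utc(end)
    matched = [
        i
        for i in issues
        if start_dt <= _to_utc(i["creation_date"]) <= end_dt
        and (workload is None or i.get("subject_name") == workload)
        and (ns is None or i.get("subject_namespace") == ns)
    ]
    return matched[:limit]


issues = [
    {"creation_date": "2025-10-19 10:59:27.643325", "subject_name": "oom-pod", "subject_namespace": "app-18"},
    {"creation_date": "2025-10-20 12:00:00", "subject_name": "other-pod", "subject_namespace": "default"},
]
print(filter_issues(issues, "2025-10-19T00:00:00Z", "2025-10-19T23:59:59Z", workload="oom-pod"))
```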

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant User as User/Test
  participant Toolset as RobustaToolset
  participant Tool as FetchResourceIssuesMetadata
  participant DAL as MockSupabaseDal / SupabaseDal

  Note over User,Toolset: resource-scoped issue fetch request (namespace, workload, time window)
  User->>Toolset: fetch_resource_issues(params)
  Toolset->>Tool: dispatch to _fetch_issues(params)
  Tool->>DAL: get_issues_metadata(start_datetime, end_datetime, limit, workload, ns, cluster, finding_type)
  alt DAL has fixture data
    DAL-->>Tool: filtered issue metadata list
  else no fixture / empty
    DAL-->>Tool: None or []
  end
  Tool-->>Toolset: metadata + parameterized one-liner
  Toolset-->>User: aggregated response

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

  • Focus areas:
    • holmes/plugins/toolsets/robusta/robusta.py: method rename consistency, tool registration, FindingType usage, parameter metadata correctness.
    • tests/llm/utils/mock_dal.py: date parsing, filtering semantics (cluster defaulting, workload/ns mapping), limit and sanitization.
    • tests/llm/utils/mock_toolset.py: safe integration of mock_dal into toolset loading and modes (LIVE/GENERATE).
    • Fixtures vs. test expectations: timestamps and fields alignment.

Possibly related PRs

Suggested reviewers

  • moshemorad
  • aantn

Pre-merge checks and finishing touches

❌ Failed checks (1 warning, 1 inconclusive)
Check name Status Explanation Resolution
Docstring Coverage ⚠️ Warning Docstring coverage is 14.29% which is insufficient. The required threshold is 80.00%. You can run @coderabbitai generate docstrings to improve docstring coverage.
Description Check ❓ Inconclusive No pull request description was provided by the author. With no content to evaluate, the check cannot determine whether a description would relate to the changeset. Consider adding a brief description; at a minimum, a short note on the new FetchResourceIssuesMetadata functionality and its testing infrastructure would help reviewers understand the changeset's scope and intent.
✅ Passed checks (1 passed)
Check name Status Explanation
Title Check ✅ Passed The title "ROB-2290 workload issues testing" is clearly related to the primary change: a new FetchResourceIssuesMetadata class that enables resource-level (namespace/workload) issue retrieval in the Robusta toolset, together with its test fixtures and mock DAL updates. The title is concise, avoids unnecessary noise, and accurately captures the changeset's primary objective.
✨ Finishing touches
  • 📝 Generate docstrings
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch ROB-2290-workload-issues-testing

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.


Comment @coderabbitai help to get the list of available commands and usage tips.

@coderabbitai bot left a comment

Actionable comments posted: 4

🧹 Nitpick comments (6)
holmes/plugins/toolsets/robusta/robusta_instructions.jinja2 (1)

6-7: Tighten language and capitalization for new guidance bullets.

Minor clarity/grammar tweaks; no behavior change.

-* When investigating a resource (pod, deployment, or job), if no relevant information is available from the live cluster at the time of investigation, call the fetch_resource_issues_metadata function to retrieve its historical alert data.
-* You can use fetch_resource_issues_metadata to get issues context for a specific kubernetes resource. Start with a 4 hours window and try to expand to 24 hours windows if nothing comes up.
+* When investigating a resource (Pod, Deployment, or Job), if no relevant information is available from the live cluster, call fetch_resource_issues_metadata to retrieve its historical alert data.
+* Use fetch_resource_issues_metadata to get issue context for a specific Kubernetes resource. Start with a 4‑hour window and expand to a 24‑hour window if nothing surfaces.
tests/llm/utils/mock_toolset.py (1)

649-655: Pass DAL by keyword for clarity; behavior unchanged.

Small readability tweak to avoid confusion if the function signature changes.

-        builtin_toolsets = load_builtin_toolsets(mock_dal)
+        builtin_toolsets = load_builtin_toolsets(dal=mock_dal)
tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/issues_metadata.json (1)

1-97: Normalize creation_date to RFC3339 with timezone to avoid parsing errors.

Mock DAL currently parses creation_date as tz‑aware; naive timestamps can break comparisons. Recommend adding Z to creation_date values.

-    "creation_date": "2025-10-19 10:59:27.643325",
+    "creation_date": "2025-10-19 10:59:27.643325Z",
@@
-    "creation_date": "2025-10-19 10:59:56.536309",
+    "creation_date": "2025-10-19 10:59:56.536309Z",
@@
-    "creation_date": "2025-10-19 10:59:56.536309",
+    "creation_date": "2025-10-19 10:59:56.536309Z",

Optional: align service_key/labels with subject_name for consistency (not functionally required).
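A minimal demonstration of why the normalization above matters, assuming the mock DAL compares fixture timestamps against tz-aware window bounds:

```python
from datetime import datetime

# datetime.fromisoformat in Python < 3.11 rejects a literal trailing "Z",
# so the "Z" is replaced with the equivalent "+00:00" offset before parsing.
aware = datetime.fromisoformat("2025-10-19 10:59:27.643325Z".replace("Z", "+00:00"))
naive = datetime.fromisoformat("2025-10-19 10:59:27.643325")

# Comparing an aware datetime to a naive one raises TypeError, which is
# how an un-normalized fixture timestamp can break the mock DAL's filter.
try:
    _ = aware < naive
    mixed_comparison_ok = True
except TypeError:
    mixed_comparison_ok = False
print(mixed_comparison_ok)  # False
```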

tests/llm/utils/mock_dal.py (1)

136-147: Preserve JSON shape when generating mocks.

When data is an empty list, json.dumps(data or {}) writes {}. Prefer writing [] for lists and {} for dicts.

-                with open(file_path, "w") as f:
-                    f.write(json.dumps(data or {}, indent=2))
+                with open(file_path, "w") as f:
+                    payload = data if data is not None else []
+                    f.write(json.dumps(payload, indent=2))

Apply similarly in get_issue_data/resource_instructions writers.

holmes/core/supabase_dal.py (1)

246-255: Update debug string to reflect API rename.

Verification confirmed no stale references to the old get_configuration_changes_metadata API—all call sites properly use get_issues_metadata. However, the debug string at line 295 still says "Change history metadata" and should be updated to "Issues metadata" for consistency with the renamed method.

holmes/plugins/toolsets/robusta/robusta.py (1)

308-309: Consider consistent capitalization for uniformity.

The one-liner starts with lowercase "fetch" while other tools use "Fetch" (line 83), "Check" (line 147), or "Search" (lines 251, 289). Consider using "Fetch" for consistency.

Apply this diff:

     def get_parameterized_one_liner(self, params: Dict) -> str:
-        return f"Robusta: fetch resource issues metadata {params}"
+        return f"Robusta: Fetch Resource Issues Metadata {params}"
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2fa86b9 and 45ff238.

📒 Files selected for processing (9)
  • holmes/core/supabase_dal.py (4 hunks)
  • holmes/plugins/toolsets/robusta/robusta.py (9 hunks)
  • holmes/plugins/toolsets/robusta/robusta_instructions.jinja2 (1 hunks)
  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/issues_metadata.json (1 hunks)
  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/test_case.yaml (1 hunks)
  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/toolsets.yaml (1 hunks)
  • tests/llm/fixtures/test_ask_holmes/93_calling_datadog/toolsets.yaml (0 hunks)
  • tests/llm/utils/mock_dal.py (4 hunks)
  • tests/llm/utils/mock_toolset.py (2 hunks)
💤 Files with no reviewable changes (1)
  • tests/llm/fixtures/test_ask_holmes/93_calling_datadog/toolsets.yaml
🧰 Additional context used
📓 Path-based instructions (8)
**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: Use Ruff for formatting and linting (configured in pyproject.toml) for all Python code
Type hints are required; code should pass mypy (configured in pyproject.toml)
ALWAYS place Python imports at the top of the file, not inside functions or methods

Files:

  • holmes/core/supabase_dal.py
  • tests/llm/utils/mock_toolset.py
  • tests/llm/utils/mock_dal.py
  • holmes/plugins/toolsets/robusta/robusta.py
holmes/plugins/toolsets/**/*

📄 CodeRabbit inference engine (CLAUDE.md)

Toolsets must be located as holmes/plugins/toolsets/{name}.yaml or holmes/plugins/toolsets/{name}/

Files:

  • holmes/plugins/toolsets/robusta/robusta_instructions.jinja2
  • holmes/plugins/toolsets/robusta/robusta.py
tests/**

📄 CodeRabbit inference engine (CLAUDE.md)

Test files should mirror the source structure under tests/

Files:

  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/issues_metadata.json
  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/test_case.yaml
  • tests/llm/utils/mock_toolset.py
  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/toolsets.yaml
  • tests/llm/utils/mock_dal.py
tests/llm/**/*.yaml

📄 CodeRabbit inference engine (CLAUDE.md)

tests/llm/**/*.yaml: In eval manifests, ALWAYS use Kubernetes Secrets for scripts rather than inline manifests or ConfigMaps to prevent script exposure via kubectl describe
Eval resources must use neutral names, each test must use a dedicated namespace 'app-', and all pod names must be unique across tests

Files:

  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/test_case.yaml
  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/toolsets.yaml
tests/**/test_case.yaml

📄 CodeRabbit inference engine (CLAUDE.md)

Do NOT put toolset configuration directly in test_case.yaml; keep toolset config in a separate toolsets.yaml

Files:

  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/test_case.yaml
tests/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

Only use pytest markers that are defined in pyproject.toml; never introduce undefined markers/tags

Files:

  • tests/llm/utils/mock_toolset.py
  • tests/llm/utils/mock_dal.py
tests/**/toolsets.yaml

📄 CodeRabbit inference engine (CLAUDE.md)

In eval toolsets.yaml, ALL toolset-specific configuration must be under a 'config' field; do not place toolset config at the top level

Files:

  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/toolsets.yaml
{holmes/plugins/toolsets/**/*.{yaml,yml},tests/**/toolsets.yaml}

📄 CodeRabbit inference engine (CLAUDE.md)

Only the following top-level fields are valid in toolset YAML: enabled, name, description, additional_instructions, prerequisites, tools, docs_url, icon_url, installation_instructions, config, url (MCP toolsets only)

Files:

  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/toolsets.yaml
🧠 Learnings (1)
📚 Learning: 2025-08-05T00:42:23.792Z
Learnt from: vishiy
PR: robusta-dev/holmesgpt#782
File: config.example.yaml:31-49
Timestamp: 2025-08-05T00:42:23.792Z
Learning: In robusta-dev/holmesgpt config.example.yaml, the azuremonitorlogs toolset configuration shows "enabled: true" as an example of how to enable the toolset, not as a default setting. The toolset is disabled by default and requires explicit enablement in user configurations.

Applied to files:

  • tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/toolsets.yaml
🧬 Code graph analysis (4)
holmes/core/supabase_dal.py (1)
tests/llm/utils/mock_dal.py (1)
  • get_issues_metadata (83-149)
tests/llm/utils/mock_toolset.py (2)
tests/llm/utils/mock_dal.py (1)
  • load_mock_dal (155-181)
holmes/plugins/toolsets/__init__.py (1)
  • load_builtin_toolsets (121-145)
tests/llm/utils/mock_dal.py (2)
holmes/core/supabase_dal.py (2)
  • FindingType (57-59)
  • get_issues_metadata (246-303)
tests/llm/utils/test_case_utils.py (1)
  • read_file (50-52)
holmes/plugins/toolsets/robusta/robusta.py (2)
holmes/core/supabase_dal.py (2)
  • FindingType (57-59)
  • get_issues_metadata (246-303)
tests/llm/utils/mock_dal.py (1)
  • get_issues_metadata (83-149)
🪛 Ruff (0.14.1)
tests/llm/utils/mock_dal.py

26-26: Do not catch blind exception: Exception

(BLE001)

holmes/plugins/toolsets/robusta/robusta.py

226-226: Unused method argument: context

(ARG002)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (5)
  • GitHub Check: build (3.12)
  • GitHub Check: build (3.11)
  • GitHub Check: build (3.10)
  • GitHub Check: llm_evals
  • GitHub Check: build
🔇 Additional comments (10)
tests/llm/utils/mock_toolset.py (1)

13-15: Imports placement LGTM.

Top-level imports for load_mock_dal and Path follow our guidelines.

holmes/core/supabase_dal.py (1)

57-60: Enum introduction LGTM.

FindingType with string values is appropriate for PostgREST filtering.
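A sketch of what such a string-valued enum looks like; the member values shown here are assumptions for illustration, not necessarily the ones in holmes/core/supabase_dal.py:

```python
from enum import Enum


class FindingType(str, Enum):
    # Mixing in str lets the member be used wherever a plain string
    # filter value is expected (e.g., in a PostgREST query).
    ISSUE = "issue"
    CONFIGURATION_CHANGE = "configuration_change"


# Building a PostgREST-style equality filter from the enum value:
print(f"finding_type=eq.{FindingType.ISSUE.value}")  # finding_type=eq.issue
```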

tests/llm/utils/mock_dal.py (1)

161-165: Mock loader additions LGTM.

Reading issues_metadata.json and plumb-through to MockSupabaseDal is correct.

Also applies to: 175-181

tests/llm/fixtures/test_ask_holmes/18_oom_kill_from_issues_history/toolsets.yaml (1)

1-7: Toolset structure is valid; kubernetes/logs is always available.

The top-level structure and use of enabled: true are correct for eval toolsets.yaml per coding guidelines. The concern about USE_LEGACY_KUBERNETES_LOGS gating availability of kubernetes/logs is incorrect: when USE_LEGACY_KUBERNETES_LOGS is False (the default), the toolset is loaded from the KubernetesLogsToolset class; when True, it is loaded from kubernetes_logs.yaml. Either way, kubernetes/logs remains available as a builtin toolset.

Likely an incorrect or invalid review comment.

holmes/plugins/toolsets/robusta/robusta.py (6)

7-7: LGTM!

The FindingType import is correctly added and properly used throughout the file to support the new parameterized finding type functionality.


205-223: Well-refactored generalization!

The rename from _fetch_change_history to _fetch_issues with the added finding_type parameter effectively generalizes the method to support multiple finding types while maintaining backward compatibility through the default value.


238-238: Good improvement for subclass reusability.

Using self.name instead of a hardcoded string makes the error message adapt automatically to subclasses, improving maintainability.


254-264: LGTM!

The updated description using "specific kubernetes resource" is more accurate than the previous "specific workload" terminology.


285-286: Implementation is correct.

The override correctly hardcodes cluster="external" for external configuration changes. The # type: ignore suppresses the type checker's complaint about signature narrowing, which is intentional here for this internal method.


323-328: LGTM!

The new FetchResourceIssuesMetadata tool is correctly registered in the toolset, following the same pattern as existing tools.

@coderabbitai bot left a comment

Actionable comments posted: 1

🧹 Nitpick comments (1)
holmes/plugins/toolsets/robusta/robusta.py (1)

285-286: Method override uses narrower signature.

The # type: ignore comment suppresses a mypy warning about the method signature being narrower than the parent's. While this works at runtime, consider whether the parent method signature could be made more flexible to avoid needing type ignores.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 45ff238 and 322b1eb.

📒 Files selected for processing (1)
  • holmes/plugins/toolsets/robusta/robusta.py (9 hunks)
🧰 Additional context used
📓 Path-based instructions (2)
**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: Use Ruff for formatting and linting (configured in pyproject.toml) for all Python code
Type hints are required; code should pass mypy (configured in pyproject.toml)
ALWAYS place Python imports at the top of the file, not inside functions or methods

Files:

  • holmes/plugins/toolsets/robusta/robusta.py
holmes/plugins/toolsets/**/*

📄 CodeRabbit inference engine (CLAUDE.md)

Toolsets must be located as holmes/plugins/toolsets/{name}.yaml or holmes/plugins/toolsets/{name}/

Files:

  • holmes/plugins/toolsets/robusta/robusta.py
🧬 Code graph analysis (1)
holmes/plugins/toolsets/robusta/robusta.py (3)
holmes/core/supabase_dal.py (3)
  • SupabaseDal (70-603)
  • FindingType (57-59)
  • get_issues_metadata (246-303)
tests/llm/utils/mock_dal.py (1)
  • get_issues_metadata (83-149)
holmes/core/tools.py (3)
  • ToolInvokeContext (164-170)
  • StructuredToolResult (79-103)
  • ToolParameter (155-161)
🪛 Ruff (0.14.1)
holmes/plugins/toolsets/robusta/robusta.py

226-226: Unused method argument: context

(ARG002)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: Pre-commit checks
  • GitHub Check: llm_evals
  • GitHub Check: build
🔇 Additional comments (5)
holmes/plugins/toolsets/robusta/robusta.py (5)

7-7: LGTM: Import added correctly.

The FindingType enum import is properly added and aligns with its usage throughout the file.


205-223: LGTM: Method refactoring is well-structured.

The rename from _fetch_change_history to _fetch_issues with the added finding_type parameter generalizes the functionality appropriately. The default value FindingType.CONFIGURATION_CHANGE maintains backward compatibility.


226-248: LGTM: Improved message genericity.

The update to use self.name in the no-data message (line 238) makes the method more reusable across subclasses.

Note: The static analysis hint about unused context parameter is a false positive—it's required by the base class signature.


319-320: Method override pattern consistent with external changes tool.

The override correctly passes FindingType.ISSUE to fetch issues instead of configuration changes. The # type: ignore usage is consistent with the pattern in FetchExternalConfigurationChangesMetadata.


337-342: LGTM: New tool properly integrated.

The FetchResourceIssuesMetadata tool is correctly instantiated and added to the toolset's tools list.

@coderabbitai bot left a comment

Actionable comments posted: 1

🧹 Nitpick comments (5)
holmes/plugins/toolsets/robusta/robusta.py (5)

205-224: Harden _fetch_issues: key constants, limit validation, and precise typing

Use constants for param keys to avoid drift, validate/normalize limit (avoid treating 0 as “missing” unless desired), and tighten types.

-    def _fetch_issues(
-        self,
-        params: Dict,
-        cluster: Optional[str] = None,
-        finding_type: FindingType = FindingType.CONFIGURATION_CHANGE,
-    ) -> Optional[List[Dict]]:
+    def _fetch_issues(
+        self,
+        params: Dict[str, Any],
+        cluster: Optional[str] = None,
+        finding_type: FindingType = FindingType.CONFIGURATION_CHANGE,
+    ) -> Optional[List[Dict[str, Any]]]:
         if self._dal and self._dal.enabled:
-            return self._dal.get_issues_metadata(
-                start_datetime=params["start_datetime"],
-                end_datetime=params["end_datetime"],
-                limit=min(
-                    params.get("limit") or DEFAULT_LIMIT_CHANGE_ROWS,
-                    MAX_LIMIT_CHANGE_ROWS,
-                ),
-                ns=params.get("namespace"),
-                workload=params.get("workload"),
-                cluster=cluster,
-                finding_type=finding_type,
-            )
+            # normalize and bound limit
+            raw_limit = params.get("limit", DEFAULT_LIMIT_CHANGE_ROWS)
+            try:
+                limit_val = int(raw_limit)
+            except (TypeError, ValueError):
+                limit_val = DEFAULT_LIMIT_CHANGE_ROWS
+            limit_val = max(1, min(limit_val, MAX_LIMIT_CHANGE_ROWS))
+
+            return self._dal.get_issues_metadata(
+                start_datetime=params[START_TIME],
+                end_datetime=params[END_TIME],
+                limit=limit_val,
+                ns=params.get(NAMESPACE),
+                workload=params.get(WORKLOAD),
+                cluster=cluster,
+                finding_type=finding_type,
+            )
         return None

226-249: Neutral naming/messages and stronger typing in _invoke

Make this base path wording generic (works for changes and issues) and tighten the params type.

-    def _invoke(self, params: dict, context: ToolInvokeContext) -> StructuredToolResult:
+    def _invoke(self, params: Dict[str, Any], context: ToolInvokeContext) -> StructuredToolResult:
         try:
-            changes = self._fetch_issues(params)
-            if changes:
+            items = self._fetch_issues(params)
+            if items:
                 return StructuredToolResult(
                     status=StructuredToolResultStatus.SUCCESS,
-                    data=changes,
+                    data=items,
                     params=params,
                 )
             else:
                 return StructuredToolResult(
                     status=StructuredToolResultStatus.NO_DATA,
-                    data=f"{self.name} found no data. {params}",
+                    data=f"{self.name} found no results. {params}",
                     params=params,
                 )
         except Exception as e:
-            msg = f"There was an internal error while fetching changes for {params}. {str(e)}"
+            msg = f"There was an internal error while fetching results for {params}. {str(e)}"
             logging.exception(msg)
             return StructuredToolResult(
                 status=StructuredToolResultStatus.ERROR,
                 data=msg,
                 params=params,
             )

285-286: Remove type: ignore by matching base signature

Align the override with the base method signature and pass through finding_type. This avoids suppressions and keeps mypy happy.

-    def _fetch_issues(self, params: Dict) -> Optional[List[Dict]]:  # type: ignore
-        return super()._fetch_issues(params, cluster="external")
+    def _fetch_issues(
+        self,
+        params: Dict[str, Any],
+        cluster: Optional[str] = None,
+        finding_type: FindingType = FindingType.CONFIGURATION_CHANGE,
+    ) -> Optional[List[Dict[str, Any]]]:
+        return super()._fetch_issues(
+            params,
+            cluster="external",
+            finding_type=finding_type,
+        )

302-316: Use key constants for parameters to avoid drift

You already define NAMESPACE/WORKLOAD constants; use them here for consistency and safety.

-        self.parameters.update(
+        self.parameters.update(
             {
-                "namespace": ToolParameter(
+                NAMESPACE: ToolParameter(
                     description="The Kubernetes namespace name for filtering issues and alerts",
                     type="string",
                     required=True,
                 ),
-                "workload": ToolParameter(
+                WORKLOAD: ToolParameter(
                     description="Kubernetes resource name to filter issues and alerts (e.g., Pod, Deployment, Job, etc.). Must be the full name. For Pods, include the exact generated suffix.",
                     type="string",
                     required=True,
                 ),
             }
         )

319-320: Align override signature and drop suppression

Match the base signature and remove # type: ignore; fix default to ISSUE to reflect the tool’s purpose.

-    def _fetch_issues(self, params: Dict) -> Optional[List[Dict]]:  # type: ignore
-        return super()._fetch_issues(params, finding_type=FindingType.ISSUE)
+    def _fetch_issues(
+        self,
+        params: Dict[str, Any],
+        cluster: Optional[str] = None,
+        finding_type: FindingType = FindingType.ISSUE,
+    ) -> Optional[List[Dict[str, Any]]]:
+        return super()._fetch_issues(
+            params,
+            cluster=cluster,
+            finding_type=FindingType.ISSUE,
+        )
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 322b1eb and d7c338c.

📒 Files selected for processing (1)
  • holmes/plugins/toolsets/robusta/robusta.py (4 hunks)
🧰 Additional context used
📓 Path-based instructions (2)
**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: Use Ruff for formatting and linting (configured in pyproject.toml) for all Python code
Type hints are required; code should pass mypy (configured in pyproject.toml)
ALWAYS place Python imports at the top of the file, not inside functions or methods

Files:

  • holmes/plugins/toolsets/robusta/robusta.py
holmes/plugins/toolsets/**/*

📄 CodeRabbit inference engine (CLAUDE.md)

Toolsets must be located as holmes/plugins/toolsets/{name}.yaml or holmes/plugins/toolsets/{name}/

Files:

  • holmes/plugins/toolsets/robusta/robusta.py
🧬 Code graph analysis (1)
holmes/plugins/toolsets/robusta/robusta.py (2)
holmes/core/tools.py (1)
  • ToolParameter (155-161)
holmes/core/supabase_dal.py (1)
  • FindingType (57-59)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: llm_evals
  • GitHub Check: Pre-commit checks
  • GitHub Check: build

@coderabbitai bot left a comment

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
tests/llm/utils/mock_dal.py (1)

63-76: Bug: get_resource_instructions returns None when generate_mocks is False.

Return is inside the if-block, so the method returns None otherwise. Dedent. Based on coding guidelines.

             if self._generate_mocks:
                 file_path = self._get_mock_file_path("resource_instructions")
@@
-                logging.warning(
-                    f"A mock file was generated for you at {file_path} with the contentof dal.get_resource_instructions({type}, {name})"
-                )
-
-                return data
+                logging.warning(
+                    "A mock file was generated for you at %s with the content of "
+                    "dal.get_resource_instructions(%s, %s)",
+                    file_path,
+                    type,
+                    name,
+                )
+            return data
♻️ Duplicate comments (2)
tests/llm/utils/mock_dal.py (2)

24-31: Replace blind except; keep mock-mode defaults and log the exception.

Catching bare Exception violates Ruff BLE001 and hides the error context. Capture the exception, keep the mock defaults, and annotate the intentional broad catch. Based on coding guidelines.

-        try:
-            super().__init__(cluster="test")
-        except Exception:
-            self.enabled = True
-            self.cluster = "test"
-            logging.warning(
-                "Mocksupabase dal could not connect to db. Running in pure mock mode. real db calls and --generate-mock will fail."
-            )
+        try:
+            super().__init__(cluster="test")
+        except Exception as e:  # noqa: BLE001 - broad on purpose for offline mock mode
+            # Ensure sane defaults for pure mock mode
+            self.enabled = True
+            self.cluster = "test"
+            logging.warning(
+                "MockSupabaseDal could not connect to DB; running in pure mock mode. "
+                "Live DB calls and --generate-mocks will fail. (%s)",
+                e,
+            )

101-106: Fix timezone handling: .astimezone on naive datetimes silently assumes local time; normalize to aware UTC.

The current list comprehension calls astimezone on possibly naive datetimes, which interprets them as local time and skews window comparisons. Normalize once and reuse start/end. Based on past reviews.

-            for item in self._issues_metadata:
-                creation_date, start, end = [
-                    datetime.fromisoformat(dt.replace("Z", "+00:00")).astimezone(
-                        timezone.utc
-                    )
-                    for dt in (item["creation_date"], start_datetime, end_datetime)
-                ]
+            def _to_utc(dt_str: str) -> datetime:
+                s = dt_str.replace("Z", "+00:00")
+                dt = datetime.fromisoformat(s)
+                if dt.tzinfo is None:
+                    return dt.replace(tzinfo=timezone.utc)
+                return dt.astimezone(timezone.utc)
+
+            start = _to_utc(start_datetime)
+            end = _to_utc(end_datetime)
+
+            for item in self._issues_metadata:
+                creation_date = _to_utc(item["creation_date"])

Also applies to: 96-100, 107-113

🧹 Nitpick comments (4)
tests/llm/utils/mock_dal.py (4)

47-49: Remove redundant f.close() calls and set encoding in with-open blocks.

with closes files automatically; add encoding for consistency.

-                with open(file_path, "w") as f:
-                    f.write(json.dumps(data or {}, indent=2))
-                    f.close()
+                with open(file_path, "w", encoding="utf-8") as f:
+                    f.write(json.dumps(data or {}, indent=2))
@@
-                with open(file_path, "w") as f:
-                    f.write(json.dumps(data or {}, indent=2))
-                    f.close()
+                with open(file_path, "w", encoding="utf-8") as f:
+                    f.write(json.dumps(data or {}, indent=2))
@@
-                with open(file_path, "w") as f:
-                    f.write(json.dumps(data or {}, indent=2))
-                    f.close()
+                with open(file_path, "w", encoding="utf-8") as f:
+                    f.write(json.dumps(data or {}, indent=2))

Also applies to: 67-69, 141-143


51-53: Tighten logging: fix typos, prefer structured placeholders.

Minor cleanups: “contentof” typo; consistent class name; structured logging instead of f-strings.

-                logging.warning(
-                    f"A mock file was generated for you at {file_path} with the contentof dal.get_issue_data({issue_id})"
-                )
+                logging.warning(
+                    "A mock file was generated for you at %s with the content of dal.get_issue_data(%s)",
+                    file_path,
+                    issue_id,
+                )
@@
-                logging.warning(
-                    f"A mock file was generated for you at {file_path} "
-                    f"with the content of dal.get_issues_metadata("
-                    f"{start_datetime}, {end_datetime}, {limit}, "
-                    f"{workload}, {ns}, {finding_type})"
-                )
+                logging.warning(
+                    "A mock file was generated for you at %s with the content of "
+                    "dal.get_issues_metadata(%s, %s, %s, %s, %s, %s)",
+                    file_path,
+                    start_datetime,
+                    end_datetime,
+                    limit,
+                    workload,
+                    ns,
+                    finding_type,
+                )

Also applies to: 71-73, 145-150
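
The placeholder style also defers string formatting until a record is actually emitted. A tiny self-contained check (logger name and values are made up):

```python
import logging

records = []


class ListHandler(logging.Handler):
    # Capture records instead of writing them anywhere
    def emit(self, record: logging.LogRecord) -> None:
        records.append(record)


logger = logging.getLogger("mock_dal_demo")
logger.setLevel(logging.WARNING)
logger.addHandler(ListHandler())

# The %s values travel as args; the template is not pre-formatted
logger.warning("A mock file was generated for you at %s", "/tmp/mock.json")

print(records[0].msg)           # raw template, still contains %s
print(records[0].getMessage())  # interpolated message
```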


57-59: Avoid shadowing built-in ‘type’ in get_resource_instructions signature.

Rename parameter to resource_type to improve clarity.

-    def get_resource_instructions(
-        self, type: str, name: Optional[str]
+    def get_resource_instructions(
+        self, resource_type: str, name: Optional[str]
     ) -> Optional[ResourceInstructions]:
@@
-            data = super().get_resource_instructions(type, name)
+            data = super().get_resource_instructions(resource_type, name)
@@
-                    "dal.get_resource_instructions(%s, %s)",
-                    file_path,
-                    type,
-                    name,
+                    "dal.get_resource_instructions(%s, %s)", file_path, resource_type, name

164-168: Harden fixture parsing for issues_metadata.json.

Be defensive against invalid JSON or wrong top-level type.

-    if issues_metadata_path.exists():
-        issues_metadata = json.loads(read_file(issues_metadata_path))
+    if issues_metadata_path.exists():
+        try:
+            raw = json.loads(read_file(issues_metadata_path))
+            if isinstance(raw, list):
+                issues_metadata = raw
+            else:
+                logging.warning(
+                    "issues_metadata.json should contain a list; got %s",
+                    type(raw).__name__,
+                )
+                issues_metadata = []
+        except json.JSONDecodeError as e:
+            logging.warning("Failed to parse issues_metadata.json: %s", e)
+            issues_metadata = []
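
The same guard as a standalone helper (hypothetical function name), showing each failure path falls back to an empty list:

```python
import json
import logging


def load_issues_metadata(text: str) -> list:
    """Parse fixture text defensively: bad JSON or a non-list top level -> []."""
    try:
        raw = json.loads(text)
    except json.JSONDecodeError as e:
        logging.warning("Failed to parse issues_metadata.json: %s", e)
        return []
    if isinstance(raw, list):
        return raw
    logging.warning(
        "issues_metadata.json should contain a list; got %s", type(raw).__name__
    )
    return []


print(load_issues_metadata('[{"id": "f-1"}]'))  # the list itself
print(load_issues_metadata('{"id": "f-1"}'))    # []
print(load_issues_metadata("not json"))         # []
```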
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 59006b1 and bd18cfd.

📒 Files selected for processing (1)
  • tests/llm/utils/mock_dal.py (4 hunks)
🧰 Additional context used
📓 Path-based instructions (3)
**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: Use Ruff for formatting and linting (configured in pyproject.toml) for all Python code
Type hints are required; code should pass mypy (configured in pyproject.toml)
ALWAYS place Python imports at the top of the file, not inside functions or methods

Files:

  • tests/llm/utils/mock_dal.py
tests/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

Only use pytest markers that are defined in pyproject.toml; never introduce undefined markers/tags

Files:

  • tests/llm/utils/mock_dal.py
tests/**

📄 CodeRabbit inference engine (CLAUDE.md)

Test files should mirror the source structure under tests/

Files:

  • tests/llm/utils/mock_dal.py
🧬 Code graph analysis (1)
tests/llm/utils/mock_dal.py (4)
holmes/core/supabase_dal.py (3)
  • SupabaseDal (70-603)
  • FindingType (57-59)
  • get_issues_metadata (246-303)
holmes/utils/global_instructions.py (1)
  • Instructions (6-7)
holmes/core/resource_instruction.py (1)
  • ResourceInstructions (15-17)
tests/llm/utils/test_case_utils.py (1)
  • read_file (50-52)
🪛 Ruff (0.14.1)
tests/llm/utils/mock_dal.py

26-26: Do not catch blind exception: Exception

(BLE001)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: llm_evals
  • GitHub Check: Pre-commit checks
  • GitHub Check: build

# Conflicts:
#	tests/llm/utils/mock_dal.py
@RoiGlinik RoiGlinik requested a review from Avi-Robusta October 28, 2025 10:27

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

♻️ Duplicate comments (5)
tests/llm/utils/mock_dal.py (5)

1-5: Remove file-level type ignore and add proper type hints.

The # type: ignore at line 1 suppresses all type checking, violating the coding guideline that "code should pass mypy". Import Any and use parameterized types throughout.

Apply this diff:

-# type: ignore
 import json
 import logging
 from pathlib import Path
-from typing import Dict, List, Optional
+from typing import Any, Dict, List, Optional

Then update type hints on lines 22, 143 as shown in subsequent comments.


22-22: Use parameterized Dict type hint.

The issues_metadata parameter should use Dict[str, Any] instead of unparameterized Dict.

Apply this diff:

-        issues_metadata: Optional[List[Dict]],
+        issues_metadata: Optional[List[Dict[str, Any]]],

28-35: Fix exception handling: capture exception and improve logging.

The bare except: violates Ruff BLE001 and doesn't capture the exception details. The log message also contains a typo.

Apply this diff:

         if initialize_base:
             try:
                 super().__init__(cluster="test")
-            except:  # noqa: E722
+            except Exception as e:  # noqa: BLE001 - broad on purpose for offline mock mode
                 self.enabled = True
                 self.cluster = "test"
                 logging.warning(
-                    "Mocksupabase dal could not connect to db. Running in pure mock mode. real db calls and --generate-mock will fail."
+                    "MockSupabaseDal could not connect to DB; running in pure mock mode. Live DB calls and --generate-mocks will fail. (%s)",
+                    e,
                 )

134-143: Use parameterized return type.

The return type should use List[Dict[str, Any]] instead of unparameterized List[Dict].

Apply this diff:

     def get_issues_metadata(
         self,
         start_datetime: str,
         end_datetime: str,
         limit: int = 100,
         workload: Optional[str] = None,
         ns: Optional[str] = None,
         cluster: Optional[str] = None,
         finding_type: FindingType = FindingType.CONFIGURATION_CHANGE,
-    ) -> Optional[List[Dict]]:
+    ) -> Optional[List[Dict[str, Any]]]:

148-154: Fix timezone handling: naive datetimes are silently interpreted as local time.

Calling .astimezone() on a naive datetime (one without tzinfo) makes Python 3 assume the system's local timezone. If creation_date or the input strings lack timezone info, the range comparison depends on the host's timezone and can be silently wrong.

Apply this diff to normalize all datetimes to aware UTC before comparison:

+            def _to_utc(dt_str: str) -> datetime:
+                # Accept '...Z', '+00:00', or naive -> assume UTC
+                s = dt_str.replace("Z", "+00:00")
+                dt = datetime.fromisoformat(s)
+                if dt.tzinfo is None:
+                    return dt.replace(tzinfo=timezone.utc)
+                return dt.astimezone(timezone.utc)
+
             for item in self._issues_metadata:
-                creation_date, start, end = [
-                    datetime.fromisoformat(dt.replace("Z", "+00:00")).astimezone(
-                        timezone.utc
-                    )
-                    for dt in (item["creation_date"], start_datetime, end_datetime)
-                ]
+                creation_date = _to_utc(item["creation_date"])
+                start = _to_utc(start_datetime)
+                end = _to_utc(end_datetime)
                 if not (start <= creation_date <= end):
                     continue
🧹 Nitpick comments (1)
tests/llm/utils/mock_dal.py (1)

168-177: Consider dict comprehension for cleaner code.

The manual dict construction could be simplified using a dict comprehension or by selecting only the needed keys upfront.

For example:

                keys = ["id", "title", "subject_name", "subject_namespace", 
                        "subject_type", "description", "starts_at", "ends_at"]
                filtered_item = {k: item.get(k) for k in keys}
                filtered_data.append(filtered_item)
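
For illustration, the comprehension applied to one made-up record (the key list comes from the suggestion above; the record itself is fabricated):

```python
keys = [
    "id", "title", "subject_name", "subject_namespace",
    "subject_type", "description", "starts_at", "ends_at",
]

item = {
    "id": "f-1",
    "title": "OOMKilled",
    "subject_name": "payments",
    "subject_namespace": "default",
    "subject_type": "Deployment",
    "description": "container exceeded its memory limit",
    "starts_at": "2025-10-23T12:00:00Z",
    # "ends_at" intentionally missing: .get() fills it with None
    "internal_field": "should be dropped",
}

# Keep exactly the allowed keys; extras are excluded, gaps become None
filtered_item = {k: item.get(k) for k in keys}

print(sorted(filtered_item))
print(filtered_item["ends_at"])  # None
```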
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between bd18cfd and 1e965c9.

📒 Files selected for processing (2)
  • tests/llm/utils/mock_dal.py (5 hunks)
  • tests/llm/utils/mock_toolset.py (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • tests/llm/utils/mock_toolset.py
🧰 Additional context used
📓 Path-based instructions (3)
**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: Use Ruff for formatting and linting (configured in pyproject.toml) for all Python code
Type hints are required; code should pass mypy (configured in pyproject.toml)
ALWAYS place Python imports at the top of the file, not inside functions or methods

Files:

  • tests/llm/utils/mock_dal.py
tests/**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

Only use pytest markers that are defined in pyproject.toml; never introduce undefined markers/tags

Files:

  • tests/llm/utils/mock_dal.py
tests/**

📄 CodeRabbit inference engine (CLAUDE.md)

Test files should mirror the source structure under tests/

Files:

  • tests/llm/utils/mock_dal.py
🧬 Code graph analysis (1)
tests/llm/utils/mock_dal.py (2)
holmes/core/supabase_dal.py (3)
  • SupabaseDal (71-677)
  • FindingType (58-60)
  • get_issues_metadata (247-304)
tests/llm/utils/test_case_utils.py (1)
  • read_file (50-52)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: Pre-commit checks
  • GitHub Check: llm_evals
  • GitHub Check: build
🔇 Additional comments (1)
tests/llm/utils/mock_dal.py (1)

215-218: LGTM! Consistent mock loading pattern.

The loading of issues_metadata.json follows the same pattern as existing mock data files (issue_data.json, resource_instructions.json), maintaining consistency in the mock data infrastructure.

Also applies to: 233-233

@github-actions

Results of HolmesGPT evals

  • ask_holmes: 35/36 test cases were successful, 0 regressions, 1 setup failure
| Test suite | Test case | Status |
|---|---|---|
| ask | 01_how_many_pods | ✅ |
| ask | 02_what_is_wrong_with_pod | ✅ |
| ask | 04_related_k8s_events | ✅ |
| ask | 05_image_version | ✅ |
| ask | 09_crashpod | ✅ |
| ask | 10_image_pull_backoff | ✅ |
| ask | 110_k8s_events_image_pull | ✅ |
| ask | 11_init_containers | ✅ |
| ask | 13a_pending_node_selector_basic | ✅ |
| ask | 14_pending_resources | ✅ |
| ask | 15_failed_readiness_probe | ✅ |
| ask | 17_oom_kill | ✅ |
| ask | 18_oom_kill_from_issues_history | ✅ |
| ask | 19_detect_missing_app_details | ✅ |
| ask | 20_long_log_file_search | ✅ |
| ask | 24_misconfigured_pvc | ✅ |
| ask | 24a_misconfigured_pvc_basic | ✅ |
| ask | 28_permissions_error | 🚧 |
| ask | 39_failed_toolset | ✅ |
| ask | 41_setup_argo | ✅ |
| ask | 42_dns_issues_steps_new_tools | ✅ |
| ask | 43_current_datetime_from_prompt | ✅ |
| ask | 45_fetch_deployment_logs_simple | ✅ |
| ask | 51_logs_summarize_errors | ✅ |
| ask | 53_logs_find_term | ✅ |
| ask | 54_not_truncated_when_getting_pods | ✅ |
| ask | 59_label_based_counting | ✅ |
| ask | 60_count_less_than | ✅ |
| ask | 61_exact_match_counting | ✅ |
| ask | 63_fetch_error_logs_no_errors | ✅ |
| ask | 79_configmap_mount_issue | ✅ |
| ask | 83_secret_not_found | ✅ |
| ask | 86_configmap_like_but_secret | ✅ |
| ask | 93_calling_datadog[0] | ✅ |
| ask | 93_calling_datadog[1] | ✅ |
| ask | 93_calling_datadog[2] | ✅ |

Legend

  • ✅ the test was successful
  • ➖ the test was skipped
  • ⚠️ the test failed but is known to be flaky or known to fail
  • 🚧 the test had a setup failure (not a code regression)
  • 🔧 the test failed due to mock data issues (not a code regression)
  • 🚫 the test was throttled by API rate limits/overload
  • ❌ the test failed and should be fixed before merging the PR

