Skip to content

COW-593 task 2 remaining dashboards resources comparison#18

Merged
lgahdl merged 7 commits intojefferson/cow-615-cow-593-task-1-essential-dashboards-overview-apifrom
jefferson/cow-616-cow-593-task-2-remaining-dashboards-resources-comparison
Feb 25, 2026
Merged

COW-593 task 2 remaining dashboards resources comparison#18
lgahdl merged 7 commits intojefferson/cow-615-cow-593-task-1-essential-dashboards-overview-apifrom
jefferson/cow-616-cow-593-task-2-remaining-dashboards-resources-comparison

Conversation

@jeffersonBastos
Copy link
Collaborator

@jeffersonBastos jeffersonBastos commented Feb 11, 2026

Summary

Completes COW-593 Task 2 by adding the remaining three Grafana dashboards (Resources, Comparison, Trader Activity) and updating all dashboards with cross-navigation links, providing a complete monitoring suite for CoW Protocol performance testing.

Changes

  • Add Resources dashboard (cow-perf-resources): CPU, memory, and network I/O monitoring per container with gauges, time series, and summary tables
  • Add Comparison dashboard (cow-perf-comparison): Baseline vs current performance comparison with regression indicators, latency/throughput deltas, and severity breakdowns
  • Add Trader Activity dashboard (cow-perf-traders): Per-trader statistics, top N analysis, order distribution, and success rates
  • Update existing dashboards (Overview, API Performance) with navigation links to all 5 dashboards
  • Add implementation plans and task documentation for COW-591 and COW-593

Dashboard Suite Overview

The complete dashboard suite now includes:

Dashboard UID Purpose
Overview cow-perf-overview High-level test status and KPIs
API Performance cow-perf-api API latency and throughput metrics
Resources cow-perf-resources Container resource utilization
Comparison cow-perf-comparison Baseline comparison and regression detection
Trader Activity cow-perf-traders Per-trader performance analysis

How to Test

  1. Start the CoW Protocol services with Docker Compose
  2. Import dashboards into Grafana from configs/dashboards/
  3. Verify cross-navigation links work between all 5 dashboards
  4. Run a performance test to verify metrics populate correctly

Checklist

  • Tests pass (poetry run pytest)
  • Linting passes (poetry run ruff check .)
  • Type checking passes (poetry run mypy src/)
  • Documentation updated (if needed)
  • Breaking changes documented (if any)

Breaking Changes

None

Related Issues

  • COW-593: Grafana Dashboards (Task 2: Remaining Dashboards)
  • COW-591: Prometheus Exporters (documentation added)

Add three new dashboards completing the Grafana visualization suite:

- Resources dashboard: CPU, memory, network monitoring per container
- Comparison dashboard: baseline vs current with regression indicators
- Trader Activity dashboard: per-trader statistics and activity patterns

Update existing dashboards with cross-navigation links to all 5 dashboards.
… COW-593

Document Prometheus exporter phases and Grafana dashboard implementation
plans to track progress on metrics infrastructure work.
@linear
Copy link

linear bot commented Feb 11, 2026

- Add prometheus_port config field with default 9091
- CLI uses config default, --prometheus-port 0 to disable
- Enhance order timeout logging with status, age, token pair, lifecycle
- Improve monitoring output with status breakdown counts
- Show all terminal states in final summary (filled/expired/failed/cancelled)
- Update README and CLI docs with monitoring instructions
Add concurrent Prometheus metrics update loop that exports test progress
and throughput metrics every second during performance test runs. This
fixes "No Data" panels in the Overview dashboard.

Remove redundant P50 delta panels from the comparison dashboard and
adjust grid positions for cleaner layout.
@jeffersonBastos jeffersonBastos marked this pull request as ready for review February 12, 2026 18:17
jeffersonBastos and others added 3 commits February 13, 2026 17:35
- Create 7 core alerting rules (latency, error rate, throughput, resources, test execution)
- Enable rule_files in Prometheus configuration
- Add alerts volume mount in Docker Compose
- Add Grafana annotations to show firing alerts on dashboard
- Add container_memory_percent metric for CriticalMemoryUsage alert
- Add implementation plan: thoughts/plans/2026-02-13-cow-598-alerting-rules.md
- Add implementation notes to ticket file documenting scope decisions
- Update INDEX.md with plan entry and document cluster reference

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@lgahdl lgahdl merged commit 3f6ee8b into jefferson/cow-615-cow-593-task-1-essential-dashboards-overview-api Feb 25, 2026
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants