Feat/add testing structure to v2 by victorstevansuse · Pull Request #36 · SUSE/suse-ai-observability-extension

victorstevansuse · 2026-03-20T18:36:07Z

Add comprehensive test suite, developer tooling, and local test infrastructure

Summary

Static validation tests that parse and validate all STY templates, Groovy scripts, monitors, metric bindings, and views.
Snapshot tests that detect any unintended change to the extension's 199 topology nodes.
Fuzz tests that verify parser robustness.
Bats tests for the init.sh install/uninstall logic, with a mocked sts CLI.
Groovy linting integrated via npm-groovy-lint.
Local test environment via K3d with SUSE Observability, a production-mirror OTel Collector, and pluggable AI components (QDrant, Ollama, Milvus, OpenSearch, vLLM).
Demo apps integration (suse-ai-demo-apps) for manual exploration of the full telemetry pipeline.
task check: a single command to validate everything before pushing. Effectively a very primitive static analysis.
task deploy: safe deployment that validates before uploading.

Still W.I.P

Integration tests are currently just stubs of what I want to do.
Although the infra runs fine in K3d, I'm still tweaking the collector and demo configs.

Bugs found and fixed

products.sty referenced ...request-sucess-rate (typo) instead of ...success-rate — the Milvus success rate metric was silently missing from the UI
suse-ai-product-id-extractor.groovy had an unused typeName variable flagged by the linter
init.sh used xargs, which is not available in the container base image

What's included

Test infrastructure (`tests/`)

tests/
├── static/                     # 34 tests — no infra needed
│   ├── certains_test.go        # CERTAINS.md + knowledge/ fact enforcement
│   ├── config_test.go          # stackpack.conf file reference validation
│   ├── packaging_test.go       # Include resolution, circular deps
│   ├── sty_test.go             # Unique IDs and identifiers
│   ├── monitors_test.go        # Remediation hint references 
│   ├── groovy_test.go          # Product catalog, type mappings, toString safety
│   ├── crossref_test.go        # Metric binding cross-references
│   ├── views_test.go           # View → ViewType → Menu cross-references
│   └── snapshot_test.go        # Golden file regression detection
├── init/
│   └── init_test.bats          # init.sh tests with mocked sts CLI 
├── integration/                # Stubs for deploy-and-verify tests 
├── internal/
│   ├── parser/                 # STY, HOCON, Groovy parsers + fuzz tests
│   ├── snapshot/               # Golden file comparison utility
│   ├── stackstate/             # REST API client
│   ├── otel/                   # OTel fixture builder
│   └── testutil/               # Path resolution helpers
├── testdata/snapshots/         # 6 golden files
└── infra/
    ├── setup.sh                # K3d lifecycle manager
    ├── otel-values.yaml        # Production-mirror OTel Collector Helm values
    ├── otel-config.yaml        # Standalone OTel config for docker-compose
    ├── docker-compose.yaml     # OTel Collector for use with external StackState
    ├── components/             # K8s manifests for AI components
    │   ├── qdrant.yaml         # Vector database (default)
    │   ├── ollama.yaml         # Inference engine, CPU mode (default)
    │   ├── milvus.yaml         # Vector database (opt-in)
    │   ├── opensearch.yaml     # Search engine (opt-in)
    │   └── vllm.yaml           # Inference engine, CPU mode (opt-in)
    └── demo-apps/
        └── values.yaml         # Helm values for suse-ai-demo-apps

Taskfile targets

| Command | Purpose |
| --- | --- |
| `task check` | Lint + static tests + init tests |
| `task check SILENT=1` | Same, with minimal output |
| `task deploy` | check → version-up → upload |
| `task lint` | Groovy linting |
| `task lint-fix` | Auto-fix Groovy lint issues |
| `task test-static` | 34 Go static validation tests |
| `task test-init` | 10 bats tests for init.sh |
| `task test-integration` | Integration tests (needs running StackState) |
| `task infra-up` | Provision K3d cluster with full stack |
| `task infra-down` | Tear down K3d cluster |
| `task infra-status` | Show pod status across both namespaces |

Local test environment

task infra-up deploys into a K3d cluster:

SUSE Observability (suse-observability namespace) — full trial deployment via Helm
OTel Collector (suse-private-ai namespace) — mirrors production config with all GenAI inference pipelines, tail sampling, spanmetrics, and component-specific transforms
QDrant + Ollama (default) — vector database and inference engine
Demo apps (default) — RAG pipeline generating realistic GenAI telemetry for manual exploration
Milvus, OpenSearch, vLLM (opt-in via DEPLOY_MILVUS=true, etc.)

The OTel collector image defaults to otel/opentelemetry-collector-contrib:0.147.0 and can be switched to the custom build (e.g. otelcol-suse-ai) via OTEL_COLLECTOR_IMAGE.

Note:
I don't think this will ever be suitable for robust tests given the GPU constraints of an AI stack, but it still helps with some tests.

What the static tests enforce

From knowledge/CERTAINS.md:

Icon base64 has valid prefix, encoding, and is single-line
Sync nodes have componentActions field
Include paths don't have double provisioning/ prefix
QueryViews have queryVersion field
ComponentType highlights have about section
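
As a rough illustration of the first rule above, the icon check could look something like the sketch below. The function name `validateIcon` and the exact `data:image/...;base64,` prefix format are assumptions made for this example; the real test in `certains_test.go` may be structured differently.

```go
package main

import (
	"encoding/base64"
	"errors"
	"fmt"
	"strings"
)

// validateIcon is a hypothetical helper that applies the three icon rules:
// single-line, expected prefix, and a decodable base64 payload.
// The "data:image/...;base64," layout is an assumption for illustration.
func validateIcon(icon string) error {
	// Check line breaks first: Go's base64 decoder silently ignores them.
	if strings.ContainsAny(icon, "\r\n") {
		return errors.New("icon must be single-line")
	}
	if !strings.HasPrefix(icon, "data:image/") {
		return errors.New("icon must start with data:image/")
	}
	_, payload, found := strings.Cut(icon, ";base64,")
	if !found || payload == "" {
		return errors.New("icon is missing a ;base64, payload")
	}
	if _, err := base64.StdEncoding.DecodeString(payload); err != nil {
		return fmt.Errorf("icon payload is not valid base64: %w", err)
	}
	return nil
}

func main() {
	// Placeholder bytes, not a real image.
	icon := "data:image/png;base64," + base64.StdEncoding.EncodeToString([]byte("fake-png-bytes"))
	fmt.Println("valid icon:", validateIcon(icon))
	fmt.Println("multi-line icon:", validateIcon(icon+"\n"))
}
```

The line-break check runs before decoding on purpose, since a wrapped payload would otherwise decode without error.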

From knowledge/RECOVERY_PROTOCOL.md and knowledge/MONITOR_CREATION_GUIDE.md:

ComponentType highlights have events, externalComponent, relatedResources
Monitors have description, status, intervalSeconds, arguments.metric
Child STY files don't have nodes: root key
Metric binding URN references resolve to existing bindings
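
The last rule is the one that would catch a typo like the `sucess-rate` reference fixed in this PR. A minimal sketch of such a cross-reference check, assuming the defined and referenced URNs have already been extracted into string slices (the name `danglingRefs` and the URNs in `main` are hypothetical, not taken from the repository):

```go
package main

import (
	"fmt"
	"sort"
)

// danglingRefs returns every referenced URN that has no matching definition,
// deduplicated and sorted so test output is stable.
func danglingRefs(defined, referenced []string) []string {
	known := make(map[string]bool, len(defined))
	for _, urn := range defined {
		known[urn] = true
	}
	reported := make(map[string]bool)
	var missing []string
	for _, urn := range referenced {
		if !known[urn] && !reported[urn] {
			reported[urn] = true
			missing = append(missing, urn)
		}
	}
	sort.Strings(missing)
	return missing
}

func main() {
	// Hypothetical URNs, shaped only to mirror the typo described in this PR.
	defined := []string{"urn:example:metric-binding:request-success-rate"}
	referenced := []string{"urn:example:metric-binding:request-sucess-rate"}
	for _, urn := range danglingRefs(defined, referenced) {
		fmt.Println("unresolved metric binding reference:", urn)
	}
}
```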

Structural validation:

All include paths resolve to real files (recursive), no circular includes
All node IDs and identifiers are unique
Groovy scripts handle all known products and types, and call .toString() on externalId
QueryViews reference existing ViewTypes, MainMenuGroup items reference existing QueryViews
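
For the circular-include rule, one common approach is a depth-first walk that flags any include pointing back at a file still on the current path. The sketch below assumes the include graph has already been parsed into a map from file to included files; `findCycle` and the file names are hypothetical and not necessarily how `packaging_test.go` does it.

```go
package main

import (
	"fmt"
	"sort"
)

// Visit states for the depth-first walk.
const (
	unvisited = iota
	inProgress
	done
)

// findCycle returns the first include cycle found as a path that starts and
// ends with the same file, or nil if the graph is acyclic.
func findCycle(includes map[string][]string) []string {
	state := make(map[string]int, len(includes))
	var stack []string

	var visit func(file string) []string
	visit = func(file string) []string {
		state[file] = inProgress
		stack = append(stack, file)
		for _, next := range includes[file] {
			switch state[next] {
			case inProgress:
				// next is already on the current path: report the loop.
				for i, f := range stack {
					if f == next {
						cycle := append([]string{}, stack[i:]...)
						return append(cycle, next)
					}
				}
			case unvisited:
				if cycle := visit(next); cycle != nil {
					return cycle
				}
			}
		}
		stack = stack[:len(stack)-1]
		state[file] = done
		return nil
	}

	// Sort the roots so the reported cycle does not depend on map order.
	roots := make([]string, 0, len(includes))
	for file := range includes {
		roots = append(roots, file)
	}
	sort.Strings(roots)
	for _, root := range roots {
		if state[root] == unvisited {
			if cycle := visit(root); cycle != nil {
				return cycle
			}
		}
	}
	return nil
}

func main() {
	// Hypothetical include graph with a loop between two files.
	graph := map[string][]string{
		"main.sty":     {"monitors.sty"},
		"monitors.sty": {"main.sty"},
	}
	fmt.Println(findCycle(graph))
}
```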

Snapshot regression detection (golden files):

Component types (27), metric bindings (110), monitors (15), views (23), include graph (38), Groovy switch cases (3 scripts)
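
The golden-file mechanism behind these snapshots can be sketched as follows. This is only an outline under stated assumptions: `compareGolden`, its `update` flag, and the file name in `main` are hypothetical, and the utility in `tests/internal/snapshot/` may expose a different API.

```go
package main

import (
	"bytes"
	"fmt"
	"os"
	"path/filepath"
)

// compareGolden compares got with the golden file at path. With update=true
// it rewrites the golden file instead, which is how an intended change would
// be accepted and then reviewed in the diff.
func compareGolden(path string, got []byte, update bool) error {
	if update {
		if err := os.MkdirAll(filepath.Dir(path), 0o755); err != nil {
			return err
		}
		return os.WriteFile(path, got, 0o644)
	}
	want, err := os.ReadFile(path)
	if err != nil {
		return fmt.Errorf("read golden file: %w", err)
	}
	if !bytes.Equal(want, got) {
		return fmt.Errorf("snapshot mismatch for %s: golden has %d bytes, got %d bytes",
			path, len(want), len(got))
	}
	return nil
}

func main() {
	dir, err := os.MkdirTemp("", "golden-demo")
	if err != nil {
		panic(err)
	}
	defer os.RemoveAll(dir)

	// Hypothetical golden file listing component type names, one per line.
	path := filepath.Join(dir, "component_types.golden")
	if err := compareGolden(path, []byte("type-a\ntype-b\n"), true); err != nil {
		panic(err)
	}
	fmt.Println("unchanged:", compareGolden(path, []byte("type-a\ntype-b\n"), false))
	fmt.Println("changed:", compareGolden(path, []byte("type-a\n"), false))
}
```

Comparing raw bytes keeps the check strict: any change, including ordering, fails until the golden file is deliberately regenerated.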

Test plan

task check passes (lint + 34 static tests + 10 init tests)
task check SILENT=1 runs with minimal output
task lint-fix auto-fixes Groovy style issues
Fuzz tests run 10s each without crashes
Integration test stubs compile
Golden files are reviewed and committed
task infra-up deploys successfully and task infra-status shows all pods running
Run task deploy against a test instance to verify the Milvus metric fix

victorstevansuse added 26 commits March 19, 2026 20:03

feat(tests): initialize Go test module with directory structure and path utilities

829f15e

Add STY parser with TDD approach

925f8b1

Add Groovy script text analyzer

c0b7262

Add HOCON parser for stackpack.conf with TDD

28038d1

feat(tests): add stackpack.conf validation tests

2b13bbd

feat(tests): add packaging completeness tests

43c3548

feat(tests): add STY structure validation tests

c65fbab

feat(tests): add monitor validation tests

575df00

feat(tests): add Groovy script pattern validation tests

beed92b

feat(tests): add Handlebars and cross-reference validation tests

4b800c7

feat(tests): add integration test infrastructure, API client, OTel fixtures, and test stubs

0fb62ac

feat: add test targets to Taskfile (test-static, test-integration, test-all)

a7acccf

feat(tests): add golden file snapshot comparison utility

72ebc9b

feat(tests): add fuzz tests for parser robustness

18650e6

feat(tests): add snapshot tests with golden files for regression detection

08ef7ad

refactor(tests): remove low-value and redundant tests

bfc6a52

feat: add Groovy linting, init.sh tests, and deploy pipeline

3302635

refactor(tests): trim init.sh tests to 10 high-value cases

1f24913

feat(tests): enforce CERTAINS.md facts via static analysis

811b580

feat(tests): enforce rules from knowledge/ docs via static analysis

c466d33

fix: remove unused typeName variable in product ID extractor

94ab802

feat: add check verbosity modes and lint-fix task

d3dc67b

fix(infra): replace Docker Compose with K3d + Helm deployment

d36640f

fix: replace xargs with bash parameter expansion in init.sh

2fdd43a

feat(infra): add OTel collector, AI components, and demo apps to test infra

d5cfe51

feat(tests): update README.md with the new tests commands

e494ef8

victorstevansuse requested a review from thbertoldi March 20, 2026 18:55

victorstevansuse wants to merge 26 commits into stackpack-v2 from feat/add-testing-structure-to-v2
