Turn Spec delivery into measurable weekly outcomes with
sce value metrics.
Most AI-assisted workflows can ship code, but they often cannot answer:
- Are we delivering faster this week than last week?
- Is quality improving or degrading?
- Should we pass Day30/Day60 gate based on evidence?
sce solves this with machine-readable KPI snapshots, baseline derivation, trend analysis, and gate-ready summaries.
# 0) Generate sample input (first-time setup)
sce value metrics sample --out ./kpi-input.json --period 2026-W10 --json
# 1) Snapshot this week
sce value metrics snapshot --input ./kpi-input.json --period 2026-W10 --checkpoint day-60 --json
# 2) Build baseline from earliest history snapshots
sce value metrics baseline --from-history 3 --period 2026-W10 --json
# 3) Analyze trend risk from latest snapshots
sce value metrics trend --window 6 --jsonIf you do not use the sample command, create kpi-input.json manually:
{
"period": "2026-W10",
"metrics": {
"ttfv_minutes": 25,
"batch_success_rate": 0.86,
"cycle_reduction_rate": 0.34,
"manual_takeover_rate": 0.16
},
"notes": "weekly review snapshot"
}snapshot --json returns a compact machine-readable summary:
{
"success": true,
"period": "2026-W10",
"risk_level": "medium",
"triggered_metrics": [
"manual_takeover_rate"
],
"snapshot_path": ".sce/specs/114-00-kpi-automation-and-observability/custom/weekly-metrics/2026-W10.json",
"gate_summary_path": ".sce/specs/114-00-kpi-automation-and-observability/custom/weekly-metrics/gate-summary.2026-W10.day-60.json",
"contract_path": "metric-definition.yaml"
}trend --json returns range, risk, and per-metric direction:
{
"success": true,
"period": "2026-W10",
"window_size": 3,
"range": {
"from": "2026-W08",
"to": "2026-W10"
},
"risk_level": "high",
"triggered_metrics": [
"ttfv_minutes"
],
"metrics": [
{
"metric_id": "ttfv_minutes",
"delta": 3,
"trend": "up",
"better_direction": "lower",
"status": "degraded",
"target_passed": true
}
],
"trend_path": ".sce/specs/114-00-kpi-automation-and-observability/custom/weekly-metrics/trend.latest.json"
}- Run
snapshotonce per week after major delivery batch. - Regenerate baseline when process or scope changes materially.
- Run
trendbefore Day30/Day60 decision review. - Attach generated JSON files to review notes as evidence.
- Auditability: Full input/output trace for each KPI run.
- Comparability: Same metric contract across weeks and agents.
- Gate-readiness: Directly consumable decision payload for Day30/Day60.
- Operational focus: Risk reasons are explicit, not subjective.