Gateway Watchdog Discord

Discord-first watchdog skill for OpenClaw Gateway health monitoring, alerting, and safe recovery.

🛠️ Installation

1. Ask OpenClaw (Recommended)

Tell OpenClaw: "Install the gateway-watchdog skill." The agent will handle the installation and configuration automatically.

2. Manual Installation (CLI)

If you prefer the terminal, run:

clawhub install gateway-watchdog

What This Skill Does

Monitors Gateway health with a state machine (healthy, degraded, critical)
Sends Discord incident messages (ALERT, RECOVERED)
Deduplicates noisy failures using threshold + cooldown
Supports optional bounded auto-restart
Runs via two paths:
- Internal OpenClaw cron (normal operations)
- External macOS LaunchAgent fallback (when Gateway is unhealthy)

Why Two Paths

The cron path is the native OpenClaw scheduler route and is easy to manage with openclaw cron.
The LaunchAgent path is a fallback that can still run when OpenClaw Gateway scheduling is impaired.

Using both means an outage that also stalls the internal scheduler is still detected, which cron-only monitoring would miss.

Skill Layout

SKILL.md - Agent-facing skill instructions
scripts/gateway-watchdog.sh - Core watchdog logic
scripts/install-launchd.sh - LaunchAgent installer helper
references/com.openclaw.gateway-watchdog.plist.template - launchd template
references/cron-agent-turn.md - Isolated cron prompt template

Runtime Data Isolation

All runtime artifacts stay in:

~/.openclaw/watchdogs/gateway-discord/

Files:

state.json - current watchdog state
events.jsonl - append-only event history
backups/state-*.json - rolling state backups
config.env - local deployment config (tokens and channel IDs)

This avoids touching OpenClaw core files for normal watchdog operation.
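
As an illustration of how the rolling backups could be kept bounded by GW_WATCHDOG_KEEP_BACKUPS, here is a minimal sketch. It is not taken from gateway-watchdog.sh: prune_backups is a hypothetical helper, and it assumes backup names sort chronologically (for example a timestamp suffix after state-).

```shell
#!/usr/bin/env bash
# Hypothetical helper: keep only the newest N state backups in a directory.
# Assumption: names like state-<timestamp>.json sort oldest-first.
prune_backups() {
  local dir="$1" keep="${2:-50}"
  local total=0 excess f
  # Count existing backups (the glob stays literal when nothing matches).
  for f in "$dir"/state-*.json; do
    [ -e "$f" ] && total=$(( total + 1 ))
  done
  excess=$(( total - keep ))
  # Glob expansion is sorted, so the oldest names are removed first.
  for f in "$dir"/state-*.json; do
    [ "$excess" -gt 0 ] || break
    [ -e "$f" ] || continue
    rm -f -- "$f"
    excess=$(( excess - 1 ))
  done
}
```

A call such as prune_backups "$GW_WATCHDOG_BASE_DIR/backups" "$GW_WATCHDOG_KEEP_BACKUPS" would then leave at most that many files in place.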

Detection Logic

The watchdog checks:

openclaw gateway status --json
openclaw health --json --timeout <ms>

Failure classes:

runtime_stopped
rpc_probe_failed
health_unreachable
auth_mismatch
config_rewritten (baseline drift detected: openclaw.json != openclaw.json.good)
config_invalid
gateway_check_failed
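
The config_rewritten class depends on comparing the live config with the known-good baseline. The sketch below shows one way such a check might look, assuming a byte-for-byte comparison with cmp; the actual script may compare differently (for example after normalising the JSON). config_drift_reason is a hypothetical name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print "config_rewritten" when the live config differs
# from the baseline; print nothing when they match or no baseline exists.
config_drift_reason() {
  local config="${1:-$HOME/.openclaw/openclaw.json}"
  local baseline="${2:-$HOME/.openclaw/openclaw.json.good}"
  # Without a baseline there is nothing to compare against.
  [ -f "$baseline" ] || return 0
  # cmp -s is silent and exits non-zero when the files differ
  # (or when the live config cannot be read).
  if ! cmp -s "$config" "$baseline"; then
    echo "config_rewritten"
  fi
}
```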

Alerting rules:

Alert only after GW_WATCHDOG_FAIL_THRESHOLD consecutive failures
Suppress repeated alerts during GW_WATCHDOG_COOLDOWN_SECONDS
Send RECOVERED once when transitioning back to healthy
Alert body is human-readable (reason label + observed symptom + suggested action)
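
The threshold and cooldown rules above can be read as a small decision function. This is a sketch under assumptions, not the script's actual code: should_alert is a hypothetical helper, and it assumes the caller tracks the consecutive failure count and the epoch time of the last alert (0 meaning none sent yet in this incident).

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed (exit 0) when an ALERT should be sent.
# Args: consecutive failures, current epoch seconds, epoch of last alert (0 = never).
should_alert() {
  local fails="$1" now="$2" last_alert="$3"
  local threshold="${GW_WATCHDOG_FAIL_THRESHOLD:-2}"
  local cooldown="${GW_WATCHDOG_COOLDOWN_SECONDS:-300}"
  # Below the threshold: keep counting failures, stay quiet.
  [ "$fails" -ge "$threshold" ] || return 1
  # No alert sent yet for this incident: send the first one.
  [ "$last_alert" -gt 0 ] || return 0
  # Otherwise alert again only once the cooldown window has elapsed.
  [ $(( now - last_alert )) -ge "$cooldown" ]
}
```

With the defaults, a second consecutive failure triggers the first alert, and further alerts for the same incident are suppressed for 300 seconds.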

Prerequisites

OpenClaw CLI installed and available (openclaw)
Python 3 available (python3)
macOS (for LaunchAgent mode)
Discord delivery config (webhook or bot token)

Delivery Modes

Priority order:

DISCORD_WEBHOOK_URL
DISCORD_BOT_TOKEN + DISCORD_CHANNEL_ID

If DISCORD_WEBHOOK_URL is set, the webhook takes priority. Otherwise the bot API is used with DISCORD_BOT_TOKEN and DISCORD_CHANNEL_ID.
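
To make the priority order concrete, here is a hedged sketch of mode selection and delivery with curl. The function names delivery_mode and send_discord are hypothetical, and the real script may build and send its messages differently. The endpoints are the standard Discord ones: a webhook URL accepts a JSON body with a content field, and the bot API posts to /channels/<id>/messages with an Authorization: Bot header.

```shell
#!/usr/bin/env bash
# Hypothetical helper: report which delivery mode the environment selects.
delivery_mode() {
  if [ -n "${DISCORD_WEBHOOK_URL:-}" ]; then
    echo "webhook"
  elif [ -n "${DISCORD_BOT_TOKEN:-}" ] && [ -n "${DISCORD_CHANNEL_ID:-}" ]; then
    echo "bot"
  else
    echo "none"
    return 1
  fi
}

# Hypothetical sender: posts one message using the selected mode.
send_discord() {
  local text="$1" payload
  # json.dumps takes care of quoting and escaping the message text.
  payload="$(python3 -c 'import json,sys; print(json.dumps({"content": sys.argv[1]}))' "$text")"
  case "$(delivery_mode)" in
    webhook)
      curl -fsS -H "Content-Type: application/json" -d "$payload" \
        "$DISCORD_WEBHOOK_URL" ;;
    bot)
      curl -fsS -H "Content-Type: application/json" \
        -H "Authorization: Bot $DISCORD_BOT_TOKEN" -d "$payload" \
        "https://discord.com/api/v10/channels/$DISCORD_CHANNEL_ID/messages" ;;
    *)
      echo "no Discord delivery configured" >&2
      return 1 ;;
  esac
}
```

Running send_discord "watchdog delivery test" is one way to carry out the Discord delivery test before relying on real alerts.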

Quick Start

Run once manually:

bash "./scripts/gateway-watchdog.sh"

Optional environment variables:

export DISCORD_WEBHOOK_URL="https://discord.com/api/webhooks/..."
export DISCORD_BOT_TOKEN="<your_discord_bot_token>"
export DISCORD_CHANNEL_ID="<your_discord_channel_id>"
export GW_WATCHDOG_SOURCE="manual"
export GW_WATCHDOG_FAIL_THRESHOLD=2
export GW_WATCHDOG_COOLDOWN_SECONDS=300

Install LaunchAgent Fallback

Install and load with a 30-second interval:

bash "./scripts/install-launchd.sh" --interval 30 --load

Check status:

launchctl list | grep "com.openclaw.gateway-watchdog"

Configure Internal OpenClaw Cron

openclaw cron add \
  --name "gateway-watchdog-internal" \
  --cron "*/1 * * * *" \
  --session isolated \
  --message "Run bash /absolute/path/to/scripts/gateway-watchdog.sh. Announce only state changes." \
  --announce \
  --channel discord \
  --to "channel:<your_channel_id>" \
  --best-effort-deliver

Auto-Recovery (Optional)

Disabled by default:

export GW_WATCHDOG_ENABLE_RESTART=0

Enable bounded restart:

export GW_WATCHDOG_ENABLE_RESTART=1
export GW_WATCHDOG_MAX_RESTART_ATTEMPTS=2

Safety behavior:

Restarts only after the failure threshold has been reached
Caps restart attempts per incident window (GW_WATCHDOG_MAX_RESTART_ATTEMPTS)
Never reinstalls anything; this skill has no reinstall behavior
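
The safety rules above amount to a guard that must pass before any restart is attempted. A minimal sketch, assuming the caller tracks consecutive failures and the number of restart attempts already made in the current incident; may_restart is a hypothetical name, and the restart command itself is left out because this README does not name it.

```shell
#!/usr/bin/env bash
# Hypothetical guard: succeed (exit 0) only when one more restart is allowed.
# Args: consecutive failures, restart attempts already made in this incident.
may_restart() {
  local fails="$1" attempts="$2"
  # Auto-restart is opt-in and disabled by default.
  [ "${GW_WATCHDOG_ENABLE_RESTART:-0}" = "1" ] || return 1
  # Never restart before the failure threshold is reached.
  [ "$fails" -ge "${GW_WATCHDOG_FAIL_THRESHOLD:-2}" ] || return 1
  # Stop once the per-incident attempt budget is used up.
  [ "$attempts" -lt "${GW_WATCHDOG_MAX_RESTART_ATTEMPTS:-2}" ]
}
```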

Configuration Reference

Common variables:

GW_WATCHDOG_BASE_DIR (default: ~/.openclaw/watchdogs/gateway-discord)
GW_WATCHDOG_FAIL_THRESHOLD (default: 2)
GW_WATCHDOG_COOLDOWN_SECONDS (default: 300)
GW_WATCHDOG_HEALTH_TIMEOUT_MS (default: 10000)
GW_WATCHDOG_ENABLE_RESTART (default: 0)
GW_WATCHDOG_MAX_RESTART_ATTEMPTS (default: 2)
GW_WATCHDOG_KEEP_BACKUPS (default: 50)
GW_WATCHDOG_SOURCE (default: unknown)
GW_WATCHDOG_CONFIG_FILE (default: ~/.openclaw/openclaw.json)
GW_WATCHDOG_CONFIG_BASELINE_FILE (default: ~/.openclaw/openclaw.json.good)

Binary overrides:

OPENCLAW_BIN
PYTHON_BIN

Test Checklist

Use this before production rollout:

Syntax checks
- bash -n scripts/gateway-watchdog.sh
- bash -n scripts/install-launchd.sh
Manual smoke run
- GW_WATCHDOG_SOURCE=test bash scripts/gateway-watchdog.sh
Discord delivery test
- verify one test message arrives in your target channel
Failure test
- stop/impair Gateway and verify ALERT
Recovery test
- restore Gateway and verify RECOVERED

Troubleshooting

No Discord messages
- Check config.env values and token/channel correctness
- Validate bot has permission to post in target channel
Log shows "watchdog lock exists, exiting"
- Another run is active; this is expected for overlap protection
Repeated suppressed events
- Threshold and cooldown deduplication is working as intended; lower GW_WATCHDOG_FAIL_THRESHOLD or GW_WATCHDOG_COOLDOWN_SECONDS for more aggressive alerting
Gateway healthy but still alerting
- Re-run openclaw gateway status --json and openclaw health --json
- Ensure OPENCLAW_BIN resolves to the expected OpenClaw install
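
For context on the "watchdog lock exists, exiting" message: overlap protection of this kind is commonly built on an atomic mkdir. The sketch below shows that general technique and is not necessarily how gateway-watchdog.sh implements it; acquire_lock, release_lock, and the lock path in the usage comment are assumptions.

```shell
#!/usr/bin/env bash
# Hypothetical lock helpers based on an atomic directory create.
acquire_lock() {
  local lock_dir="$1"
  # mkdir either creates the directory or fails; only one caller can win.
  if mkdir "$lock_dir" 2>/dev/null; then
    return 0
  fi
  echo "watchdog lock exists, exiting" >&2
  return 1
}

release_lock() {
  rmdir "$1" 2>/dev/null || true
}

# Typical use at the top of a run (lock path is an assumption):
#   LOCK="$GW_WATCHDOG_BASE_DIR/lock"
#   acquire_lock "$LOCK" || exit 0
#   trap 'release_lock "$LOCK"' EXIT
```

If a run is killed before its exit trap fires, a lock like this stays behind, so a message that persists across many intervals may point to a stale lock rather than a genuinely overlapping run.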

Security Notes

Do not commit config.env (it contains credentials and channel IDs in real deployments)
Use minimum required Discord permissions for the bot
Prefer webhook mode for simple one-channel alerting
Keep GW_WATCHDOG_ENABLE_RESTART=0 until you are confident in detection quality

Publishing Notes

This repository is structured for ClawHub publishing:

clawhub publish . \
  --slug openclaw-gateway-watchdog-skill \
  --name "Gateway Watchdog Discord" \
  --version <x.y.z> \
  --changelog "..."
