Error Guard — Control‑Plane Safety

v1.0.0

System safety and control-plane skill that prevents agent deadlocks and freezes. Provides non-LLM control commands to inspect task state, flush message queues, cancel long-running work, and recover safely without restarting the container. Use when implementing or operating long-running tasks, sub-agents, benchmarks, background monitors (e.g., Moltbook, PNR checks), or when the system becomes unresponsive and needs immediate recovery controls.

8· 3.1k·13 current·13 all-time

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for amar1432/error-guard.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Error Guard — Control‑Plane Safety" (amar1432/error-guard) from ClawHub.
Skill page: https://clawhub.ai/amar1432/error-guard
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install error-guard

ClawHub CLI

Package manager switcher

npx clawhub@latest install error-guard
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
medium confidence
Purpose & Capability
The name/description (control‑plane safety, task registry, flush/recover) match the code: registry, status/flush/recover commands, spawn and session helpers, and watchdog logic are all present and serve the stated purpose.
Instruction Scope
SKILL.md promises no LLM calls, no payload storage, and minimal state; the code follows this model (only task metadata persisted). One minor mismatch: SKILL.md says 'no external I/O' but example worker (moltbook-worker) contains a placeholder fetch simulation (currently no real network call). The control APIs (sessions_send, sessions_spawn, process.list/kill) are platform SDK calls and are expected for a control‑plane skill, but they do give the skill wide runtime reach.
Install Mechanism
No install spec (no external downloads) which is low risk. However the package contains TypeScript source files; there is no build/install instruction in SKILL.md — ensure the platform will compile/load these sources as expected. There is also a small syntax/typo artifact (imports using 'as any') that should be reviewed before runtime.
Credentials
The skill declares no environment variables, credentials, or config paths and the code does not access secrets or external API keys. Persisted state is limited to a local state.json containing only task metadata (taskId, timestamps, state).
!
Persistence & Privilege
This skill has significant control-plane privileges: it can spawn sub‑agents (sessions_spawn), send session messages (sessions_send), list and kill exec/sessions (process.list/process.kill), and it writes a local state file. These capabilities are coherent with its purpose but mean the skill can affect other running tasks and system behavior — test in an isolated environment and restrict who can invoke it. Note: always:true is not set (good); autonomous invocation is allowed by default (platform default).
Assessment
What to consider before installing: - This is a control‑plane tool: it can spawn and kill sessions and will clear the in‑memory registry and attempt to kill exec sessions on /flush. Those are powerful actions that can interrupt other work; run it in a test environment first. - The skill does not ask for credentials and only persists minimal task metadata to skills/error-guard/state.json, but verify that path and file permissions are acceptable in your deployment. - Review the SDK calls (sessions_spawn, sessions_send, process.list, process.kill) to confirm they are scoped to the agent/session boundaries you expect — ensure it cannot inadvertently kill unrelated system processes. - There are a few minor code hygiene issues (odd 'as any' import syntax) — ensure the code compiles or is adapted to your runtime before enabling it in production. - If you want to reduce risk, restrict who can invoke this skill, avoid giving it always:true, and test its /flush behavior to confirm it only affects intended sessions.

Like a lobster shell, security has layers — review code before you run it.

latestvk979q0y7631c86zsxndhaade3n80aff8orchestrationvk979q0y7631c86zsxndhaade3n80aff8reliabilityvk979q0y7631c86zsxndhaade3n80aff8safetyvk979q0y7631c86zsxndhaade3n80aff8sub-agentvk979q0y7631c86zsxndhaade3n80aff8systemvk979q0y7631c86zsxndhaade3n80aff8watchdogvk979q0y7631c86zsxndhaade3n80aff8
3.1kdownloads
8stars
1versions
Updated 2mo ago
v1.0.0
MIT-0

error-guard

⚠️ System‑level skill (Advanced users)

This skill defines the control‑plane safety primitives for OpenClaw. It is intentionally minimal, non‑blocking, and designed to prevent agent freezes, deadlocks, and unrecoverable states when running long‑lived or high‑risk workloads.

Design Principles

Warning: This skill operates at the agent control‑plane level. It should be installed only by users who understand OpenClaw’s execution model and are running workloads that can block, hang, or run for extended periods.

  • Main agent never blocks: no long exec, no external I/O, no LLM calls.
  • Event-driven: workers emit events; the control plane listens.
  • Fail-safe first: recovery commands must always respond.
  • Minimal state: track only task metadata (never payloads).

Command Surface (Phase 1)

/status

Report current system health and task registry state.

Returns:

  • Active tasks (taskId, type, state)
  • Start time and last heartbeat
  • Flags for stalled or overdue tasks

Constraints:

  • Must run in constant time
  • Must not call any model or external API

/flush

Emergency stop.

Immediately:

  • Cancel all active tasks
  • Kill active exec/process sessions
  • Clear pending message queue
  • Reset in-memory task registry

Constraints:

  • Must always respond
  • No waiting on workers
  • No model calls

/recover

Safe recovery sequence.

Steps:

  1. Execute /flush
  2. Reset control-plane state
  3. Optionally reload skills/state (no container restart)

Future Extensions (Not Implemented Yet)

  • Sub-agent runner helper (event-driven)
  • Task watchdogs with TTL and silence detection
  • Structured event protocol (task.started, task.heartbeat, task.completed, ...)
  • Back-pressure and task classes (interactive / batch / background)

Security & Privacy

  • This skill does not store payloads, prompts, messages, or model outputs
  • Only minimal task metadata is persisted (taskId, timestamps, state)
  • No API keys, credentials, or user data are read or written
  • Safe to publish and share publicly

Non-Goals

  • No business logic
  • No background polling loops
  • No user-facing features
  • No LLM reasoning paths

This skill is the last line of defense. Keep it small, fast, and reliable.

Comments

Loading comments...