Guardrail Bridge

ReviewAudited by ClawScan on May 12, 2026.

Overview

Guardrail Bridge appears to be a disclosed pre-agent moderation plugin that can block messages locally or send them to configured security providers, with no artifact-backed evidence of hidden exfiltration or destructive behavior.

Before installing, decide whether you want automatic pre-agent filtering. If privacy is important, prefer the local blacklist mode; if using HTTP providers, configure trusted endpoints, store API keys in environment variables, and review fallback behavior so provider outages either pass or block according to your needs.

Findings (4)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

What this means

When enabled, the plugin can stop some requests before the agent sees them.

Why it was flagged

The plugin hooks the before_dispatch event so it can decide whether a user message should be passed through or blocked.

Skill content
api.on("before_dispatch", async (event, ctx) => { ... return handler(event, ctx); });
Recommendation

Use clear configuration and test the selected blacklist or provider behavior so legitimate workflows are not unexpectedly blocked.

What this means

Provider API keys may be needed and will be used to authenticate moderation requests.

Why it was flagged

The skill may use provider API keys for remote moderation, which is expected for its HTTP connector mode and is documented.

Skill content
Use provider-specific environment variable names ... `DKNOWNAI_API_KEY`, `SECRA_API_KEY`, or `HIDYLAN_API_KEY`.
Recommendation

Prefer environment variables over plaintext config, scope provider keys appropriately, and rotate them if exposed.

What this means

User prompts and some session or channel/user-derived metadata may be shared with the selected remote moderation provider.

Why it was flagged

In HTTP mode, the plugin sends the message text and a session identifier to the configured DKnownAI moderation endpoint.

Skill content
body: JSON.stringify({ request_id: requestId, session_id: sessionId, input: text })
Recommendation

Enable HTTP providers only if their privacy terms are acceptable; use the local blacklist connector when prompts should not leave the local environment.

What this means

Once enabled, the guardrail runs automatically for configured dispatches until disabled or stopped.

Why it was flagged

The plugin is configured to start with OpenClaw when enabled, which is normal for a pre-agent guardrail but means it operates persistently during the session.

Skill content
"activation": { "onStartup": true }
Recommendation

Install it only if you want ongoing pre-dispatch filtering, and review the enabled connector and per-channel settings.