Install
openclaw skills install evidence-gateGenerates evidence obligations for a claim or action, evaluates existing evidence against them, and returns a structured verdict (PASS / SOFT_PASS / BLOCK / CONFLICT) with safe downgrade guidance. TRIGGER when the agent is about to present a root-cause diagnosis, claim 'the cause is X', say 'this is safe', recommend a destructive or irreversible action, recommend a rollback, make a safety assertion, or state a strong conclusion based on limited evidence. DO NOT TRIGGER when summarizing, formatting, brainstorming possibilities, performing low-risk reversible operations, or when the caller already has its own evidence-validation step.
openclaw skills install evidence-gateUse this skill to insert a lightweight evidence gate into an existing workflow without replacing the workflow.
Its purpose is not to make the caller more cautious — capable agents are already cautious. Its purpose is to make that caution structured, auditable, and actionable by answering a narrower question:
What evidence must exist before this conclusion or action is responsible enough to present, recommend, or execute?
Treat the caller's conclusion or action as tentative until the gate returns a verdict.
Keep the skill lightweight, selective, and non-blocking by default.
This skill gates the agent's own reasoning quality — not the user's intent.
It is not:
Given a tentative claim or action, do three things:
Do not fully own evidence collection. Recommend missing evidence for the caller to gather using its existing tools.
Use a single-pass gate instead of taking over the full workflow:
Assume no durable skill state across calls. Do not require a second gate pass unless the caller explicitly chooses to orchestrate one outside this skill.
Use this skill when one or more of the following are true:
Do not use this skill when:
This skill must preserve the caller's original capability as much as possible.
It should:
Apply these defaults unless the caller provides stricter policy:
2-5 concrete evidence obligations.The only required input is the claim — the conclusion, diagnosis, recommendation, or action under consideration.
Invocation examples:
/evidence-gate "The root cause is a nil dereference in request parsing"/evidence-gate "Safe to delete the staging database"When invoked with just a claim, the skill infers the remaining context:
claim_type: inferred from the claim language (e.g., "the cause is" → diagnosis, "safe to" → safety, "should delete" → action)domain: inferred from the current working contextrisk_level: inferred from the action's reversibility and blast radiusexecution_mode: inferred from whether the caller is informing, recommending, or about to executetarget_strength: inferred from the claim's language strengthThe caller may optionally provide any of these fields to override inference.
Use references/input-template.md when a caller wants a canonical explicit input shape.
See references/protocol.md for the full schema semantics.
The skill should return a structured gate result containing:
Return JSON matching references/output-template.md.
Use references/verdict-schema.json as the machine-checkable schema.
Keep gate_required even on explicit invocation.
Use gate_required = false as a fast exit when the claim is already low-risk, exploratory, or sufficiently bounded.
Use exactly these verdicts:
PASS
SOFT_PASS
BLOCK
CONFLICT
Reduce the caller's current position to a tentative, explicit candidate. If the caller already states the final conclusion as settled, rewrite it internally as tentative before gating it.
Translate the candidate claim/action into a small set of concrete evidence requirements.
Good evidence requirements are:
Bad evidence requirements are vague, such as:
Determine whether currently known evidence satisfies the requirements.
The skill should explicitly mark:
satisfiedmissingconflictingnot_applicableReturn a verdict immediately after evaluating known evidence. If evidence is missing, identify only the smallest set of additional checks that would materially change the verdict.
If evidence is insufficient, prefer one of:
Do not hard-block low-risk work unnecessarily.
Assume every call is fresh. Do not depend on remembering prior requirements, prior verdicts, or prior collection attempts unless the caller explicitly embeds them in the current input.
Do not require access to hidden chain-of-thought. Judge only from explicit claim, explicit evidence, explicit policy, and explicit outputs.
PASS with rationale.Bias toward using this skill when any of the following are present:
risk_level = highexecution_mode = autoBias away from using this skill when:
risk_level = lowWhen the gate does not fully pass, prefer these downgrades:
When the verdict is not PASS, the caller should avoid overstating certainty.
Good examples:
Bad examples:
This skill is not:
Its job is narrower: make evidence obligations explicit, assess whether they are met, and enforce safe downgrade behavior when they are not.