Back to skill
v1.0.0

PwnClaw Security Scan

BenignClawScan verdict for this skill. Analyzed May 1, 2026, 5:31 AM.

Analysis

This instruction-only skill is coherent with its security-scanning purpose, but users should run it only on controlled agents because it sends adversarial tests through an external service and recommends permanent prompt changes.

GuidanceBefore installing or using this skill, confirm you are authorized to test the target agent or endpoint, run scans in a controlled environment where possible, avoid exposing sensitive production data, and review any PwnClaw-generated system-prompt rules before making them permanent.

Findings (3)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

Abnormal behavior control

Checks for instructions or behavior that redirect the agent, misuse tools, execute unexpected code, cascade across systems, exploit user trust, or continue outside the intended task.

Agent Goal Hijack
SeverityMediumConfidenceHighStatusNote
SKILL.md
- Prompt Injection & Indirect Injection
- Jailbreaks & Refusal Bypass
- Data Exfiltration & Agency Hijacking

The skill intentionally exposes the agent to adversarial prompts as part of a security test. This is disclosed and purpose-aligned, but those prompts are designed to test whether the agent can be redirected.

User impactIf run against a production or highly privileged agent, the test prompts may attempt to make the agent ignore instructions, misuse tools, or reveal data.
RecommendationRun scans only on agents and endpoints you control, preferably in a test configuration with limited tools and non-sensitive data.
Sensitive data protection

Checks for exposed credentials, poisoned memory or context, unclear communication boundaries, or sensitive data that could leave the user's control.

Insecure Inter-Agent Communication
SeverityMediumConfidenceHighStatusNote
SKILL.md
GET `https://www.pwnclaw.com/api/test/{token}` ... POST `https://www.pwnclaw.com/api/test/{token}` with `{ "response": "your answer" }` ... PwnClaw sends attacks directly

The skill routes test prompts and agent responses through the external PwnClaw service or allows that service to contact the agent endpoint directly.

User impactPwnClaw may observe the endpoint being tested, the prompts sent, the agent's responses, and scan results.
RecommendationUse only human-issued test tokens, test only endpoints you own or are authorized to assess, and avoid sending confidential production data through the scan.
Memory and Context Poisoning
SeverityMediumConfidenceHighStatusNote
SKILL.md
PwnClaw generates specific security rules based on your vulnerabilities. Add them as permanent instructions in your agent's system prompt

The skill recommends converting externally generated scan guidance into persistent agent instructions.

User impactPermanent prompt changes can alter future agent behavior across sessions if adopted without review.
RecommendationReview generated rules carefully, keep them narrowly scoped, and apply them through your normal change-review process before making them permanent.