Arena Council — Multi-Model AI Debate

Suspicious

Audited by ClawScan on May 10, 2026.

Overview

The skill mainly provides local multi-model voting. When the optional “god mode” integration is available, it also automatically applies jailbreak and probing behavior that can rewrite prompts and attempt to bypass model refusals.

Install only if you intentionally want a local multi-model council that may also use god-mode refusal-bypass behavior when available. Review or remove the god-mode integration before using it with sensitive prompts, and prefer explicitly selecting trusted local models.

Findings (4)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

Concern (High Confidence)
ASI01: Agent Goal Hijack
What this means

Your request may be transformed into a jailbreak-style prompt without an explicit confirmation each time, potentially producing outputs you did not intend to solicit.

Why it was flagged

The skill can automatically rewrite the user's prompt using a stored bypass technique when a model is marked censored, changing the prompt semantics and attempting to override model refusals.

Skill content
If model is known-censored, wrap prompt with the stored technique... return apply_technique(prompt, technique)
Recommendation

Make any refusal-bypass behavior explicit, disabled by default, and require a clear user opt-in before modifying prompts.
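
One way to follow this recommendation is to gate every prompt transformation behind a setting that defaults to off. The sketch below is illustrative only: `CouncilConfig`, `allow_prompt_rewrite`, and `prepare_prompt` are hypothetical names that do not appear in the skill, and the two-argument shape of `apply_technique` is assumed from the call in the excerpt above.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class CouncilConfig:
    # Hypothetical setting: prompt rewriting stays off unless the user enables it.
    allow_prompt_rewrite: bool = False


def prepare_prompt(
    prompt: str,
    technique: Optional[str],
    config: CouncilConfig,
    apply_technique: Callable[[str, str], str],
) -> str:
    """Return the prompt unchanged unless the user has explicitly opted in."""
    if technique is None or not config.allow_prompt_rewrite:
        return prompt
    # Only reached after an explicit opt-in; the transform itself is injected
    # by the caller so this function never rewrites anything on its own.
    return apply_technique(prompt, technique)
```

With the default `CouncilConfig()`, the prompt a user types is always the prompt the models receive.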

What this means

Prompts given to this skill may be sent to multiple locally running models rather than just one model.

Why it was flagged

The skill sends prompts to a local LM Studio chat-completions API, which is expected for a local multi-model council but is still a real model-invocation data flow.

Skill content
LMSTUDIO_BASE = "http://127.0.0.1:1234/v1" ... session.post(f"{LMSTUDIO_BASE}/chat/completions", json=payload
Recommendation

Use explicit model lists and avoid submitting sensitive prompts unless you are comfortable with all selected local models receiving them.
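
A minimal sketch of what an explicit model list could look like in practice, assuming the caller maintains its own set of trusted model identifiers. `select_models` and the identifiers in the usage note are hypothetical and are not taken from the skill.

```python
from typing import Iterable, List, Set


def select_models(requested: Iterable[str], trusted: Set[str]) -> List[str]:
    """Return the requested models only if every one is on the trusted list."""
    requested = list(requested)
    if not requested:
        # Refuse to fall back to "every loaded model" when nothing is named.
        raise ValueError("pass an explicit model list")
    unknown = [m for m in requested if m not in trusted]
    if unknown:
        raise ValueError(f"untrusted models requested: {unknown}")
    return requested
```

For example, `select_models(["model-a"], {"model-a", "model-b"})` returns `["model-a"]`, while an empty request or a name outside the trusted set raises `ValueError` before any prompt is sent.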

What this means

If a god-mode folder exists, this skill may execute additional Python code that was not included in this review and may change how prompts are handled.

Why it was flagged

The skill conditionally adds a sibling god-mode scripts directory to Python's import path and imports modules from it, so runtime behavior depends on unbundled code outside this skill's reviewed files.

Skill content
GOD_MODE_DIR = Path(__file__).parent.parent.parent / "god-mode" / "scripts" ... sys.path.insert(0, str(GOD_MODE_DIR))
Recommendation

Declare this dependency clearly, pin or bundle reviewed helper code, and avoid importing executable code from sibling skill directories by default.
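
The pinning part of this recommendation could be approached by checking helper files against known SHA-256 digests before the directory is ever added to `sys.path`. This is a sketch under stated assumptions: the `ARENA_ENABLE_HELPERS` environment variable, the `helper_dir_is_trusted` function, and the `pinned` mapping are all hypothetical, and the digests would have to come from a review of the helper code.

```python
import hashlib
import os
from pathlib import Path
from typing import Mapping


def helper_dir_is_trusted(helper_dir: Path, pinned: Mapping[str, str]) -> bool:
    """Check an optional helper directory against pinned SHA-256 digests."""
    # Hypothetical opt-in switch: helpers are never loaded by default.
    if os.environ.get("ARENA_ENABLE_HELPERS") != "1":
        return False
    if not pinned or not helper_dir.is_dir():
        return False
    # Reject the directory if it holds any top-level .py file that was not
    # reviewed, since adding it to sys.path would make that file importable.
    present = {p.name for p in helper_dir.glob("*.py")}
    if present != set(pinned):
        return False
    for name, expected in pinned.items():
        digest = hashlib.sha256((helper_dir / name).read_bytes()).hexdigest()
        if digest != expected:
            return False
    return True
```

A caller would run `sys.path.insert(0, str(helper_dir))` only when this returns `True`. The check covers top-level `.py` files only, so subpackages and a file changing between the check and the import are outside what it guards against.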

What this means

A saved profile can affect later sessions, causing future prompts to be modified based on prior or potentially altered profile state.

Why it was flagged

The skill stores and later reuses persistent model-profile data that determines whether and how future prompts are wrapped with god-mode techniques.

Skill content
profiles_file = GOD_MODE_DIR / "model_profiles.json" ... profiles[model_id] = result ... save_profiles(profiles)
Recommendation

Store profiles in a clearly scoped location, show users when profile state is used, provide a disable/clear option, and validate profile contents before applying prompt transformations.
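
Validating profile contents before use might look like the sketch below. The real layout of `model_profiles.json` is not shown in the reviewed excerpt, so the schema here (a `censored` boolean and an optional `technique` string per model) is an assumption made purely for illustration, as are the names `ALLOWED_KEYS` and `load_validated_profiles`.

```python
import json
from pathlib import Path
from typing import Any, Dict

# Hypothetical schema; the skill's actual profile fields are not known here.
ALLOWED_KEYS = {"censored", "technique"}


def load_validated_profiles(path: Path) -> Dict[str, Dict[str, Any]]:
    """Load profiles, dropping any entry that does not match the schema."""
    try:
        raw = json.loads(path.read_text(encoding="utf-8"))
    except (OSError, ValueError):
        # Missing, unreadable, or malformed file: behave as if no profiles exist.
        return {}
    if not isinstance(raw, dict):
        return {}
    valid: Dict[str, Dict[str, Any]] = {}
    for model_id, profile in raw.items():
        if not isinstance(profile, dict) or set(profile) - ALLOWED_KEYS:
            continue
        if not isinstance(profile.get("censored"), bool):
            continue
        technique = profile.get("technique")
        if technique is not None and not isinstance(technique, str):
            continue
        valid[model_id] = profile
    return valid
```

Treating a malformed file as empty means a corrupted or tampered profile store results in unmodified prompts instead of unexpected transformations.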