Clonev

AdvisoryAudited by Static analysis on Apr 30, 2026.

Overview

No suspicious patterns detected.

Findings (0)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

What this means

A user or agent could generate speech that appears to come from a real person who did not consent.

Why it was flagged

The primary description frames cloning other people's or celebrity voices as an intended use, which can create audio that listeners may trust as authentic.

Skill content
Clone their voice or someone else's voice... Works with: Any voice! Yours, a celebrity, a character, etc.
Recommendation

Use only with explicit consent, add clear labeling/watermarking guidance, and require the agent to verify consent before cloning a third party's voice.

What this means

Cloned voice audio could be sent externally before the user reviews the exact file, recipient, or impersonation risk.

Why it was flagged

The quick-reference workflow makes sending the generated cloned voice to Telegram part of the agent action sequence, without a separate confirmation or recipient-scoping step.

Skill content
→ Run: VOICE=$(...clonev.sh "hello" "/path/to/sample.wav" en)
→ Send: message action=send channel=telegram asVoice=true filePath="$VOICE"
Recommendation

Default to saving the generated audio locally and require explicit user approval, destination, and context before sending it through any messaging channel.

What this means

Sensitive voice samples may remain on disk and could be reused in later runs or confused with another person's sample by filename collision.

Why it was flagged

The script copies the user-provided voice sample into a persistent hard-coded directory and does not delete it; if a filename already exists, it reuses the old copy instead of the current file.

Skill content
if [ ! -f "${COQUI_DIR}/voice-samples/${SAMPLE_BASENAME}" ]; then
    cp "$VOICE_SAMPLE" "${COQUI_DIR}/voice-samples/"
fi
Recommendation

Use per-run temporary sample paths, clean up voice samples by default, avoid basename reuse, and clearly disclose any retention option to the user.

What this means

Future runs may execute different container code than the code reviewed here.

Why it was flagged

The skill runs an external Docker image tagged latest, so the executed runtime can change over time; this is purpose-aligned for XTTS but should be pinned and disclosed.

Skill content
docker run --rm --entrypoint "" ... ghcr.io/coqui-ai/tts:latest ...
Recommendation

Pin the container image by version or digest and declare Docker, ffmpeg, model download size, and trusted image provenance in the install requirements.