Skill flagged — suspicious patterns detected

ClawHub Security flagged this skill as suspicious. Review the scan results before using.

智能配音合成虾

v1.0.0

AI智能配音合成专家。将文案/脚本转换为高拟真语音音频,支持多种音色、情感控制、SSML标注和后期处理。 触发场景:用户说"配音"、"语音合成"、"TTS"、"旁白"、"播客音频"、"有声读物"、"AI配音"、"朗读"、"音频生成", 或要求"用XX声音读这段文案"、"生成播客音频"、"把文章转成有声版"等。 支...

0· 94·0 current·0 all-time
byRicky@tujinsama

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for tujinsama/ai-voice-synthesis-claw.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "智能配音合成虾" (tujinsama/ai-voice-synthesis-claw) from ClawHub.
Skill page: https://clawhub.ai/tujinsama/ai-voice-synthesis-claw
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Canonical install target

openclaw skills install tujinsama/ai-voice-synthesis-claw

ClawHub CLI

Package manager switcher

npx clawhub@latest install ai-voice-synthesis-claw
Security Scan
Capability signals
Requires sensitive credentials
These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.
VirusTotalVirusTotal
Suspicious
View report →
OpenClawOpenClaw
Suspicious
high confidence
!
Purpose & Capability
The skill's stated purpose is text→TTS using ElevenLabs/OpenAI/Azure/system TTS, which matches the included synthesize-voice.py for ElevenLabs and OpenAI; however the registry metadata declares no required env vars or credentials while both SKILL.md and the script expect ELEVENLABS_API_KEY and OPENAI_API_KEY (SKILL.md also lists AZURE_SPEECH_KEY and region but the script does not implement Azure). This mismatch between claimed requirements and actual code is incoherent.
Instruction Scope
SKILL.md provides a clear TTS workflow and example commands that invoke scripts/synthesize-voice.py and post-processing. However the docs demonstrate running echo "ElevenLabs: $ELEVENLABS_API_KEY" which would print API keys to stdout/logs (a potential secret-leak risk). The instructions ask the agent to read script files and write output audio files (expected), and there are no instructions to exfiltrate data to unexpected endpoints. The guide suggests installing dependencies via pip/brew but there is no install spec in the metadata.
Install Mechanism
There is no automated install spec (instruction-only plus a Python script). That is the lower-risk model because nothing is automatically downloaded or executed during install. The SKILL.md suggests pip/brew commands for dependencies, which is expected for a Python-based TTS script but will run arbitrary package installs if followed by a user.
!
Credentials
The package metadata declares no required environment variables, but the script reads ELEVENLABS_API_KEY and OPENAI_API_KEY from the environment and SKILL.md also references AZURE_SPEECH_* keys. Requiring API keys for the listed TTS services is reasonable, but the omission from metadata is inconsistent and the SKILL.md example of echoing env vars risks exposing secrets. There are no other unnecessary credentials requested.
Persistence & Privilege
The skill does not request privileged persistence (always:false) and does not modify other skills or system-wide configs. It only runs as a normal user CLI script and writes generated audio files to the working directory.
What to consider before installing
This skill appears to implement TTS via ElevenLabs and OpenAI, but there are a few red flags you should consider before installing or supplying API keys: - Metadata vs. reality: The registry metadata lists no required environment variables, but the included script requires ELEVENLABS_API_KEY and OPENAI_API_KEY. Confirm with the author or expect to provide those keys. - Azure mismatch: SKILL.md mentions Azure credentials, but the script does not implement Azure TTS — ask the maintainer for clarification if you need Azure support. - Secret exposure: SKILL.md shows examples that echo environment variables (e.g., echo "ElevenLabs: $ELEVENLABS_API_KEY"). Avoid executing such commands in shared or logged environments since they may expose your API keys in logs. Instead, verify keys privately or use secure tooling to manage secrets. - Dependency installation: The instructions tell you to pip install packages and brew install ffmpeg. Only install these in a trusted/isolated environment (virtualenv/container) to limit risk. - Voice cloning / copyright: The skill notes voice cloning requires authorization. Do not pass audio samples or use someone else's voice without consent. Suggested actions before use: inspect the code (you already have synthesize-voice.py), run it in an isolated environment, provide API keys with least-privilege credentials or test keys, and request the publisher update the skill metadata to list required env vars and clarify Azure support.

Like a lobster shell, security has layers — review code before you run it.

latestvk977grxqdqxy3z9w8ebs64f7hh8594bv
94downloads
0stars
1versions
Updated 6d ago
v1.0.0
MIT-0

智能配音合成虾 (ai-voice-synthesis-claw)

将文字转化为有温度的声音。

工作流程

步骤 1:理解需求

收集以下信息(未提供时使用默认值):

  • 文本内容:待配音的文案/脚本
  • 音色风格:参考 references/voice-style-guide.md 选择合适音色
  • 语速:slow / normal(默认)/ fast
  • 情感:calm / warm / professional / energetic
  • 输出格式:mp3(默认)/ wav

步骤 2:文本预处理

在调用 TTS 前对文本进行处理:

  • 分句断句(按标点符号)
  • 数字转中文(100 → 一百)
  • 多音字标注(如"重要"的"重")
  • 添加停顿标记

步骤 3:选择 TTS 引擎

按优先级选择可用引擎:

  1. ElevenLabs(推荐):最自然,支持情感控制,需 ELEVENLABS_API_KEY
  2. OpenAI TTS:质量稳定,需 OPENAI_API_KEY
  3. Azure TTS:多语言支持,需 AZURE_SPEECH_KEY + AZURE_SPEECH_REGION
  4. 系统 TTS(兜底):使用 tts 工具直接合成(无需 API key,质量较低)

检查环境变量确认可用引擎:

echo "ElevenLabs: $ELEVENLABS_API_KEY" && echo "OpenAI: $OPENAI_API_KEY"

步骤 4:生成 SSML(可选,精细控制时使用)

参考 references/ssml-guide.md 为文本添加 SSML 标注。 简单场景可跳过,直接传纯文本。

步骤 5:调用合成脚本

# 单段文本合成
python3 scripts/synthesize-voice.py \
  --text "你好,欢迎收听本期节目" \
  --voice warm-female \
  --speed normal \
  --output ./output.mp3

# 从文件合成
python3 scripts/synthesize-voice.py \
  --script ./script.txt \
  --voice professional-male \
  --speed fast \
  --output ./output.mp3

# 添加背景音乐
python3 scripts/synthesize-voice.py \
  --script ./script.txt \
  --bgm ./bgm/light-jazz.mp3 \
  --bgm-volume 0.1 \
  --output ./output.mp3

步骤 6:后期处理

参考 references/audio-processing-guide.md,脚本自动完成:

  • 降噪处理
  • 音量标准化(-14 LUFS)
  • 背景音乐混音(可选)
  • 格式转换

步骤 7:交付

将生成的音频文件发送给用户:

合成完成!这是你的配音文件。
MEDIA:./output.mp3

音色快速参考

场景推荐音色
知识科普professional-male / professional-female
情感故事warm-female
商业广告magnetic-male
轻松娱乐young-energetic

详细音色库见 references/voice-style-guide.md

环境依赖

pip install elevenlabs openai pydub requests
brew install ffmpeg  # macOS

注意事项

  • 单次合成建议不超过 10 分钟音频
  • 音色克隆需至少 1 分钟清晰样本音频
  • 使用他人声音克隆需获得授权
  • 无 API key 时降级使用系统 tts 工具

Comments

Loading comments...