Mimotts25

v2.5.2

小米大模型 MiMo TTS 2.5 语音合成。支持多种预设音色(中文/英文/默认)、风格控制(情感、方言、角色扮演、语速)、音频标签精细控制。Use when the user asks to convert text to speech, generate audio, read text aloud wit...

0· 118·0 current·0 all-time

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for aaroncxxx/mimotts25.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Mimotts25" (aaroncxxx/mimotts25) from ClawHub.
Skill page: https://clawhub.ai/aaroncxxx/mimotts25
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Required env vars: MIMO_API_KEY
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install mimotts25

ClawHub CLI

Package manager switcher

npx clawhub@latest install mimotts25
Security Scan
Capability signals
Requires sensitive credentials
These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description (MiMo TTS) match the included script and SKILL.md. The skill only requires a single TTS API key (MIMO_API_KEY) which is appropriate for a TTS integration. No unrelated binaries, config paths, or extra credentials are requested.
Instruction Scope
SKILL.md instructs the agent to call the bundled Python script to POST text to the MiMo TTS API and write the returned audio to a file. The instructions reference only the API key, command arguments, and an output file; they do not instruct reading arbitrary host files, other env vars, or sending data to third parties beyond the documented API endpoint.
Install Mechanism
No install spec is provided (instruction/script-only); the included Python script uses only standard library modules and does not pull code from external URLs or install packages. This is low-risk from an installation perspective.
Credentials
Only one environment variable (MIMO_API_KEY) is required and it directly corresponds to the stated purpose. The SKILL.md also documents storing the key in OpenClaw config or env var. No extra secrets, keys, or unrelated service credentials are requested.
Persistence & Privilege
The skill is not marked always:true and does not claim to modify other skills or system-wide settings. It writes only the output audio file specified by the user. Autonomous invocation is allowed by default but is expected for skills and is not combined with other high‑risk behaviors here.
Assessment
This skill appears to be a straightforward MiMo TTS client. Before installing, verify that the API endpoint (https://api.xiaomimimo.com) is the correct/official MiMo service you intend to use and only supply a TTS-specific API key (do not reuse high‑privilege keys). The script will send provided text to that endpoint and save the returned audio locally; review network access policies if you need to restrict outbound calls. If you need higher assurance, inspect the script locally (scripts/tts.py) and test with a limited or test API key first.

Like a lobster shell, security has layers — review code before you run it.

Runtime requirements

🎙️ Clawdis
EnvMIMO_API_KEY
Primary envMIMO_API_KEY
latestvk97emads245famha83jv2qmf7985dryg
118downloads
0stars
4versions
Updated 4d ago
v2.5.2
MIT-0

MiMo TTS 2.5.2 — 语音合成

小米大模型 MiMo TTS 2.5.2 版本,高质量中文/英文语音合成。

🆕 2.5.2 版本更新

新增音色

  • mimo_male - 男声音色
  • mimo_child - 童声音色
  • mimo_cantonese - 粤语音色
  • mimo_sichuan - 四川话语音

新增音频格式

  • mp3 - 更小的文件大小,适合网络传输
  • ogg - 更好的兼容性,开源格式

优化功能

  • ✅ 重试机制:自动处理API限流
  • ✅ 改进错误处理:更详细的错误信息
  • ✅ 超时设置:避免长时间等待

首次配置

⚠️ TTS 的 API Key 独立于模型推理 Key。 即使 mimo-v2-pro 能正常调用,TTS 仍需单独配置 Key。

  1. 前往小米 MiMo 开放平台获取 TTS API Key:https://api.xiaomimimo.com
  2. 通过 OpenClaw 配置:
openclaw config set skills.entries.mimotts25.apiKey "your-tts-api-key-here"

或直接设置环境变量 MIMO_API_KEY。 配置后需重启会话。

故障排查

错误原因解决
401 Invalid API KeyAPI Key 未传入或格式不对确认已用 config set 配置 TTS 专用 Key,重启会话
工具调用被 abort上下文过长或系统繁忙等几秒后重试

生成语音

使用 scripts/tts.py 合成语音:

python3 "{baseDir}/scripts/tts.py" "要合成的文本" -o output.wav

参数说明

参数默认值说明
-ooutput.wav输出文件路径
-vmimo_default音色:mimo_defaultdefault_zhdefault_enmimo_malemimo_childmimo_cantonesemimo_sichuan
-s风格标签,如 开心东北话悄悄话孙悟空
-fwav音频格式:wavmp3ogg
--user-msg可选的用户角色上下文,用于调整语气
--api-key环境变量 MIMO_API_KEYAPI Key 覆盖
--max-retries3API调用最大重试次数
--list-voices列出所有可用音色
--list-formats列出所有可用音频格式

使用示例

# 基础合成
python3 "{baseDir}/scripts/tts.py" "你好,今天天气真好" -o hello.wav

# 方言风格
python3 "{baseDir}/scripts/tts.py" "哎呀妈呀,这天儿也忒冷了吧" -s "东北话" -o dongbei.wav

# 英文音色
python3 "{baseDir}/scripts/tts.py" "Hello, how are you today?" -v default_en -o hello_en.wav

# 情感 + 语速
python3 "{baseDir}/scripts/tts.py" "明天就是周五了,真开心!" -s "开心 变快" -o happy.wav

# 唱歌
python3 "{baseDir}/scripts/tts.py" "一闪一闪亮晶晶" -s "唱歌" -o sing.wav

# 🆕 2.5.2 新功能示例

# 男声音色
python3 "{baseDir}/scripts/tts.py" "大家好,我是你们的主持人" -v mimo_male -o male.wav

# 童声音色
python3 "{baseDir}/scripts/tts.py" "妈妈,我想吃糖" -v mimo_child -o child.wav

# 粤语
python3 "{baseDir}/scripts/tts.py" "你好,今日天气好好" -v mimo_cantonese -o cantonese.wav

# 四川话
python3 "{baseDir}/scripts/tts.py" "这个火锅巴适得很" -v mimo_sichuan -o sichuan.wav

# MP3格式(更小的文件)
python3 "{baseDir}/scripts/tts.py" "今天心情不错" -f mp3 -o output.mp3

# OGG格式(开源格式)
python3 "{baseDir}/scripts/tts.py" "测试音频" -f ogg -o output.ogg

# 列出所有可用音色
python3 "{baseDir}/scripts/tts.py" --list-voices

# 列出所有可用音频格式
python3 "{baseDir}/scripts/tts.py" --list-formats

风格与音频标签

  • 在文本开头使用 <style>风格</style> 设置整体风格
  • 行内音频标签精细控制:(紧张)(小声)(语速加快)(深呼吸)(苦笑)(沉默片刻)
  • 多风格组合:<style>开心 变快</style>文本内容

🆕 2.5.2 新增风格标签

  • (停顿) - 自然停顿
  • (叹气) - 叹气声
  • (笑声) - 轻笑
  • (清嗓子) - 清嗓子声音
  • (耳语) - 耳语效果

音色列表

名称voice 参数
MiMo-默认mimo_default
MiMo-中文女声default_zh
MiMo-英文女声default_en
MiMo-男声mimo_male
MiMo-童声mimo_child
MiMo-粤语mimo_cantonese
MiMo-四川话mimo_sichuan

参考风格

风格适用场景
可爱撒娇、软萌
开心欢快、兴奋
东北话方言、搞笑
悄悄话神秘、低语
孙悟空角色扮演
唱歌儿歌、旋律
变快 / 变慢语速控制
🆕 悲伤悲伤、失落
🆕 愤怒愤怒、激动
🆕 平静平静、舒缓
🆕 惊讶惊讶、意外
可自由组合开心 变快可爱 悄悄话悲伤 变慢

交付

生成音频后,用 MEDIA: 指令交付给用户:

MEDIA:output.wav

📋 版本历史

v2.5.2 (2026-04-23)

  • ✨ 新增4种音色:男声、童声、粤语、四川话
  • ✨ 新增音频格式:mp3、ogg
  • ✨ 新增重试机制,自动处理API限流
  • ✨ 改进错误处理,提供更详细的错误信息
  • ✨ 新增风格标签:停顿、叹气、笑声、清嗓子、耳语
  • ✨ 新增情感风格:悲伤、愤怒、平静、惊讶
  • ✨ 新增命令行参数:--list-voices、--list-formats
  • ✨ 新增max-retries参数,可配置重试次数

v2.5.1

  • 🐛 修复了一些已知问题
  • 🚀 性能优化

v2.5.0

  • 🎉 初始版本
  • ✨ 支持3种基础音色
  • ✨ 支持风格控制和音频标签
  • ✨ 支持wav格式

Comments

Loading comments...