article-tts
v2.1.0拍照或文字转音频:文章照片 OCR 提取文字,或直接接收文字,生成 Microsoft Edge TTS 语音,支持中英文、自动转写、语速调节、逐句拆分。| Capture article photos (OCR) or plain text, generate natural audio via Edge TT...
⭐ 1· 116·0 current·0 all-time
by退役前写代码的@54meteor
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
OpenClaw
Benign
medium confidencePurpose & Capability
The name/description (image OCR + Edge TTS) matches the declared runtime steps and required tools: Tesseract for OCR, Python + Pillow for image preprocessing, and uvx/edge-tts for TTS. Requiring tessdata and language packs for Chinese OCR is expected. No unexplained external credentials or unrelated binaries are requested.
Instruction Scope
Instructions stay within the stated task: preprocess image, run tesseract, produce text, optionally split into sentences, and call edge-tts via uvx. Two things to note: (1) skipConfirmation is explicitly warned as a privacy risk because it will convert OCR output (which may contain sensitive data) directly to audio; (2) the doc includes examples that run other scripts by absolute path (e.g., a feishu-voice-send script under /mnt/d/wslspace/...), which assumes local files/skills exist and could execute arbitrary code if present. The skill does not request extra env vars and relies on OpenClaw's message(...) tool for channel delivery.
Install Mechanism
This is instruction-only (no packaged install). The SKILL.md suggests apt-get to install tesseract and language packs (standard). It also relies on uvx auto-downloading edge-tts on first run — an implicit network fetch of code at runtime. That auto-download is reasonable for convenience but is a higher-risk action than purely using already-installed binaries because it pulls remote package(s) dynamically.
Credentials
No environment variables or credentials are requested; the skill defers to OpenClaw's channel authentication. This is proportionate for a messaging-forwarding TTS skill. Caveat: forwarding via other skills (e.g., feishu-voice-send) may require credentials/configuration outside this skill.
Persistence & Privilege
The skill is not always-enabled and does not request elevated platform privileges. It is a runtime instruction-only skill and does not modify other skills or require persistent configuration changes.
Assessment
This skill appears to do what it says: OCR images or accept text, then produce Edge TTS audio and send it over the active channel. Before installing/using it: 1) Be cautious with skipConfirmation — don’t enable it for images that may contain private data. 2) Expect an apt-get step to install tesseract and a first-run network download (uvx will auto-fetch edge-tts); if you need stricter supply-chain control, preinstall and vet the edge-tts package source. 3) The docs include absolute example paths and an example call to another skill/script (feishu-voice-send) — verify those scripts exist and review them before executing. 4) Run the skill in a sandbox or test environment first if you are uncertain about auto-downloaded components. If you want a stronger assurance, ask the publisher for explicit sources/URLs for uvx/edge-tts and the feishu helper script, or request a packaged release rather than instruction-only steps.Like a lobster shell, security has layers — review code before you run it.
latestvk97f8n4f2avgq2cc8hjdbjf67984b8sf
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
