video2txt-视频理解字幕提取
v1.0.1将本地视频或音频文件转写为 SRT 字幕文件和 TXT 纯文本文件
⭐ 0· 115·0 current·0 all-time
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
OpenClaw
Benign
medium confidencePurpose & Capability
Name/description, required binary (python3), declared dependencies (faster-whisper, av, opencc) and the included Python script all align with a local transcription/subtitle generation tool. The script uses ffprobe/ffmpeg and faster-whisper as expected.
Instruction Scope
SKILL.md instructs the agent to run the provided Python script and to install dependencies; the instructions stay within the transcription task. Important operational behaviors are called out (model download on first run, use background execution). Note: the script will perform network downloads for Whisper models and will call ffprobe/ffmpeg via subprocess; these are expected but are external network/system interactions the user should be aware of. Also the SKILL.md pre-scan flagged unicode-control-chars (possible prompt-injection attempt) — the visible content looks normal, but a manual check of raw file bytes for hidden control characters is recommended.
Install Mechanism
No custom install spec; standard pip requirements.txt is provided. This is low-risk compared with arbitrary remote archive downloads. The only external runtime download is the Whisper model files (expected for this functionality).
Credentials
The skill requests no environment variables or credentials. It needs access to local files (input media) and will write SRT/TXT output and model files to disk (models directory). These requirements are proportional to the stated goal.
Persistence & Privilege
Skill is not always-on and does not request special platform privileges. It does not declare or appear to modify other skills or global agent settings.
Scan Findings in Context
[unicode-control-chars] unexpected: A prompt-injection detection found unicode control characters in SKILL.md. This is not required for transcription and is unexpected; it may be an artifact or a concealed formatting attempt. Recommend inspecting the raw SKILL.md for hidden control characters (e.g., U+202E, U+200B, U+202C) before trusting/automating the skill.
Assessment
This skill appears to do what it says: transcribe local media into .srt and .txt using faster-whisper and ffmpeg. Before installing or running it: 1) Run pip installs in an isolated venv. 2) Ensure ffmpeg/ffprobe are installed and on PATH. 3) Be aware the first run will download Whisper model files (network traffic and significant disk use); verify you are comfortable with that. 4) Inspect the SKILL.md and script for any hidden/unexpected characters or modifications (the static scan flagged unicode control characters). 5) If you plan to run it automatically, restrict it to media files you trust — the script will read local files and write model files and outputs to disk. If anything in the SKILL.md raw text looks suspicious, do not grant it automated/autonomous execution until you confirm the content is clean.Like a lobster shell, security has layers — review code before you run it.
latestvk97evey77q8v92r2qa0rvkypmx83afww
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
Runtime requirements
video Clawdis
Binspython3
