Byted Mediakit Voiceover Editing

v1.0.9

Volcano Engine AI MediaKit talking-head video editing Skill: a one-stop workflow from environment setup through media management, audio processing, talking-h...

License: MIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotal: Benign
OpenClaw: Benign (high confidence)
Purpose & Capability
Name/description (Volcano/Byted MediaKit talking‑head editing) match what the code and SKILL.md implement: ASR submission, candidate generation, review UI, VOD upload/export and local processing. The declared environment variables (VOLC_*, ASR_*, ARK_SKILL_*) and required permissions (network, file read/write, temp storage) are appropriate for those tasks.
Instruction Scope
Runtime instructions require the agent to run a controlled sequence of scripts (cd into scripts/, run setup.sh, pipeline_xxx, prepare_export_data, etc.), read the process environment and a skill .env, and write outputs under the project output/ directory. These steps are within the editing/export workflow. Note that setup.sh and the scripts may write or update the skill .env (e.g., EXECUTION_MODE), and SKILL.md encourages the agent to edit .env via a file-write tool. This is expected for configuration, but any secrets stored in .env must be handled carefully.
Install Mechanism
The skill does not download arbitrary external binaries; setup.sh creates a Python venv and installs pinned packages from requirements.txt/requirements-local.txt via PyPI. This is normal for Python tooling, but the install pulls in substantial packages (torch, demucs, an ffmpeg wrapper, etc.), so run it in an isolated environment and review the requirements files if you need reproducible or secure installs.
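One way to carry out the requirements review mentioned above is to flag any line that is not pinned to an exact version. The following is a minimal sketch, not part of the skill: the helper name `unpinned_requirements` is hypothetical, and the check only recognizes the simple `name==version` form (it does not understand direct URLs or hash options).

```python
import re

# Matches "name==version", optionally with extras such as "pkg[extra]==1.0".
# Anything after the version (e.g. an environment marker) is ignored.
PINNED = re.compile(r"^[A-Za-z0-9._-]+(\[[A-Za-z0-9._,\s-]+\])?\s*==\s*[^\s;]+")


def unpinned_requirements(text: str) -> list[str]:
    """Return requirement lines that are not pinned with '=='."""
    loose = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop trailing comments
        if not line or line.startswith("-"):
            # Skip blank lines and pip options such as "-r other.txt".
            continue
        if not PINNED.match(line):
            loose.append(line)
    return loose


if __name__ == "__main__":
    # Hypothetical file content, used only for illustration.
    sample = "examplepkg==1.0.0\nanotherpkg>=2.0\n"
    print(unpinned_requirements(sample))
```

To check the real files, read requirements.txt and requirements-local.txt and pass their contents to the function; an empty result means every recognized line is pinned.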
Credentials
Required secrets (ASR_API_KEY, VOLC_SPACE_NAME, optional VOLC_ACCESS_KEY_ID/SECRET or ARK_SKILL_API_*) map directly to the ASR and VOD functions the skill performs. There are no unrelated credential requests. The skill reads both the process environment and .env, and it may write EXECUTION_MODE into .env during automatic downgrade. This behavior is plausible for the workflow, but users should avoid placing unrelated high‑privilege secrets in that .env.
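To act on the advice about unrelated secrets, the skill's .env can be checked for keys outside the variables the listing declares. This is a rough sketch under stated assumptions: the helper names are hypothetical, the parser handles only plain `KEY=value` lines (with an optional `export ` prefix), and it is not the parser the skill itself uses.

```python
# Variable names declared in this listing's runtime requirements.
EXPECTED_KEYS = {
    "ARK_SKILL_API_BASE", "ARK_SKILL_API_KEY",
    "VOLC_ACCESS_KEY_ID", "VOLC_ACCESS_KEY_SECRET", "VOLC_SPACE_NAME",
    "ASR_API_KEY", "ASR_BASE_URL",
    "VOD_EXPORT_SKIP_SUBTITLE",
    "TALKING_VIDEO_AUTO_EDIT_REVIEW_AUTO_OPEN",
    "TALKING_VIDEO_AUTO_EDIT_VIDEO_CUT",
    "EXECUTION_MODE",
}


def parse_env_keys(text: str) -> list[str]:
    """Collect key names from simple KEY=value lines; values are never returned."""
    keys = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        if line.startswith("export "):
            line = line[len("export "):]
        keys.append(line.split("=", 1)[0].strip())
    return keys


def unexpected_keys(text: str) -> list[str]:
    """Return sorted key names that the skill does not declare."""
    return sorted(k for k in set(parse_env_keys(text)) if k not in EXPECTED_KEYS)


if __name__ == "__main__":
    # Hypothetical .env content, used only for illustration.
    sample = "ASR_API_KEY=abc\nSOME_OTHER_TOKEN=xyz\n"
    print(unexpected_keys(sample))
```

Only key names are reported, so the check can be run without echoing secret values to a terminal or log.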
Persistence & Privilege
The skill sets always:false and does not request to modify other skills or system-wide agent settings. It writes its own .env, creates a local virtualenv under scripts/.venv, and writes output files under the project output/ directory, which is the expected persistence for a local tool. Autonomous invocation (model calls) is allowed by default, but that is not in itself a red flag.
Assessment
This skill appears coherent for talking‑head/video editing: it will call ASR endpoints and VOD upload APIs and may install heavy Python packages (torch, demucs, ffmpeg helpers). Before installing or running:
1) Do not put high‑privilege credentials in the skill .env; use a dedicated minimal‑permission ASR/VOD account or a test VOD space.
2) Run setup/install only in an isolated environment (container/VM), because pip will install large native packages.
3) Review requirements.txt and the scripts if you must comply with internal policy (they use requests and perform network uploads to ASR/VOD endpoints, which is expected).
4) Be aware that the skill may write EXECUTION_MODE into .env automatically during mode auto‑downgrade.
5) Avoid pointing the skill at sensitive local files; it reads and writes files under project_root/output.
If you need higher assurance, run the pipeline in local mode (EXECUTION_MODE=local), which avoids sending media to cloud services but requires local ASR and separation dependencies to be installed and available.


latest: vk97338p9v49wtwezcgc2e5xzjs83waj8


Runtime requirements

Environment variables
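ARK_SKILL_API_BASE (optional): Root URL of the SkillHub VOD gateway OpenAPI. apig mode is enabled when this is set together with ARK_SKILL_API_KEY. Usually injected into the process environment by the host/container, so it does not need to be written to .env.
ARK_SKILL_API_KEY (optional): SkillHub gateway Bearer token. apig mode is enabled when this is set together with ARK_SKILL_API_BASE. Usually injected by the host/container, so it does not need to be written to .env.
VOLC_ACCESS_KEY_ID (optional): Volcano Engine access key ID. Required when connecting directly to OpenAPI; may be omitted if ARK_SKILL_API_BASE and ARK_SKILL_API_KEY are already present in the process environment.
VOLC_ACCESS_KEY_SECRET (optional): Volcano Engine access key secret. Required when connecting directly to OpenAPI; may be omitted if ARK_SKILL_API_BASE and ARK_SKILL_API_KEY are already present in the process environment.
VOLC_SPACE_NAME (required): Name of the Volcano Engine VOD storage space.
ASR_API_KEY (required): API key for the speech recognition service.
ASR_BASE_URL (required): Endpoint URL of the speech recognition service.
VOD_EXPORT_SKIP_SUBTITLE (optional): Whether to skip subtitle burn-in on export. Skipped by default; 0/false/no enables subtitle burn-in.
TALKING_VIDEO_AUTO_EDIT_REVIEW_AUTO_OPEN (optional): Whether to open a browser automatically when the review page starts. Off by default; 1/true/yes opens the browser.
TALKING_VIDEO_AUTO_EDIT_VIDEO_CUT (optional): Whether to enable video cutting: 1 enables it (default); 0 disables it. Takes effect only when there are subtitles or the audio is silent.
EXECUTION_MODE (optional): Execution mode selection, with priority apig > cloud > local:
  - apig: use the SkillHub gateway (requires ARK_SKILL_API_BASE + ARK_SKILL_API_KEY)
  - cloud: connect directly to the Volcano Engine OpenAPI (requires VOLC_ACCESS_KEY_* + VOLC_SPACE_NAME + ASR_*)
  - local: run entirely locally using Qwen3-ASR / Demucs / ffmpeg; no cloud environment variables or VOD space are needed
  Leave empty for automatic detection: modes are tried in apig > cloud > local order, and when parameters are missing the skill prints a hint and downgrades automatically.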
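The mode priority and the boolean switches described above can be sketched as follows. This is an illustration under stated assumptions, not the skill's actual code: `resolve_mode` and `flag` are hypothetical names, ASR_* is taken to mean ASR_API_KEY and ASR_BASE_URL, an explicitly requested mode with missing variables is assumed to downgrade from that mode onward, and both the 1/true/yes and 0/false/no spellings are assumed to be accepted for every switch.

```python
ORDER = ["apig", "cloud", "local"]  # priority stated in the listing


def resolve_mode(env: dict) -> str:
    """Pick an execution mode from a mapping of environment variables."""
    def has(*names):
        return all(env.get(n) for n in names)

    available = {
        "apig": has("ARK_SKILL_API_BASE", "ARK_SKILL_API_KEY"),
        "cloud": has("VOLC_ACCESS_KEY_ID", "VOLC_ACCESS_KEY_SECRET",
                     "VOLC_SPACE_NAME", "ASR_API_KEY", "ASR_BASE_URL"),
        "local": True,  # needs no cloud variables
    }
    requested = (env.get("EXECUTION_MODE") or "").strip().lower()
    # Assumption: an empty or unknown value means auto-detect from the top.
    start = ORDER.index(requested) if requested in ORDER else 0
    for mode in ORDER[start:]:
        if available[mode]:
            return mode
        print(f"{mode}: required variables missing, downgrading")
    return "local"


def flag(env: dict, name: str, default: bool) -> bool:
    """Read a boolean switch; unrecognized or empty values give the default."""
    raw = (env.get(name) or "").strip().lower()
    if raw in {"1", "true", "yes"}:
        return True
    if raw in {"0", "false", "no"}:
        return False
    return default


if __name__ == "__main__":
    import os
    print(resolve_mode(dict(os.environ)))
    print(flag(dict(os.environ), "VOD_EXPORT_SKIP_SUBTITLE", default=True))
```

Passing the environment as a plain mapping keeps the sketch easy to test and avoids touching .env; per the listing, the real scripts also read .env and may write the chosen mode back to it.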
