Wjs Transcribing Audio

PassAudited by ClawScan on May 13, 2026.

Overview

The skill is coherent for making SRT transcripts, but users should notice that it may send media to cloud ASR providers and use API credentials.

This looks appropriate for a transcription skill. Before installing, make sure you are comfortable uploading the selected audio/video to OpenAI or Volcano, using an API key that may incur costs, and letting the skill choose providers by language unless you specify otherwise.

Findings (4)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

Low

#ASI03: Identity and Privilege Abuse

What this means

Your OpenAI account key may be used to upload audio for transcription and may be billed by the provider.

Why it was flagged

The skill uses a user's OpenAI API key to call the transcription API. This is purpose-aligned, but it is sensitive account authority and can incur usage charges.

Skill content

headers={"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"}

Recommendation

Use a dedicated or restricted API key where possible, monitor usage, and revoke the key if you no longer use the skill.

Low

#ASI07: Insecure Inter-Agent Communication

What this means

Audio or video content selected for transcription may be sent to OpenAI or Volcano depending on the source language.

Why it was flagged

The skill explicitly routes audio transcription through external ASR providers. That is central to the stated purpose, but it means user media content may leave the local environment.

Skill content

Chinese defaults to Volcano (豆包) ASR; other languages ... use OpenAI Whisper API

Recommendation

Do not use this skill with media you are not allowed to share with the selected ASR provider; confirm the provider choice for sensitive recordings.

Info

#ASI02: Tool Misuse and Exploitation

What this means

The agent may process local media files and create temporary audio chunks as part of transcription.

Why it was flagged

The skill documents local media conversion with ffmpeg. This is expected for transcription preparation, but it reads local media files and writes chunk files.

Skill content

ffmpeg -hide_banner -loglevel error -y \
  -ss <start> -t 600 -i input.mp4 \
  -vn -ac 1 -ar 16000 -c:a libmp3lame -b:a 64k chunk.mp3

Recommendation

Run it only on intended input files and keep output paths in a workspace you control.

Info

#ASI06: Memory and Context Poisoning

What this means

Chinese audio may be routed to Volcano by default based on a stored preference, even if you would prefer another provider.

Why it was flagged

The skill references persistent user memory or a standing preference to choose the ASR provider for Chinese audio. This is not inherently unsafe, but stale or incorrect memory could affect where media is sent.

Skill content

This is a deliberate routing decision — see user's memory on Chinese ASR priority.

Recommendation

Explicitly state the ASR provider you want when handling sensitive or provider-restricted media.