Step Asr

Security checks across malware telemetry and agentic risk

Overview

This skill is a straightforward cloud transcription helper that uploads the user-selected audio file to StepFun’s ASR API and optionally saves the transcript locally.

Install only if you are comfortable sending the audio files and any prompt text you choose to StepFun for processing under your StepFun account. Avoid confidential, regulated, or third-party recordings unless StepFun’s terms and your consent requirements allow it, and keep STEPFUN_API_KEY private.
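As an illustration of keeping STEPFUN_API_KEY private, the following minimal sketch reads the key from the environment and produces a masked form that is safe to print in logs. The helper names `load_api_key` and `mask_secret` are hypothetical and are not taken from the skill's own script.

```python
import os


def load_api_key(env=os.environ):
    """Read STEPFUN_API_KEY from the environment without ever echoing it."""
    key = env.get("STEPFUN_API_KEY", "").strip()
    if not key:
        raise RuntimeError("STEPFUN_API_KEY is not set")
    return key


def mask_secret(secret, visible=4):
    """Return a log-safe form that shows only the last few characters."""
    if len(secret) <= visible:
        return "*" * len(secret)
    return "*" * (len(secret) - visible) + secret[-visible:]
```

Logging `mask_secret(load_api_key())` instead of the raw value keeps the credential out of terminal history and shared transcripts.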

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data Exfiltration: External Transmission, Env Variable Harvesting, File System Enumeration
  • MCP Least Privilege: Underdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • Prompt Injection: Instruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege Escalation: Excessive Permissions, Sudo/Root Execution, Credential Access
  • Supply Chain: Unpinned Dependencies, External Script Fetching, Obfuscated Code
Findings (3)

Lp3

Severity: Medium
Category: MCP Least Privilege
Confidence: 94%
Finding:
The skill clearly relies on environment secrets, writes output files, and sends data over the network, but it does not declare permissions to make those capabilities explicit to users or the platform. This weakens transparency and consent, especially because audio content and API credentials are involved, and can lead to users invoking a skill without understanding its access scope.

Missing User Warnings

Severity: Medium
Confidence: 98%
Finding:
The skill transcribes local audio by sending it to the external Step ASR service, but the description and quick-start guidance do not prominently warn users that their audio leaves the local environment. Audio often contains sensitive personal, business, or regulated information, so omission of this disclosure creates a meaningful privacy and data-handling risk.

Missing User Warnings

Severity: Medium
Confidence: 79%
Finding:
The script sends the full base64-encoded audio content to a third-party remote API, but the runtime behavior does not provide an explicit user-facing warning or consent prompt before transmission. In a skill context, users may run the tool on sensitive recordings and underestimate that audio leaves the local environment, creating privacy and compliance risk.
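One way to address this finding is a consent gate that runs before the audio is encoded for upload. The sketch below is an assumption about how such a gate could look, not the skill's actual code: `confirm_upload` and `encode_if_approved` are hypothetical names, and the destination string is only a label shown to the user.

```python
import base64
from pathlib import Path


def confirm_upload(path, destination, ask=input):
    """Ask the user to approve sending a file off the local machine."""
    size_kib = Path(path).stat().st_size / 1024
    answer = ask(f"Send {path} ({size_kib:.1f} KiB) to {destination}? [y/N] ")
    # Anything other than an explicit yes counts as a refusal.
    return answer.strip().lower() in {"y", "yes"}


def encode_if_approved(path, destination="the StepFun ASR API", ask=input):
    """Return the base64-encoded audio only after explicit consent, else None."""
    if not confirm_upload(path, destination, ask):
        return None
    return base64.b64encode(Path(path).read_bytes()).decode("ascii")
```

Defaulting to "no" means that an empty response or a non-interactive run never transmits audio by accident. The `ask` parameter exists so the prompt can be replaced in tests or in an agent-driven flow.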

VirusTotal

64/64 vendors flagged this skill as clean.
