K8s Self Hosted Whisper Api

Security checks across malware telemetry and agentic risk

Overview

This skill does what it says, but it needs review because a crafted prompt can execute local Python code and audio is sent to an unauthenticated HTTP Whisper service.

Install only if you trust the Kubernetes Whisper service and the network it runs on. Avoid sensitive audio unless plain HTTP/no-auth transmission is acceptable, and patch or avoid the --prompt option before using untrusted prompt text.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (4)

Lp3

Medium
Category
MCP Least Privilege
Confidence
94% confidence
Finding
The skill invokes shell tooling and makes network requests but does not declare corresponding permissions, which weakens platform trust boundaries and informed consent. In practice, this can let a user or reviewer underestimate what the skill can do, especially since it sends user-provided audio to an internal HTTP service.

Missing User Warnings

Medium
Confidence
97% confidence
Finding
The skill sends audio content to a remote service over plain HTTP, yet the description does not clearly warn users that their audio leaves the local context for network processing. This creates a meaningful privacy and integrity risk because audio may contain sensitive data and HTTP provides no transport encryption.

Vague Triggers

Medium
Confidence
83% confidence
Finding
The trigger scope is broad enough to match many common transcription or subtitle requests without clearly stating limits, which increases the chance of accidental invocation. That matters here because invocation causes data to be sent to an unauthenticated internal ASR service, so overbroad matching can expose user audio more often than intended.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The script uploads the full audio file to an internal service over plain HTTP, which provides no transport encryption or peer authentication. Even inside a Kubernetes cluster, this can expose sensitive speech content and derived transcripts to interception, traffic capture, misrouting, or abuse by a compromised pod, node, sidecar, or network component; the skill context increases risk because transcription inputs commonly contain private conversations, credentials, or regulated data.

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal