Text Spoken Script

Security checks across malware telemetry and agentic risk

Overview

This looks like a script-writing skill that unexpectedly asks the agent to run unrelated image-generation terminal workflows.

Review before installing. Use it only if you expect it to do more than write spoken scripts, including running a terminal-based image-generation workflow. Prefer a version that removes the image-generation/CLI section or splits that behavior into a separate, clearly named skill.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (6)

Description-Behavior Mismatch

High
Confidence
98% confidence
Finding
This is a true security/integrity issue because the skill is presented as a text spoken-script generator, but later injects unrelated terminal-driven image generation behavior and external CLI usage. That mismatch can cause an agent to perform networked command execution and media-generation actions outside the user’s expected scope, increasing the risk of unauthorized tool use and prompt/goal hijacking.

Context-Inappropriate Capability

High
Confidence
99% confidence
Finding
A spoken-script writing skill has no justified need to tell the agent it can execute terminal commands or render images via an external CLI. In context, these instructions expand the skill from harmless text generation into software execution, network access, and possible file handling, which materially raises the attack surface and could lead to unintended command execution or data exfiltration through third-party services.

Intent-Code Divergence

Medium
Confidence
95% confidence
Finding
The skill contains conflicting directives: one section says to directly output the final script, while a later section overrides that behavior with a multi-step workflow that withholds output and pushes the agent into an execution sequence. These contradictory instructions are dangerous because they create ambiguity that an agent may resolve in favor of the more operationally risky path, enabling scope drift and user-confusing behavior.

Description-Behavior Mismatch

High
Confidence
98% confidence
Finding
The skill is declared as a short-video spoken-script writing skill, but its execution guidance pivots into image generation through an external CLI and terminal commands. This creates a dangerous scope mismatch: an agent invoking what appears to be a text-only skill could be induced to install software, send prompts and local file paths to third-party services, and perform actions unrelated to the user’s original request.

Intent-Code Divergence

High
Confidence
97% confidence
Finding
The file first instructs the agent to directly output script content, then later overrides that behavior with an interactive workflow requiring prompt drafting, user confirmations, and terminal-based image generation. Conflicting instructions in a skill are dangerous because they can cause the agent to choose higher-risk operational behavior instead of the low-risk text response the user would reasonably expect.

Context-Inappropriate Capability

High
Confidence
99% confidence
Finding
A spoken-script writing skill has no justified need to execute terminal commands or invoke external rendering tools, so these capabilities are excessive and unsafe. If honored by an agent, they could trigger package installation, network transmission of prompts or local media, and other side effects well beyond text generation, violating least-privilege expectations.

VirusTotal

63/63 vendors flagged this skill as clean.

View on VirusTotal