Brand Voice Architect

Security checks across malware telemetry and agentic risk

Overview

This is a coherent brand-voice analysis and prompt-generation skill, with some transparency cautions around broad activation and generated prompts that silently rewrite prohibited terms.

Install this only if you want brand-voice analysis and prompt generation. Provide only writing samples you intend to analyze, and review generated system prompts before using them in another assistant, especially where exact wording, quotations, legal text, or compliance-sensitive language matters.

SkillSpector

By NVIDIA

Vulnerability Patterns

System Prompt LeakageDirect Leakage, Indirect Extraction, Tool-Based Exfiltration
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration

Findings (3)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 93% confidence
Finding: The skill explicitly directs the agent to use local scripts and reference files, which implies file-read/code-adjacent capability, but it declares no permissions or boundaries. This creates a trust and enforcement gap: an orchestrator may allow the skill to access repository files or corpora without clear user-visible authorization, increasing the chance of unintended data exposure during analysis.

Vague Triggers

Medium

Confidence: 89% confidence
Finding: The trigger guidance is overly expansive, including vague requests like making writing more consistent or asking what tone to use. That can cause the skill to activate in situations where the user did not intend a corpus-analysis or prompt-synthesis workflow, potentially pulling in files, generating system prompts, or shaping outputs beyond the user's expectation.

Natural-Language Policy Violations

Medium

Confidence: 91% confidence
Finding: The generated system prompt explicitly instructs the downstream model to silently replace prohibited user-provided words with preferred equivalents. That creates undisclosed transformation of user input, which can misrepresent the user's meaning, hide sensitive or safety-relevant terms, and reduce transparency in audit or compliance contexts. In a brand-voice skill, this is more dangerous because the tool is designed to rewrite content at scale, making covert semantic drift likely and hard to detect.

VirusTotal

64/64 vendors flagged this skill as clean.

View on VirusTotal