Defluff

Security checks across malware telemetry and agentic risk

Overview

This appears to be a purpose-aligned email analysis skill, with minor scoping and language-preference caveats rather than evidence of hidden or unsafe behavior.

Install this if you want an agent to analyze email text for scam, authorship, or prompt-injection signals. Be careful with private emails, and ask explicitly for the analysis type and output language you want so the skill does not over-apply to ordinary summaries.

SkillSpector

By NVIDIA

Vulnerability Patterns

Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code

Findings (2)

Vague Triggers

Medium

Confidence: 93% confidence
Finding: The trigger criteria are broad enough to match ordinary email summarization and triage requests, so the skill may activate when the user did not intend to perform AI-authorship detection or scam classification. That can cause misrouting to a specialized skill, privacy over-processing of email content, and misleading outputs in contexts where a plain summary was requested.

Natural-Language Policy Violations

Medium

Confidence: 88% confidence
Finding: Forcing the output to mirror the email's language without user opt-in can override user preferences, reduce comprehension, and create policy or accessibility issues when the user expects analysis in a different language. In a security workflow, this may also hinder review if analysts cannot read the mirrored language, increasing the chance that phishing or fraud indicators are missed.

VirusTotal

61/61 vendors flagged this skill as clean.

View on VirusTotal