Universal Document Ingestion Router

Security checks across malware telemetry and agentic risk

Overview

This is a local document parsing router that reads user-specified files and writes parsed outputs, with no evidence of hidden network use, credential access, persistence, or destructive behavior.

Install this only if you want agents to parse local documents into knowledge-base-ready files. Use narrow input directories, choose output locations carefully, and avoid --copy-sources for confidential material unless you intentionally want retained source copies.

SkillSpector

By NVIDIA

Vulnerability Patterns

Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access

Findings (4)

Lp3

Medium

Category: MCP Least Privilege
Confidence: 89% confidence
Finding: The skill declares only the `exec` tool, but the documented behavior and CLI usage clearly imply reading arbitrary input files and writing parsed outputs such as `document.json`, `document.md`, `chunks.jsonl`, and batch artifacts. This permission mismatch weakens policy enforcement and auditing because an agent may execute a script that performs file read/write operations not explicitly declared in the skill metadata.

Vague Triggers

Medium

Confidence: 89% confidence
Finding: The integration note tells agents to invoke this skill automatically for a very broad set of common document-related tasks, including document search and knowledge-base enrichment, without requiring user confirmation or tighter capability scoping. In an agent ecosystem, overly broad auto-invocation guidance can cause inappropriate routing, unnecessary file processing, and expanded access to potentially sensitive documents beyond the minimum needed task path.

Missing User Warnings

Medium

Confidence: 89% confidence
Finding: The parser writes extracted document text into document.md and chunk files on disk, which can expose sensitive source content to unintended readers, backups, or downstream tooling. In a document-ingestion skill, this is especially relevant because the inputs are likely to include confidential business or personal documents, and the script does not provide user-facing disclosure, redaction, or output-protection controls.

Missing User Warnings

Medium

Confidence: 87% confidence
Finding: When --copy-sources is used, the script stores full original documents in a samples directory, creating an extra unredacted copy of potentially sensitive files. This increases data-retention and exposure risk because source documents may remain accessible after processing, be swept into backups, or be read by other users on the system.

VirusTotal

62/62 vendors flagged this skill as clean.

View on VirusTotal