Schema Markup Generator

Security checks across malware telemetry and agentic risk

Overview

This skill is a disclosed Schema.org markup helper, but its URL and sitemap modes should only be used with sites and files the user trusts.

Install if you want a local Schema.org JSON-LD generator. Use file input only on JSON files you intend to process, use URL and sitemap modes only for public sites you control or trust, choose the output directory deliberately, and review generated structured data before publishing it.

SkillSpector

By NVIDIA

Vulnerability Patterns

Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
Taint TrackingDirect Taint Flow, Variable-Mediated Taint Flow, Credential Exfiltration Chain
MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration

Findings (5)

Tainted flow: 'url' from input (line 232, user input) → requests.get (network output)

Medium

Category: Data Flow
Content: import requests from bs4 import BeautifulSoup resp = requests.get(url, timeout=10) soup = BeautifulSoup(resp.text, 'html.parser') template = SCHEMA_TEMPLATES.get(schema_type, {}).copy()
Confidence: 91% confidence
Finding: resp = requests.get(url, timeout=10)

Description-Behavior Mismatch

Medium

Confidence: 91% confidence
Finding: The script accepts a remote sitemap URL, extracts arbitrary <loc> entries, and then issues outbound requests to every listed URL. This expands the skill from local schema generation into network crawling, which can be abused for server-side request forgery or unexpected access to internal or sensitive endpoints if an attacker controls the sitemap or input URL source.

Vague Triggers

Medium

Confidence: 78% confidence
Finding: The activation text is broad enough to trigger on many generic SEO, markup, or structured-data requests, which can cause unintended invocation. In context, that matters because this skill can read files, fetch URLs, and write outputs, so accidental activation may lead to unnecessary access to local or remote resources.

Vague Triggers

Low

Confidence: 91% confidence
Finding: The eval allows batch generation from an arbitrary sitemap URL without any stated limits on scope, allowed domains, request volume, or content-handling rules. In an agent setting, this can lead to uncontrolled external fetching, excessive crawling, or processing of attacker-supplied endpoints, which increases SSRF, resource-consumption, and unintended data-ingestion risk even if the feature is meant for legitimate schema generation.

Missing User Warnings

Medium

Confidence: 94% confidence
Finding: The guidance explicitly recommends placing the full article text into structured data for AI training, but provides no warning about privacy, consent, copyright, or duplication risks. In a content-generation skill intended to automate schema markup, this can cause users to expose sensitive or non-public material and may encourage over-sharing beyond what should appear in machine-readable metadata.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal