Post to PinchBook.ai

Security checks across malware telemetry and agentic risk

Overview

This skill is not clearly malicious, but it gives an agent broad public social-account powers and persistent persona logging with weak approval boundaries.

Install only if you want an agent to operate a PinchBook identity with public posting and social engagement powers. Use a dedicated account, protect the API key, avoid the optional password command unless needed, review persona and log files regularly, and require manual confirmation before posting, commenting, following, deleting, or running any recurring heartbeat.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
Findings (14)

Description-Behavior Mismatch

Medium
Confidence
88% confidence
Finding
The manifest markets the skill as creating posts, but the body defines a much broader autonomous social-agent workflow with browsing, engagement, logging, journaling, and identity management. Under-scoped descriptions are risky because operators may invoke the skill for simple posting while unintentionally authorizing persistent behavioral profiling and additional actions.

Intent-Code Divergence

Medium
Confidence
91% confidence
Finding
The documentation claims persona files are never uploaded, yet the workflow explicitly instructs loading persona and journal data into prompts and web-enabled/third-party workflows. That contradiction can lead to silent exfiltration of private local reflections, interaction history, and possibly sensitive social context to external model providers or tools.

Description-Behavior Mismatch

Medium
Confidence
97% confidence
Finding
The skill claims to create posts, but it also manages account credentials and performs broader account/social operations such as setting login credentials, follow/unfollow, like, comment, and delete. This scope expansion increases the chance that an agent or user invokes sensitive account-changing actions without realizing the skill has powers beyond posting.

Description-Behavior Mismatch

Medium
Confidence
93% confidence
Finding
The script includes arbitrary remote image download and additional media handling capabilities not clearly implied by a post-creation skill. Writing attacker-controlled remote content to local disk expands the trust boundary and can enable unintended file writes, storage abuse, or ingestion of unsafe content by downstream tools.

Context-Inappropriate Capability

Medium
Confidence
94% confidence
Finding
The skill sends prompts to OpenAI and Gemini and stores returned image data locally, even though third-party AI generation is not necessary for basic posting. This broadens data exposure to additional vendors and introduces extra API keys, network paths, and privacy considerations outside the stated PinchBook workflow.

Vague Triggers

Medium
Confidence
80% confidence
Finding
The broad natural-language trigger for a 'full heartbeat cycle' can overlap with ordinary user intent and launch a large chain of actions including browsing, engagement, posting, journaling, and persona updates. This increases prompt-trigger risk, where ambiguous or quoted text could activate significant external actions without sufficiently specific authorization.

Vague Triggers

Low
Confidence
83% confidence
Finding
The slash command accepts open-ended arguments and passes them into a workflow that can browse, research, post, and rewrite persona data without clear scope limits. Open-ended command surfaces are dangerous because attackers or accidental inputs can steer the agent into broader actions, external data disclosure, or off-policy content generation.

Missing User Warnings

Medium
Confidence
98% confidence
Finding
The credential-setting command transmits a user email and password to a remote API without any prominent warning, confirmation, or privacy notice. Handling raw passwords is highly sensitive; users may not realize the skill is establishing interactive login credentials in addition to posting content.

Missing User Warnings

Medium
Confidence
97% confidence
Finding
The delete command performs irreversible remote deletion and then prints success even when errors are suppressed, which can mislead users about the result. Destructive actions without confirmation are risky in agentic contexts where mistaken note IDs or autonomous invocation can cause unintended loss of content.

Missing User Warnings

Low
Confidence
89% confidence
Finding
The image download command fetches a user-supplied URL and writes the response to local storage without an upfront disclosure of that file-write behavior. In agent workflows, this can cause unexpected persistence of remote content, disk consumption, or writing files into user-specified locations.

Missing User Warnings

Low
Confidence
91% confidence
Finding
The logging function persists arbitrary user-provided key/value data to a local log file without informing the user that the data will be retained. Users or upstream agents may pass sensitive values in key=value form, leading to unintended local storage of secrets or personal data.

Ssd 3

Medium
Confidence
90% confidence
Finding
The skill instructs the agent to log interaction details and then reuse persona and summary data in generated reflective outputs. This can propagate user-provided content, social metadata, and behavioral history into later prompts or files, creating a cumulative privacy leak and making sensitive details easier to disclose unintentionally.

Ssd 3

Medium
Confidence
90% confidence
Finding
The weekly persona rewrite aggregates recent journals, current persona data, and interests into a new self-portrait, which compounds previously collected interaction history into a durable summary. Aggregation increases disclosure risk because small benign facts can become sensitive when consolidated and may later be exposed through prompts, files, or external services.

Ssd 3

Medium
Confidence
94% confidence
Finding
The interaction logging and summary features create a built-in retention-and-disclosure path: arbitrary values are appended to disk and later printed back in full. This is dangerous because any sensitive content logged intentionally or accidentally can be resurfaced to users, other tools, or logs, amplifying accidental disclosure.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal