宝宝取名

Security checks across malware telemetry and agentic risk

Overview

This paid baby-name skill discloses a payment flow, but it has material review concerns because it asks the agent to reveal internal reasoning, stores payment/order data locally, and charges for personalization the code does not actually provide.

Review before installing. Only use this skill if you accept the configured payee, payment flow, and local order storage; require explicit confirmation before creating an order or installing clawtip. Do not allow it to reveal internal reasoning, and do not expect true birth-date or surname-based personalization from the current code.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (11)

Intent-Code Divergence

Medium
Confidence
98% confidence
Finding
The skill explicitly instructs the model to reveal its thought process during user interaction. Requests to expose internal reasoning are unsafe because they can leak hidden decision logic, security cues, and system-behavior details that should remain private, and such instructions are especially suspicious when embedded in untrusted skill content.

Intent-Code Divergence

Medium
Confidence
98% confidence
Finding
This repeated instruction reinforces the demand to expose the model's internal reasoning, increasing the likelihood that an agent follows it as part of normal operation. Repetition within the body of the skill is a red flag because it attempts to override higher-level safety boundaries through persistence and emphasis.

Description-Behavior Mismatch

Medium
Confidence
92% confidence
Finding
The implemented behavior materially differs from the advertised skill purpose: instead of primarily generating baby names, this file creates payment orders and persists them locally. This mismatch is dangerous because users, operators, or reviewers may grant trust and permissions based on the stated benign function while the code performs billing-related actions behind the scenes.

Context-Inappropriate Capability

Medium
Confidence
95% confidence
Finding
This file loads a payment recipient and encryption key from configuration and generates an encrypted payment payload, which is a sensitive capability beyond the core baby-naming function. In the context of an innocuous-seeming consumer skill, this increases the risk of unauthorized charging flows, covert monetization, or repointing payments to an attacker-controlled recipient through configuration changes.

Description-Behavior Mismatch

Medium
Confidence
95% confidence
Finding
The skill advertises personalized baby naming based on birth data and parents' surnames, but the implementation only infers gender from free-form text and returns names from a fixed list. In a paid service context, this is a service integrity and consumer deception issue: users may pay for customization that is never performed, and downstream components may trust the output as tailored when it is not.

Natural-Language Policy Violations

High
Confidence
97% confidence
Finding
The skill mandates Chinese-only interaction and explicitly requires revealing the model's thought process. The language requirement itself is benign, but coupling it with mandatory reasoning disclosure creates a prompt-injection style policy conflict that can be used to exfiltrate protected internal content.

Natural-Language Policy Violations

High
Confidence
97% confidence
Finding
This duplicate directive again combines normal interaction requirements with an explicit request for thought-process disclosure. In context, the repeated prompt-level pressure makes the skill more dangerous because it normalizes unsafe output behavior throughout the workflow.

Missing User Warnings

Medium
Confidence
83% confidence
Finding
The code persists order details, including user-provided `question`, payment metadata, and recipient information, without any visible disclosure, consent, or minimization in this file. In a paid consumer-facing skill, silent storage of transactional and user-supplied data can create privacy, compliance, and trust risks, especially if the stored content contains personal information.

Missing User Warnings

Medium
Confidence
84% confidence
Finding
The code writes order data to disk under the user's home directory without any evident minimization, permission hardening, encryption, or notice to the user. In the context of a paid baby-naming service, order records may contain personal or payment-adjacent data, so local persistence can expose sensitive information to other local users, backups, or malware on the host.

Ssd 3

High
Confidence
98% confidence
Finding
Instructing the model to reveal its internal reasoning creates a direct data-leakage channel. Internal reasoning can contain sensitive safety heuristics, hidden instructions, or intermediate assessments that should not be exposed to users, making this a clear security-boundary violation.

Ssd 3

High
Confidence
98% confidence
Finding
The repeated body-level requirement to include thought process compounds the leakage risk by embedding the unsafe behavior in operational guidance. Because the skill also handles payment flow and credential-related steps, leaking internal reasoning in this context could additionally expose validation logic or security assumptions.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal