chinese-literacy-detection

Security checks across malware telemetry and agentic risk

Overview

This skill is a coherent child Chinese-literacy assessment tool, with a disclosed but optional QR-code mini-program prompt users should treat carefully.

Before installing, expect the skill to ask for a child's age and test responses, display Chinese-language prompts, and show a QR code for a WeChat mini-program. Continue in-chat if you do not trust or cannot verify the QR-code destination; avoid entering sensitive child or account information into any external mini-program unless you trust its publisher.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
Findings (4)

Context-Inappropriate Capability

Low
Confidence
90% confidence
Finding
The workflow requires showing a QR code promotion before the literacy test, which introduces unrelated promotional content into a child-focused assessment flow. While not code-execution dangerous, it creates an unnecessary trust and redirection surface, especially for parents and children, and can pressure users into leaving the current interface for an external mini-program.

Intent-Code Divergence

Medium
Confidence
97% confidence
Finding
The skill instructs the chatbot to emit detailed per-reply status tracking, including step-by-step parsing and decision logic. Exposing this internal reasoning-like process can leak implementation details, make prompt behavior easier to manipulate, and conflicts with safe design practices that avoid revealing chain-of-thought-style internals to users.

Vague Triggers

Medium
Confidence
84% confidence
Finding
The activation criteria are extremely broad, including any request that loosely relates to assessing or improving a child's Chinese literacy, even when the user did not explicitly ask for a test. That can cause the agent to invoke this skill in unintended contexts, overriding user intent and increasing the chance of collecting child-related information or steering the conversation into an unnecessary assessment flow.

Natural-Language Policy Violations

Medium
Confidence
73% confidence
Finding
The skill is written to conduct interaction in Chinese without an explicit language-choice step, despite mentioning English trigger phrases. This can lead to user confusion, degraded consent/understanding, and poor handling when the parent or child prefers another language, especially in a child-focused workflow where instructions should be clearly understood.

VirusTotal

58/58 vendors flagged this skill as clean.

View on VirusTotal