英语笔记出题机

Security checks across malware telemetry and agentic risk

Overview

This is a coherent English-learning skill, but it uses local study files, optional Baidu API credentials, and a local microphone transcription page that users should understand before installing.

Install only if you are comfortable keeping Baidu API keys and learning records in local markdown files and using a localhost speech-recognition server for microphone practice. Do not commit the credentials file to a repository, keep the skill directory private, and close the speech server when the speaking exercise is finished.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
Findings (8)

Vague Triggers

Medium
Confidence
84% confidence
Finding
The activation phrases are very generic conversational terms such as '练英语', '英语练习', and module names like '听力' or '阅读'. In an assistant environment, broad triggers can cause accidental invocation during normal conversation, leading to unintended execution of skill behavior and unexpected access to local resources or generated tasks.

Missing User Warnings

Medium
Confidence
72% confidence
Finding
The README instructs users to place Baidu TTS credentials in a local markdown file and describes automatic maintenance of review-related files, but it does not clearly warn that sensitive credentials and study content may be stored on disk. This increases the risk of inadvertent exposure through backups, sync tools, screenshots, repository commits, or local multi-user access.

Vague Triggers

Medium
Confidence
86% confidence
Finding
The trigger list includes broad everyday phrases such as “听力”, “口语”, and “阅读”, which can cause accidental activation in unrelated conversations. In an agent skill that creates tasks, writes files, and may start local services, unintended invocation increases the chance of unwanted side effects without clear user intent.

Missing User Warnings

Medium
Confidence
88% confidence
Finding
The skill description states it will automatically create five tasks, write temporary files, and clean them up, but it does not clearly warn the user before these state-changing actions occur. This is dangerous because users may invoke the skill expecting tutoring only, while the skill modifies the local workspace and task state without informed consent.

Missing User Warnings

Medium
Confidence
93% confidence
Finding
The speaking module instructs the agent to write and run a local HTTP server that accepts microphone recordings and stores submitted results, but the document does not provide clear privacy, retention, or attack-surface warnings. Even though it binds to 127.0.0.1, it still processes sensitive voice data and opens a local service that could be misused by other local processes or expose private content through insecure handling.

Missing User Warnings

Low
Confidence
82% confidence
Finding
The document tells users to place Baidu TTS API credentials in a local reference file but does not explicitly label them as secrets or warn against accidental disclosure, logging, or committing them to shared storage. This can lead to credential leakage and unauthorized use of the third-party service.

Missing User Warnings

Medium
Confidence
93% confidence
Finding
The page records microphone audio, converts it to base64, and sends it to a local HTTP server for transcription without clearly informing the user that captured speech leaves the page and is transmitted for processing. Even though the destination is localhost, spoken content may contain sensitive personal data, and the lack of explicit disclosure and consent creates a real privacy/security issue in a voice-learning skill.

Missing User Warnings

Medium
Confidence
94% confidence
Finding
The submit flow sends both exercise targets and recognized speech results to the server without prior disclosure or confirmation. This exposes not only the user's speech-derived text but also learning content and timestamps, which can reveal study habits or personal utterances if the local service logs or forwards the data.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal