小爱舆情AI标注(OpenAI兼容)

Security checks across malware telemetry and agentic risk

Overview

This skill does the advertised Feishu table labeling, but it under-discloses required external model calls and sends table content to an arbitrary OpenAI-compatible endpoint.

Review before installing. Use a Feishu app limited to the intended Bitable, start with a small limit, and assume table content will be sent to the configured OpenAI-compatible provider. Do not use it on confidential, regulated, or personal data unless that provider and its retention terms are acceptable. Ask the publisher to declare the model-provider inputs, allowed network destinations, privacy behavior, and dry-run or rollback process.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • MCP Tool PoisoningHidden Instructions, Unicode Deception, Parameter Description Injection
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (3)

Scope Creep

Medium
Confidence
91% confidence
Finding
The manifest declares network permission only for Feishu, but the description explicitly states the skill may call third-party model APIs in-process via OPENAI_API_KEY/OPENAI_BASE_URL/OPENAI_MODEL. This creates a security and governance mismatch: operators may believe outbound traffic is limited to Feishu while the implementation is designed to support additional destinations, which can enable unexpected data egress of table contents to external model providers.

Intent-Code Divergence

Medium
Confidence
95% confidence
Finding
The skill defines 'brand safety risk' so broadly that truthful allegations about defects, privacy issues, or safety concerns are automatically classified as risky merely because they are negative or unofficial. This creates a censorship and mislabeling mechanism that can suppress legitimate consumer complaints and safety reporting, especially in moderation or escalation pipelines.

Intent-Code Divergence

High
Confidence
98% confidence
Finding
The instruction that product-related replies 'must not admit defects' and must provide only positive guidance directly conflicts with objective risk assessment and incident handling. In practice, this can cause the system to conceal failures, mislead users during safety or privacy events, and block accurate acknowledgment of real harms.

VirusTotal

65/65 vendors flagged this skill as clean.

View on VirusTotal