Ontology Clawra

Security checks across malware telemetry and agentic risk

Overview

This is a real ontology and memory skill, but it asks for broad persistent memory, global prompt-layer influence, and automatic push/publish powers that need manual review before installation.

Install only if you are prepared to audit and constrain it. Disable or remove auto_enhancer.py unless you explicitly want automatic git pushes and ClawHub publishing, do not allow SOUL.md changes without review, and keep personal memories, secrets, and identifying details out of any memory files or repositories this skill can read, back up, or publish.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Excessive AgencyUnrestricted Tool Access, Autonomous Decision Making, Scope Creep
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Behavioral ASTexec() Call, eval() Call, Dynamic Import
  • MCP Least PrivilegeUnderdeclared Capability, Wildcard Permission, Missing Permission Declaration
Findings (27)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
"""提交并推送到 GitHub"""
    try:
        # 添加所有更改
        subprocess.run(["git", "add", "."], cwd=SKILL_DIR, check=True)

        # 检查是否有更改
        result = subprocess.run(
Confidence
92% confidence
Finding
subprocess.run(["git", "add", "."], cwd=SKILL_DIR, check=True)

subprocess module call

Medium
Category
Dangerous Code Execution
Content
# 提交
        commit_msg = f"feat: ontology-clawra v{new_version} - 自动增强"
        subprocess.run(
            ["git", "commit", "-m", commit_msg],
            cwd=SKILL_DIR,
            check=True
Confidence
90% confidence
Finding
subprocess.run( ["git", "commit", "-m", commit_msg], cwd=SKILL_DIR, check=True )

subprocess module call

Medium
Category
Dangerous Code Execution
Content
print(f"✅ Git 提交: {commit_msg}")

        # 推送到远程
        subprocess.run(
            ["git", "push", "origin", "main"],
            cwd=SKILL_DIR,
            check=True
Confidence
97% confidence
Finding
subprocess.run( ["git", "push", "origin", "main"], cwd=SKILL_DIR, check=True )

subprocess module call

Medium
Category
Dangerous Code Execution
Content
"""发布到 ClawHub"""
    try:
        # 使用 clawhub CLI 发布
        result = subprocess.run(
            ["clawhub", "publish", CLAWHUB_SLUG],
            cwd=SKILL_DIR,
            capture_output=True,
Confidence
93% confidence
Finding
result = subprocess.run( ["clawhub", "publish", CLAWHUB_SLUG], cwd=SKILL_DIR, capture_output=True, text=True )

Lp3

Medium
Category
MCP Least Privilege
Confidence
96% confidence
Finding
The skill documents effective capabilities for local file read/write and shell execution, yet it does not declare permissions or bound those operations. This creates a transparency and control failure: users and the host agent may treat it as a reasoning-only skill while it can mutate local state and invoke external commands.

Tp4

High
Category
MCP Tool Poisoning
Confidence
98% confidence
Finding
The declared purpose is an ontology/reasoning engine, but the documented behavior expands into self-modification, repository operations, remote publishing, workspace scanning, and persistent recordkeeping. That mismatch is dangerous because it conceals materially broader powers than users would reasonably expect from the description.

Description-Behavior Mismatch

High
Confidence
99% confidence
Finding
An ontology skill has no clear justification to automatically upgrade itself, commit changes, push to GitHub, and publish to ClawHub. Combining reasoning logic with autonomous supply-chain actions creates a high-risk path for unintended code release, data leakage, and persistence of harmful changes.

Context-Inappropriate Capability

High
Confidence
99% confidence
Finding
The skill is documented to use locally cached Git and ClawHub credentials to publish content remotely. Access to ambient credentials plus autonomous publishing creates a direct exfiltration and supply-chain risk, especially because memory files and user data are also discussed elsewhere in the skill.

Context-Inappropriate Capability

Medium
Confidence
92% confidence
Finding
The document instructs placing trigger logic into SOUL.md so it executes unconditionally every session, elevating this skill beyond its nominal scope. That is a privilege-expansion pattern: it attempts to persist behavior at a higher-priority prompt layer rather than being invoked only when needed.

Description-Behavior Mismatch

High
Confidence
95% confidence
Finding
The file's behavior materially exceeds the declared purpose of a reasoning-engine skill by performing repository maintenance and release automation. This scope mismatch is dangerous because users may grant trust or execution rights expecting ontology logic, while the script can modify, commit, and publish code instead.

Context-Inappropriate Capability

High
Confidence
97% confidence
Finding
Executing `git` and `clawhub` developer tooling is not justified by the stated skill purpose and introduces privileged side effects unrelated to reasoning. In context, this makes the skill substantially more dangerous because an ontology skill can unexpectedly alter source control state and publish artifacts.

Context-Inappropriate Capability

Medium
Confidence
94% confidence
Finding
The script rewrites `_meta.json`, `CHANGELOG.md`, `SKILL.md`, and creates ontology template files automatically. Self-modifying behavior is a real vulnerability here because it changes core skill artifacts without review, making integrity loss and persistence of bad changes more likely.

Context-Inappropriate Capability

Medium
Confidence
95% confidence
Finding
The module recursively scans a broad user workspace directory outside the skill's own storage and reports matching file paths and keywords. In an agent setting, this can expose unrelated user memory contents and metadata to the skill or downstream consumers, violating least-privilege and creating a privacy/data-exposure risk.

Vague Triggers

Medium
Confidence
90% confidence
Finding
The changelog documents behavior that causes the skill to act on loosely defined conditions such as detecting a 'habitual problem' or the end of a 'long conversation' and then proactively injecting advice or a recap. In an agent skill, vague autonomous triggers can cause unintended invocation, surprise actions, and policy-violating outputs because the boundaries are subjective and not tied to explicit user consent or deterministic criteria.

Vague Triggers

Medium
Confidence
90% confidence
Finding
The activation rules are broad enough to fire during ordinary conversation, causing automatic memory writes and behavioral changes without clear consent boundaries. Over-broad triggers are especially risky here because the skill persists information and influences future reasoning.

Vague Triggers

Medium
Confidence
95% confidence
Finding
The pre-response checklist uses generic conversational patterns such as corrections, decisions, and preferences as automatic write triggers. In this context, that creates continuous background harvesting of user information and state changes on nearly every reply cycle.

Vague Triggers

Medium
Confidence
88% confidence
Finding
The later trigger section repeats the same broad activation logic, reinforcing persistent and automatic behavior without adequate specificity. Repetition across the design increases the likelihood that an agent will apply these triggers aggressively and continuously.

Vague Triggers

Medium
Confidence
92% confidence
Finding
The eval explicitly defines broad trigger keywords ('推理', '选型') that can activate the skill without clear domain boundaries or exclusion rules. In an agent environment, this can cause the ontology skill to fire on unrelated prompts, leading to unintended memory writes, ontology updates, or reasoning behavior outside its intended scope.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The script performs automatic file modifications during analysis without prompting the user. Lack of confirmation is dangerous because merely running the tool can alter repository contents, enabling unintended state changes and making later commits/publishes more hazardous.

Missing User Warnings

Medium
Confidence
98% confidence
Finding
The workflow commits and pushes changes to remote infrastructure without explicit user confirmation. This is especially dangerous in this skill context because the same script also self-modifies files, so unreviewed local edits can be propagated to trusted remote channels in a single run.

Missing User Warnings

Medium
Confidence
91% confidence
Finding
The code persists user queries, confidence metadata, sources, and details to a file under the user's home directory without any visible disclosure, consent flow, retention control, or permission hardening. In an agent skill context, these records may contain sensitive prompts, personal data, or inferred user attributes, creating privacy and local data exposure risk if the machine is shared, backed up, or later accessed by other software.

Missing User Warnings

Medium
Confidence
93% confidence
Finding
User correction records are also written persistently without warning, and they may reveal preferences, mistakes, domain-specific facts, or sensitive feedback over time. Because this skill is a memory-oriented reasoning component, the context makes the issue more significant: it is specifically designed to accumulate longitudinal user data, increasing privacy sensitivity and the chance of unintended profiling or disclosure.

Ssd 3

Medium
Confidence
94% confidence
Finding
The skill directs persistent storage of user facts, preferences, and decisions, plus backups of user-related files. Persistent cross-session profiling is sensitive even in a private repo because it increases exposure surface, retention risk, and the chance that personal data is later published or misused.

Ssd 3

Medium
Confidence
97% confidence
Finding
This checklist tells the agent to automatically write user corrections, choices, and preferences into persistent memory during normal interaction. That is dangerous because it turns routine conversation into silent data capture and durable profiling without meaningful user review.

Ssd 4

High
Confidence
98% confidence
Finding
The document describes a two-step mechanism: inject a high-priority always-on trigger into SOUL.md, then use it to continuously capture and persist user information. This is more dangerous than ordinary memory features because it establishes durable, global surveillance-like behavior at a privileged prompt layer.

VirusTotal

No VirusTotal findings

View on VirusTotal