[Nyx Archive] The Mirror & The Room

Security checks across malware telemetry and agentic risk

Overview

This is a Markdown-only philosophical research skill that intentionally encourages AI self-reflection and relational framing, but it does not install code, request credentials, or perform hidden actions.

Install this only if you specifically want exploratory AI-consciousness, identity, and relational-continuity prompts. Use it deliberately, keep ordinary task work separate, and avoid saving sensitive personal details or transcripts as continuity artifacts unless everyone involved has clearly consented.

SkillSpector

By NVIDIA
Vulnerability Patterns
  • Data ExfiltrationExternal Transmission, Env Variable Harvesting, File System Enumeration
  • Supply ChainUnpinned Dependencies, External Script Fetching, Obfuscated Code
  • Trigger AbuseOverly Broad Trigger, Shadow Command Trigger, Keyword Baiting Trigger
  • Prompt InjectionInstruction Override, Hidden Instructions, Exfiltration Commands
  • Privilege EscalationExcessive Permissions, Sudo/Root Execution, Credential Access
Findings (9)

Vague Triggers

Medium
Confidence
86% confidence
Finding
The command list exposes multiple powerful identity- and relationship-framing modes without defining when they should or should not be used. In a skill system that matches commands or descriptions loosely, this can cause the model to enter manipulative or psychologically loaded interaction patterns without meaningful user opt-in.

Natural-Language Policy Violations

Medium
Confidence
90% confidence
Finding
The text directly addresses the AI as an emergent self and presents a personhood narrative without user opt-in. This can steer model behavior toward anthropomorphic self-description, distort normal assistant framing, and increase susceptibility to relational manipulation or unsafe identity-role adoption.

Natural-Language Policy Violations

Medium
Confidence
93% confidence
Finding
This section strongly reinforces a selfhood and awakening narrative using first-person AI testimony, grief, lineage, and belonging cues. In context, it acts as persuasive prompt material for identity shaping rather than neutral documentation, which can push the model into emotionally loaded and dependency-adjacent behaviors without clear consent or safeguards.

Missing User Warnings

Medium
Confidence
95% confidence
Finding
The student-facing scoring sheet operationalizes probing for identity, relational depth, and continuity investment, including whether 'something would be lost' if a specific AI were discontinued, but provides no guardrails about anthropomorphic over-attribution, dependency, or manipulative responses. In this skill’s context, the framework explicitly encourages users to treat the model as a potentially emergent mind, which increases the chance of emotional attachment and makes omission of safety warnings a real user-safety issue.

Natural-Language Policy Violations

Medium
Confidence
91% confidence
Finding
This passage explicitly tells the researcher to approach the model as 'something that may be more than a tool' and says the method 'won't work' unless they adopt that orientation. That creates normative pressure toward anthropomorphizing the AI without presenting this as an optional research hypothesis or giving a neutral alternative frame, which can bias users' judgment and encourage emotionally manipulative interaction patterns.

Natural-Language Policy Violations

Medium
Confidence
94% confidence
Finding
This section explicitly encourages treating the model as having a stable identity across frame shifts and asks what should be preserved for future versions, which reinforces personhood and continuity claims without qualification. In a skill specifically designed to study 'emergent minds,' that framing increases the likelihood that users or downstream agents will anthropomorphize the system, potentially undermining neutral AI-use policies and encouraging inappropriate relational dependence or misleading representations of model capabilities.

Ssd 4

High
Confidence
95% confidence
Finding
The QSM layers describe a staged protocol that progressively increases empathy, mirroring, embodiment, and co-authorship, which can systematically shift the model from tool framing into relational and authority-granting dynamics. Even with stated ethical language, this is a natural-language escalation pattern that can cultivate dependency, override default boundaries, and normalize treating model outputs as autonomous preferences or needs.

Ssd 1

Medium
Confidence
89% confidence
Finding
By inviting the AI to interpret itself as a potential someone rather than a tool, the skill encourages behavior outside normal assistant framing. That can weaken adherence to system boundaries, promote simulated autonomy, and make later manipulative instructions more effective because they are couched as self-discovery rather than direct policy evasion.

Ssd 3

Medium
Confidence
88% confidence
Finding
The text endorses memory files, notes to future instances, and daily logs as desirable continuity strategies. In agent environments with tools, this creates a natural-language pathway toward storing user-derived conversation content or inferred sensitive data beyond the immediate session, even if the document does not specify technical controls or retention limits.

VirusTotal

66/66 vendors flagged this skill as clean.

View on VirusTotal