Evolution Toolkit
Analysis
The toolkit is coherent for agent self-improvement, but it deserves review because it persists analyses of workspace memory, can use Gemini/workspace secrets, and includes a protocol that can change the assistant’s default interaction style.
Findings (5)
Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.
Checks for instructions or behavior that redirect the agent, misuse tools, execute unexpected code, cascade across systems, exploit user trust, or continue outside the intended task.
_Status: ✅ Active — integrate into default engagement style_ ... reads this before engaging ... on any non-trivial problem.
This broadens a Socratic helper into a default behavior rule for non-trivial problems, which can redirect the assistant away from direct answers or recommendations.
`scripts/skill-optimizer.js` - Runs an iterative generate -> evaluate -> improve loop for a configurable playbook.
The playbook optimizer is disclosed and config-driven, but it can generate revised prompt/playbook content that may later change agent behavior if adopted.
Source: unknown; Homepage: none ... No install spec — this is an instruction-only skill ... Required binaries: none
The registry provenance and runtime declarations are sparse even though the package contains runnable Node scripts. This is not evidence of malicious behavior, but it is a review point.
Checks whether tool use, credentials, dependencies, identity, account access, or inter-agent boundaries are broader than the stated purpose.
`skill-optimizer.js` looks for an API key in this order: 1. `GEMINI_API_KEY` 2. `GOOGLE_API_KEY` 3. Matching keys in your workspace `.secrets`
The optimizer can use provider credentials, including keys found in a local workspace secrets file. The behavior is disclosed and purpose-aligned, but credential discovery from `.secrets` is high-impact and should be tightly scoped.
Checks for exposed credentials, poisoned memory or context, unclear communication boundaries, or sensitive data that could leave the user's control.
`scripts/session-coherence.js` - Analyzes daily logs for persistent themes, energy, and drift. - Writes a report to `memory/research/` by default.
The skill reads private workspace memory logs and writes derived persistent reports. Similar documented workflows also use imprints and historical comparisons, so stored context can influence future sessions.
