Agent Provenance

v1.1.0

Track authorship, review status, and governance of agent instruction files. Adds provenance headers, commit conventions, TTL on agent-written goals, and peri...

0· 19·0 current·0 all-time
byRich@richgoodson
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name and description match the runtime instructions: the SKILL.md describes adding provenance headers, commit conventions, TTLs, diffs, and periodic checks. There are no unexpected environment variables, binaries, or installs requested that would be unrelated to a file-governance tool.
Instruction Scope
Instructions stay within the governance domain (adding headers, splitting files, creating diff reports, scanning headers). They assume access to the agent workspace and a VCS (git) and suggest posting summaries to a review channel; those are reasonable for the purpose but are somewhat underspecified (which files count as 'instruction files', which review channel/endpoint to post to, and how the agent authenticates to that channel). The agent will need read/write access to repository files and commit history — appropriate for this skill but worth explicitly restricting to only the repositories intended for governance.
Install Mechanism
Instruction-only skill with no install spec and no code files. This minimizes installation risk; nothing is downloaded or written to disk by a packaged installer.
Credentials
No environment variables, credentials, or config paths are requested. The only implicit requirements are access to the workspace/repo and to whatever channels are used to post reports — those are proportional to a provenance/audit workflow.
Persistence & Privilege
always:false and normal autonomous invocation defaults apply. The skill does not request permanent elevated presence or modification of other skills' configs. It prescribes changing files and commit messages in the agent's workspace, which is appropriate for a provenance tool but should be constrained to intended repos.
Assessment
This skill appears coherent and low-risk, but before installing you should: confirm which files are considered "instruction files" (limit the scope to specific directories/repos), ensure the agent has only the minimum filesystem/git access required (no access to unrelated repos or secrets), decide and configure where "diff reports" are posted (explicit channel and credentials) and how the agent will authenticate to that channel, and enforce a policy that prevents the agent from updating human-authored files without explicit human approval. If you need higher assurance, add immutable audit logging (separate repo or append-only log) and test the workflow in a sandbox repository first.

Like a lobster shell, security has layers — review code before you run it.

latestvk97501bkhmxebqzks2h3cp3k8n846wdr

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

SKILL.md

Agent Provenance

Who authored your agent's instruction files?

Agents modify their own instruction files. Without tracking, human-authored rules become indistinguishable from agent-authored additions. This skill provides lightweight governance to maintain that distinction.

The Problem

An agent writes operational knowledge into the same files that contain human directives. Over time:

  • No one knows which rules the human set vs. which the agent inferred
  • Agent-written goals persist indefinitely without re-authorization
  • Config drift is invisible — changes accumulate without review
  • There's no audit trail for who changed what or when

File Authority Levels

Split files by who owns them:

LevelExamplesAgent can modify?
Human-authoredIdentity, principles, rulesOnly with explicit human direction
MixedOperational procedures, heartbeat configHuman sets policy, agent maintains procedures
Agent-authoredLearnings, session state, daily notesAgent writes freely, human reviews periodically

The key split: human rules go in one file (e.g., PRINCIPLES.md), agent-derived learnings go in another (e.g., LEARNINGS.md). Never mix them.

Provenance Headers

Add an HTML comment header to every instruction file:

<!--
  provenance: human-authored | agent-authored | mixed
  description: what this file is
  last-reviewed: YYYY-MM-DD
  reviewed-by: [human name] | [agent name]
-->

Rules:

  • Only the human updates last-reviewed and reviewed-by on human-authored files
  • Agent updates these fields freely on agent-authored files
  • On mixed files: agent can update last-reviewed when making procedural changes, but should note it was an agent review

Commit Message Convention

Tag commits to create an audit trail:

TagMeaning
[human-directed]Human explicitly asked for this change
[agent-autonomous]Agent decided independently
[heartbeat]Change made during a heartbeat cycle
[cron]Change made by a cron/background task

Use in workspace/config repos only. For software projects that may be open-source or shared, use plain descriptive commit messages — provenance tags are AI fingerprints.

TTL on Agent-Written Goals

Any goal, task, or backlog item the agent writes gets a date stamp:

- [ ] Build the deploy script (added: 2026-04-01)

If an agent-written goal is older than 14 days and the human hasn't interacted with it:

  • Do not silently keep following it
  • Ask the human whether it's still valid
  • Remove or re-authorize based on their response

This prevents stale agent-written goals from driving behavior indefinitely.

Instruction Diff Reports

Periodically (weekly recommended), diff all instruction files against their prior state:

  1. Compare current versions of instruction files against 7-day-old versions (use git diff HEAD~7 or similar)
  2. For each changed file, note: what changed, who changed it (check commit tags), why
  3. Post the summary for human review

Implementation options:

  • Cron job: Run weekly as an isolated agent task, post to a review channel
  • Heartbeat task: Run during a heartbeat cycle if cron isn't available
  • On-demand: Run when the human asks for a "diff report"

Include a follow-up mechanism — if the human reads but doesn't respond, nudge after a set interval (e.g., 8 hours).

Stale Review Detection

During periodic checks (heartbeat or cron), scan provenance headers:

  • If any file has last-reviewed older than 30 days, flag it to the human
  • Prioritize human-authored files — those are the authority source

Setup

  1. Add provenance headers to all existing instruction files
  2. Split any file that mixes human rules and agent learnings into two files
  3. Add commit convention to your AGENTS.md or equivalent
  4. Set up a weekly diff report (cron or heartbeat)
  5. Backfill date stamps on any existing agent-written goals/tasks

Integration

  • agent-memory: Provenance headers go on MEMORY.md and other long-lived memory files. Distillation routines should respect authority levels (don't route learnings into human-authored files).
  • agent-session-state: Session state files are agent-authored by definition — provenance headers optional but useful for consistency.

Files

1 total
Select a file
Select a file to preview.

Comments

Loading comments…