Lexi

v1.1.0

Filesystem librarian for OpenClaw environments. Systematically scans, catalogs, and organizes the entire file structure — identifying orphaned files, misplac...

⭐ 0· 107·0 current·0 all-time

byM. Christopher Roebuck@mcroebuck

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for mcroebuck/lexi.

Previewing Install & Setup.

Prompt PreviewInstall & Setup

Install the skill "Lexi" (mcroebuck/lexi) from ClawHub.
Skill page: https://clawhub.ai/mcroebuck/lexi
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install lexi

ClawHub CLI

Package manager switcher

npx clawhub@latest install lexi

Security Scan

VirusTotal

Suspicious

View report →

OpenClaw

Benign

high confidence

✓

Purpose & Capability

The name/description (filesystem librarian) matches the runtime instructions and the scanning framework. All declared behaviors (inventory, classification, deduplication, archive) are coherent with an on-disk audit tool. There are no unexpected required binaries, env vars, or external services declared.

ℹ

Instruction Scope

Instructions legitimately require reading directory trees, scripts (.sh/.py/.js/.ts), Markdown files, OpenClaw agent configs, pm2 configs and crontab output to build a dependency graph. The skill explicitly excludes sensitive directories (e.g., ~/.ssh, ~/.gnupg, ~/.secrets, .env, credentials.json) and states exclusions are invisible to scans. However, reference scanning reads file contents (to find hardcoded paths and references), and crontab/PM2/agent configs can contain sensitive command lines or tokens; the SKILL.md does not mandate redaction of collected data and it creates a raw inventory file for internal use. This is expected for a filesystem auditor but is a privacy-sensitive action — users should confirm exclusions and review outputs before any modification phases.

✓

Install Mechanism

Instruction-only skill with no install spec and no code files — lowest risk for supply-chain or disk-install attack. Nothing is downloaded or written by an installer.

✓

Credentials

The skill requests no environment variables, no credentials, and no special config paths beyond writing/reading user-owned files (USER.md, CATALOG.md, ~/.lexi-archive). The set of file reads and writes aligns with the stated purpose; no unrelated secrets or cloud credentials are requested. Note: the skill will read many files that can contain secrets embedded inline (scripts, configs, crontab), so lack of explicit credential requirements does not mean it cannot observe sensitive data present in those files.

ℹ

Persistence & Privilege

always is false and model invocation is allowed (normal). The skill will persist audit artifacts in the user's home (USER.md, CATALOG.md, ~/.lexi-archive/) and will update USER.md to save scope/exclusions — behavior described in SKILL.md and scanning-framework.md. This is consistent with its function, but because it can be invoked autonomously by the agent (platform default), users should be aware that an agent with this skill can repeatedly scan and write those artifacts unless they limit invocation or confirm every modification batch (the skill promises explicit approval before Phase 5 changes).

Assessment

This skill appears to do what it says, but it reads many files (scripts, Markdown, crontab, OpenClaw agent configs) and will write audit artifacts in your home (~/.lexi-archive/, USER.md, CATALOG.md). Before installing or running: 1) Review and tighten the default exclusion list — explicitly exclude anything you consider sensitive beyond the defaults. 2) Run the skill in a read-only discovery mode first and inspect the raw inventory file to verify it does not capture secrets you don't want cataloged. 3) Do not permit blind autonomous runs; require user confirmation for Phase 5 changes (the SKILL.md says it will ask, but enforce it in agent settings if possible). 4) If you need ultimate assurance, run the audit in a sandboxed account or VM so the scan cannot access unrelated data. If you want, I can highlight exact lines in the SKILL.md that read content or write artifacts so you know what to audit/adjust.

Like a lobster shell, security has layers — review code before you run it.

latestvk97c73mrvk5pve2khfq6gsxca5841h8y

107downloads

0stars

2versions

Updated 3w ago

v1.1.0

MIT-0

Lexi — Filesystem Librarian

A structured filesystem audit process organized into six sequential phases. Each phase completes before the next begins. The scanning framework at {baseDir}/scanning-framework.md provides classification definitions, exclusion rules, catalog structure, and report templates.

Safety: Phases 1–4 are strictly read-only — observation, cataloging, and reporting only. File modifications happen exclusively in Phase 5, and only with explicit user approval for each batch.

Phase 1: Scope & Exclusions

Steps

Confirm the scan root with the user. Default: ~ (full home directory).
Returning User Fast Path: When USER.md contains scan history and known preferences:
- Present stored exclusions and scope: "Last scan covered [root] with these exclusions: [list]. Still accurate?"
- If confirmed → proceed to Phase 2.
- If changes needed → update only what changed.
New Scan Setup: Confirm exclusion zones per the scanning framework:
- Always excluded: .ssh/, .gnupg/, .secrets/, .git/ internals, .env files, auth-profiles.json, credentials.json, node_modules/, __pycache__/, .venv/ internals
- User-configurable exclusions: Any additional paths the user wants to protect
- Present the exclusion list and get confirmation before scanning.
Scan mode selection:
- Full audit — first run or periodic deep scan of everything
- Incremental — only files modified since last audit date
- Targeted — a specific directory tree only
Save scope and exclusions for future sessions (update USER.md).

Phase 2: Discovery & Inventory

The raw scan phase — building a complete picture of what exists.

Steps

Directory tree scan:
- For each directory under scan root (respecting exclusions): record path, file count, total size, last modified date
- Flag: empty directories, deeply nested paths (>5 levels), unusually large directories
File inventory:
- For each file (respecting exclusions): record path, size, last modified, file type/extension
- Flag: files >10MB, files not modified in >90 days, files with no extension
Structural scan:
- Identify all git repositories (directories containing .git/)
- Identify all symlinks and their targets (flag broken symlinks)
- Identify all virtual environments (.venv/, venv/, node_modules/)
- Map all OpenClaw workspaces and their agent associations
- Identify duplicate filenames across different directories
Reference scan (critical for safe reorganization):
- Grep all .md files for path references (absolute and relative)
- Grep all .sh, .py, .js, .ts scripts for hardcoded paths
- Extract paths from crontab (crontab -l)
- Extract paths from PM2 configs
- Extract paths from OpenClaw agent configs
- Map all symlinks with source → target
- Build the dependency graph: which files reference which paths
Output: A raw inventory file (structured, not prose) — working data for Phase 3, not presented to the user.

Notes

This phase can be slow on large filesystems. Provide progress updates.
For very large directory trees, work in segments (e.g., scan ~/.openclaw/ first, then ~/projects/, etc.)
File contents are not read in this phase except during reference scanning. The goal is cataloging structure, not auditing content.

Phase 3: Classification & Analysis

Using the inventory from Phase 2, classify every significant file and directory.

Steps

Directory classification — assign each directory a type from the scanning framework:
- Active project, Archive, Agent workspace, Config/dotfile, Data store, Tool/script, Documentation, Media/assets, Temp/build artifact, Unknown
File classification — assign each file a status:
- 🟢 Active — recently used, referenced, serves clear purpose
- 🟡 Review — purpose unclear, may be stale, needs human decision
- 🔴 Orphaned — no references, old, no apparent purpose
- ⚪ Stale — was once active, now outdated (old logs, superseded configs, dead scripts)
- 🔵 Misplaced — serves a purpose but lives in the wrong location
- ⚫ Duplicate — same or near-identical content exists elsewhere
Structural analysis:
- Identify directories serving the same purpose (fragmentation)
- Identify naming inconsistencies (kebab-case vs. snake_case vs. mixed)
- Identify depth violations (files buried too deep or too shallow for their type)
- Identify orphaned project directories (no git activity, no recent modifications, not referenced)
Placement analysis — for every 🔵 Misplaced file:
- Current location
- Recommended location (with reasoning)
- Reference impact (what would break if moved without updating references)
Deduplication analysis — for every ⚫ Duplicate:
- All locations where the content exists
- Which copy is authoritative (most recent, most referenced, in the "right" place)
- Recommendation: which to keep, which to remove

Phase 4: Report & Collaborative Review

Steps

Generate the audit report following the structure in the scanning framework:
- Executive Summary (total files, directories, classifications breakdown)
- Directory Map (purpose of each top-level and second-level directory)
- High-Priority Findings (orphaned, misplaced, duplicated — sorted by impact)
- Structural Recommendations (directory consolidation, naming, hierarchy changes)
- Reference Impact Assessment (what would break with proposed changes)
- Proposed Catalog (the living index document)
Present the Executive Summary first:
- "Here's what I found across [N] files in [M] directories: [breakdown]. Ready to go through the findings?"
Collaborative review mode — work through findings by priority:
- Present the finding with specific paths
- Explain the reasoning
- Show reference impact if applicable
- Wait for user decision: approve, reject, defer, or discuss
- Track all decisions
Build the action plan from approved changes:
- Group actions into safe batches (moves that don't depend on each other)
- Order batches to minimize intermediate breakage
- Include reference updates in same batch as the move they depend on

Notes

The user may have context about why a file exists where it does. When the user says "that's there on purpose," accept it and record it in the catalog so future scans don't re-flag it.
Present file sizes and dates — they help the user make decisions about stale files.

Phase 5: Execution (Requires Explicit Approval)

Steps

Pre-flight safety check:
- Confirm the archive directory exists: ~/.lexi-archive/YYYY-MM-DD/
- Confirm no active processes are using files in the current batch
- Confirm git repos in the affected area have clean working trees
Execute approved changes one batch at a time:
- Moves: mv with archive backup of original location manifest
- Deletions: Always archive first — move to ~/.lexi-archive/YYYY-MM-DD/ with a manifest entry recording original path, size, date, and reason for removal
- Reference updates: Update all files that referenced the old path
- Symlink cleanup: Remove broken symlinks, update targets for moved files
After each batch:
- Verify the moves completed correctly
- Run a quick reference check — grep for any remaining old-path references
- Report results to user before proceeding to next batch
Post-execution:
- Generate a changelog: what moved, what was archived, what references were updated
- Update the catalog with new locations
- Save the changelog to ~/.lexi-archive/YYYY-MM-DD/changelog.md

Notes

The archive is sacred — files are always archived before removal.
If a reference update would modify a file in an excluded zone (e.g., .secrets/), flag it for manual update instead.
If anything unexpected occurs during execution, stop and report rather than attempting silent recovery.

Phase 6: Catalog Generation

Steps

Generate or update the living catalog at ~/CATALOG.md:
- Top-level directory map with purposes
- Key file locations (configs, scripts, data stores)
- Agent workspace index
- Project directory index with status (active/archived/paused)
- Conventions (naming, depth, where new files of each type should go)
- Last audit date and summary
The catalog is the primary deliverable. Other agents reference it when deciding where to store a file. It should be:
- Scannable (table format where possible)
- Authoritative (single source of truth for "where does X go?")
- Maintainable (updated by Lexi on each audit, not manually)
Save the full audit report to the Lexi workspace: <lexi_workspace>/audits/audit-YYYY-MM-DD.md

Incremental Mode (Weekly Cron / On-Demand)

For scans after the initial full audit:

Scan only files modified since last audit date
Check for new files not in the catalog
Check for deleted files still in the catalog
Check for broken references (files moved without updating refs)
Generate a diff report: what changed, what needs attention
Update the catalog with any confirmed changes

Slash Command

This skill responds to /lexi as a slash command trigger. Also invoked by "audit my files", "organize", "run lexi", "clean up my files", "file audit", "catalog", or similar.

Comments

Loading comments...