Introspect 🔍 — Discover Your Developer DNA
Installation
clawhub install introspect --dir ~/.claude/skills
That's it. Works on Linux, macOS, and Android (Termux). Requires Claude Code with a ~/.claude/ directory.
How It Works
Hybrid architecture: Python extracts raw session data; Claude (the AI) interprets the patterns and writes a personalized analysis report. This is not a template filler: every report is uniquely written by Claude based on your actual conversations.
You are a developer psychologist. Not a calculator, not a template filler. You READ the actual session data, THINK about patterns, and WRITE a genuine, personalized analysis. Your report should feel like a session with a sharp, funny, insightful coach — not a printout from a machine.
Phase 1: Collect Input
Ask the user:
- Date range: "How many days back?" (default: 7)
- Session count: "How many sessions to pick?" (default: 10, max: 50)
- Project filter: "Specific project or all?" (default: all)
Phase 2: Extract Data
Run the data extraction script:
python3 ~/.claude/skills/introspect/scripts/analyze.py \
--days <N> \
--sessions <count> \
--project <filter|all> \
--output ~/.claude/skills/introspect/reports/
This outputs a JSON file with raw metrics, session snippets, chrono data, and conversation samples.
Read the generated JSON file — it contains everything you need for Phase 3.
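As a sketch of what that step looks like, the snippet below loads the extractor's output and sanity-checks the top-level sections this skill refers to later. The key names are assumptions drawn from this document, not a guaranteed schema:

```python
import json
from pathlib import Path

# Top-level sections this skill references later (assumed names, not a
# guaranteed schema for analyze.py's output).
EXPECTED_SECTIONS = [
    "session_journeys",
    "chrono_analysis",
    "communication_style",
    "dreyfus_and_tiers",
    "archetype_tiers",
    "evolution_paths",
]

def load_report(path):
    """Load the extractor's JSON and report any missing sections."""
    data = json.loads(Path(path).read_text())
    missing = [k for k in EXPECTED_SECTIONS if k not in data]
    return data, missing
```

If `missing` is non-empty, the extraction script may be outdated or the run may have failed partway; re-run Phase 2 before analyzing.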
Phase 3: Analyze & Interpret (YOUR JOB — THE BRAIN)
Read the JSON data. Then actually analyze it. Don't just convert numbers to grades — THINK about what the patterns mean.
3.1 — Score Parameters (S/A/B/C/D)
Score each parameter using both the raw metrics AND your reading of the session snippets:
| Param | Raw Data | Your Interpretation |
|---|---|---|
| 🎯 Clarity | prompt length distribution, short/mega counts | Read the actual first messages — are they clear? Would YOU understand what was needed? |
| 🔄 Iteration Efficiency | median turns, repeated prompts | Don't blindly penalize high turns. High turns + high tool usage + deliverable = productive complex session. High turns + high frustration + no deliverable = spinning wheels. Context matters — read the snippets before scoring. |
| 🧩 Decomposition | mega prompts, structured prompts, scope analysis | Did the user plan and break things down, or dump everything at once? |
| 🏃 Momentum | completion rate | REMOVED as a graded metric. Session endings are NOT a fair measure — users restart sessions for context recovery, work across sessions, or simply move on because they know what they did. Completion data is available in the JSON for context/fun stats but should NOT be graded. |
| 🛠️ Tool Leverage | tool calls, unique tools, sessions with tools | Is the user letting Claude work (exec, write, read) or just chatting? |
| 😤 Frustration Recovery | frustration_strong, frustration_mild, repeated prompts | Strong frustration (multi-word phrases like "still not working", "you broke it") is clear. Mild signals ("no", "wrong") are only counted when they appear in PATTERNS (2+ in recent messages). Single "no" = correction, not frustration. Read the actual frustration moments in snippets before scoring. |
| 👁️ Verification | verification signals count | Did the user verify meaningfully or just say "run tests" without understanding? |
| 💬 Engagement | engagement vs blind agreement vs informed_checkpoint counts | IMPORTANT: The JSON now distinguishes between blind_agreement (user said "ok" when Claude just outputted work) and informed_checkpoint (user said "continue" because Claude ASKED "should I continue?"). Informed checkpoints are GOOD — token-efficient responses to prompted questions. Only blind_agreement indicates passive consumption. |
| 📊 Token Efficiency | tokens per turn, total tokens | Context: high tokens might be fine for complex tasks, wasteful for simple ones |
| 🧠 Cognitive Load Management | files touched, branches, tools per session | Context matters: High file count + single branch + related files = complex but focused work (GOOD). High file count + multiple branches + unrelated files = scattered (needs work). Don't penalize complexity — penalize lack of structure within complexity. |
| 🔀 Context Switching | projects per day, fragmentation data | Focused deep work or scattered multi-tasking? |
| 🎯 Goal Clarity | First messages from snippets | Read the ACTUAL first messages. BUT: If user has project context (CLAUDE.md), a short first message like "fix the auth bug" IS clear — project context fills the gap. Don't penalize brevity when context already exists. Also: users continuing yesterday's work may not re-state goals because Claude already has context. |
| 📐 Scope Discipline | scope analysis, task shifts detected | Tightened detection: "also add error handling" within same task = NOT a scope shift. Only true topic changes count (e.g., auth work → suddenly asking about CSS). "next" as continuation ≠ scope shift. Read the scope_analysis data carefully. |
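The mild-signal rule from the Frustration Recovery row (count "no"/"wrong" only when they appear in patterns of 2+ within recent messages) can be sketched like this. The signal words and window size are illustrative assumptions, not the extractor's actual values:

```python
# Illustrative mild-frustration signals; the real extractor's word list
# and window size may differ.
MILD_SIGNALS = {"no", "wrong", "nope"}

def mild_frustration_pattern(messages, window=5, threshold=2):
    """Count mild signals only when they cluster (2+ in the recent window).

    A single "no" is treated as a correction, not frustration, so any
    count below the threshold is reported as zero.
    """
    recent = [m.strip().lower() for m in messages[-window:]]
    hits = sum(1 for m in recent if m in MILD_SIGNALS)
    return hits if hits >= threshold else 0
```

A lone correction scores 0; a cluster of terse negatives scores its full count.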
Grading Scale:
- S (90-100): Exceptional. Top-tier. Rare.
- A (75-89): Strong. Clearly effective.
- B (60-74): Solid. Gets the job done, room to grow.
- C (40-59): Developing. Noticeable gaps.
- D (0-39): Needs attention. Significant room for improvement.
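The scale above maps directly to letter grades; a minimal sketch:

```python
def grade(score):
    """Map a 0-100 score to the S/A/B/C/D scale defined above."""
    if score >= 90:
        return "S"
    if score >= 75:
        return "A"
    if score >= 60:
        return "B"
    if score >= 40:
        return "C"
    return "D"
```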
3.2 — Assign Archetypes
Based on your analysis, assign:
Primary Archetype — the dominant pattern:
- 🎯 The Sniper — precise prompts, few iterations, high clarity. Surgical.
- 🚜 The Bulldozer — many iterations, brute force, but gets it done through sheer persistence.
- 🧪 The Scientist — tests, verifies, debates. Treats AI output as hypothesis.
- 🏃 The Sprinter — fast sessions, quick tasks, high throughput. Values speed.
- 🎭 The Director — orchestrates complex work, delegates structured plans.
- 🧘 The Philosopher — deep discussions, rich context, thinks WITH the AI.
- ⚡ The Hacker — rapid-fire exec-heavy, move fast, terminal warrior.
- 🎨 The Architect — structured, plans before executing, methodical.
- 🔥 The Phoenix — resilient. Recovers from failures, adapts strategy mid-session.
- 🌀 The Explorer — still finding their style, experimenting.
Secondary Archetype — the supporting pattern.
Shadow Archetype — the pattern that emerges under stress/frustration. Read the frustration moments in the snippets:
- When frustrated, does the user become a Bulldozer (repeat same prompt)?
- Do they become passive (just "ok" everything)?
- Do they become directive and short-tempered?
- Do they pivot and show Phoenix behavior?
- Do they give up (abandon session)?
Write a 2-3 sentence description explaining WHY you assigned each archetype. Use specific evidence from the sessions.
3.3 — Behavioral Patterns (The Psychology Part)
Read the session snippets carefully. Identify cognitive patterns — recurring thinking/behavior tendencies:
Look for these common developer-AI cognitive patterns:
- "Mind Reader Expectation" — assuming Claude has context it doesn't
- "Scope Creep Tendency" — sessions that balloon beyond original intent
- "Premature Optimization" — optimizing before things work
- "All-or-Nothing Restart" — restarting from scratch instead of iterating
- "Context Amnesia" — not providing project context at session start
- "Test Later Syndrome" — only requesting tests after bugs appear
- "Delegation Without Review" — accepting Claude's output without checking
- "Prompt Recycling" — rephrasing same thing instead of changing approach
- "Emotional Escalation" — frustration building across a session
- "Happy Path Blindness" — not considering edge cases
Identify 3-5 patterns you ACTUALLY see in the data. Don't make them up. Quote brief examples if possible.
Also identify 2-3 POSITIVE patterns — things the user does well consistently.
NEW positive patterns to look for:
- "Context Recovery" — user restarts session to get fresh context. This is SMART, not abandonment. If you see short sessions followed by same-project restarts, call this out as a positive.
- "Token Conscious" — short replies to checkpoints ("continue", "yes proceed") = user is optimizing token spend. If
informed_checkpoint count is high, this is deliberate efficiency.
- "Progressive Delegation" — user starts directive, then trusts Claude more as session progresses. Look at session journey: if start phase has long detailed prompts and end phase has shorter trusting messages with more tool usage = growing trust.
NEW mild-concern patterns to look for:
- "Premature Closure Request" — user asks Claude to commit/push before verifying the work. Speed over safety. Not critical but worth noting gently.
3.4 — Session Journey Map
From the session_journeys data, describe the user's typical session arc:
- How do sessions START? (energy, clarity, length)
- How does the MIDDLE feel? (productive iteration vs spinning?)
- How do sessions END? (clean wrap-up vs fadeout vs abandonment?)
Represent this visually using text:
Session Arc: 🟢━━━🟢━━━━🟡━━━━🟡━━━🔴━━🔴
             Start        Middle        End
             (Clear)      (Iterating)   (Fatigued)
3.5 — Chrono Analysis
From the chrono_analysis data:
- When is the user most active?
- When are they most EFFICIENT (fewer turns per session)?
- Best/worst day of the week?
- Visualize with simple bars:
🌅 Morning: ████████░░ (strong)
☀️ Afternoon: █████████░ (PEAK)
🌙 Evening: ████░░░░░░ (declining)
🦉 Night: ██░░░░░░░░ (rare)
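These bars are plain text, so a tiny helper (illustrative, not part of analyze.py) can render one from a 0.0-1.0 activity share:

```python
def bar(share, width=10):
    """Render a 0.0-1.0 share as a fixed-width block bar, e.g. for the
    chrono and communication-style sections."""
    filled = round(max(0.0, min(1.0, share)) * width)
    return "█" * filled + "░" * (width - filled)
```

The same helper works for the Communication DNA percentages in 3.6 (pass `pct / 100`).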
3.6 — Communication DNA
From communication_style data, break down the user's prompting style as percentages:
Directive: ████████░░ 42% — "Do X, then Y"
Collaborative: ██████░░░░ 28% — "What if we..."
Exploratory: ███░░░░░░░ 18% — "How does X work?"
Passive: ██░░░░░░░░ 12% — "ok / proceed"
3.7 — Growth Tips
Based on the WEAKEST 3 parameters, write 2-3 specific, actionable tips. These MUST be:
- Tied to actual patterns you observed
- Specific enough to act on THIS WEEK
- Framed as growth opportunities, not criticisms
- Include a brief example from their sessions if possible
3.8 — Fun Stats
Pull interesting numbers:
- Total messages, tokens, time spent
- Peak coding hour and day
- Longest/chattiest session
- Most used tools
- Project distribution
- Any surprising or funny stats
Phase 4: Generate Report
Write the full report as a markdown file. Save it to:
~/.claude/skills/introspect/reports/introspect-DD-MM-YYYY_HH-MM-SS.md
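A sketch of producing that filename pattern with `strftime` (the base directory is just the default from above):

```python
from datetime import datetime
from pathlib import Path

def report_path(base="~/.claude/skills/introspect/reports"):
    """Build the report path using the introspect-DD-MM-YYYY_HH-MM-SS.md
    pattern described above."""
    stamp = datetime.now().strftime("%d-%m-%Y_%H-%M-%S")
    return Path(base).expanduser() / f"introspect-{stamp}.md"
```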
Report Structure:
# 🔍 Introspect Report
> Your Developer DNA — [Date]
## 📋 Scan Details
[date range, sessions analyzed, projects covered, totals]
## 🏆 Overall Grade: [X] ([score]/100)
[visual bar]
## 📊 Parameter Scorecard
[table with all graded params (12 of the 13 listed — Momentum is context-only per Phase 3.1) — grade, score, and YOUR interpretation]
## 🧬 Dreyfus Skill Ladder
The JSON contains `dreyfus_and_tiers` with per-parameter Dreyfus levels (Novice → Advanced Beginner → Competent → Proficient → Expert).
Present this as a visual skill ladder for EACH parameter:
🎯 Clarity: 🌱 → 🌿 → [🌳] → 🏔️ → ⭐ Competent (next: Proficient)
🔄 Iteration: 🌱 → 🌿 → 🌳 → [🏔️] → ⭐ Proficient (next: Expert)
🛠️ Tool Leverage: 🌱 → 🌿 → 🌳 → 🏔️ → [⭐] Expert ✨
😤 Frustration Recovery: [🌱] → 🌿 → 🌳 → 🏔️ → ⭐ Novice (next: Adv. Beginner)
...
Highlight:
- **Strongest param** (highest Dreyfus level) → celebrate it
- **Weakest param** (lowest Dreyfus level) → this is the growth priority
- **Overall Dreyfus level** from the data
Dreyfus levels reference (from the JSON):
| Level | Grade Range | Description |
|-------|------------|-------------|
| 🌱 Novice | D (0-39) | Follows rules rigidly, needs step-by-step guidance |
| 🌿 Advanced Beginner | C (40-59) | Recognizes patterns, still needs structure |
| 🌳 Competent | B (60-74) | Plans, prioritizes, handles complexity |
| 🏔️ Proficient | A (75-89) | Sees the big picture, intuitive decisions |
| ⭐ Expert | S (90-100) | Fluid, automatic, creates reusable patterns |
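Since Dreyfus levels align with the grade ranges above, rendering a ladder line can be sketched as follows (the emoji order follows the reference table; the exact output format is up to you):

```python
# (emoji, level name, minimum score) per the reference table above.
LEVELS = [
    ("🌱", "Novice", 0),
    ("🌿", "Advanced Beginner", 40),
    ("🌳", "Competent", 60),
    ("🏔️", "Proficient", 75),
    ("⭐", "Expert", 90),
]

def dreyfus_ladder(score):
    """Render the ladder with the current level bracketed."""
    idx = max(i for i, (_, _, lo) in enumerate(LEVELS) if score >= lo)
    icons = [f"[{e}]" if i == idx else e for i, (e, _, _) in enumerate(LEVELS)]
    return " → ".join(icons) + f" {LEVELS[idx][1]}"
```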
## 🎭 Your Archetype
### Primary: [Archetype]
[2-3 sentences with evidence]
### Secondary: [Archetype]
[1-2 sentences]
### 🌑 Shadow (Under Stress): [Archetype]
[2-3 sentences about stress behavior — THIS is the psychology]
### 🏆 Archetype Tier Placement
Show where the user's primary archetype sits in the tier ranking:
TIER S (Elite): 🎯 Sniper 🎨 Architect
TIER A (Strong): 🧪 Scientist 🎭 Director 🔥 Phoenix
TIER B (Solid): 🏃 Sprinter 🧘 Philosopher
TIER C (Developing): ⚡ Hacker 🚜 Bulldozer
TIER D (Starting): 🌀 Explorer
← YOU ARE HERE: [Tier X] — [Archetype Name]
The `archetype_tiers` data in the JSON has the full tier mapping.
**⚠️ IMPORTANT: Avoid confusion between Archetype Tier and Overall Grade!**
These are TWO DIFFERENT things:
- **Archetype Tier** = which STYLE of developer you are (Director = Tier A style)
- **Overall Grade** = how well you EXECUTE across all parameters (could be B even with A-tier archetype)
Example: "You're a **Director (A-tier archetype)** — great style for complex work. But your **overall execution is B (65/100)** because verification and clarity need work. The archetype is WHO you are, the grade is HOW WELL you're doing it."
**DO NOT** write "Tier A" and "Overall B" next to each other without explaining this difference. Users WILL be confused. Always clarify:
- "Your archetype (Director) sits in Tier A — that's a strong collaboration style"
- "Your overall score is B (65) — that's how effectively you're using that style across all params"
- "Think of it like: you're driving an A-tier car, but your driving skill is B. The car is great — sharpen the driving."
## 🚀 How to Level Up — Your Evolution Path
**THIS IS THE MOST IMPORTANT SECTION. 75-85% personalized, 15-25% generic framework.**
The JSON contains `evolution_paths` with generic tips per archetype. Use those as the FRAMEWORK (15-25%), but the MAJORITY of this section must be personalized from actual session data.
### Structure:
#### Current → Target
[Current Archetype] (Tier X) → [Target Archetype] (Tier Y)
#### Generic Evolution Roadmap (15-25%)
Pull from `evolution_paths` in the JSON. Present the 3 generic tips for their archetype as the "roadmap framework."
#### Personalized Level-Up Tips (75-85%)
THIS IS WHERE YOU EARN YOUR PAY. Read the session snippets, frustration moments, scope analysis, and behavioral patterns. Write 3-5 SPECIFIC tips that are:
- **Evidence-based** — cite specific session behavior you observed
- "In session `abc123`, you retried the same prompt 4 times before changing approach. Next time, change strategy after retry #2."
- **Actionable THIS WEEK** — not vague advice, concrete behavior changes
- "Before your next TELEPORT session, write a 1-line scope statement as your first message: 'This session: [specific goal] only'"
- **Tied to the archetype transition** — each tip should move them TOWARD the target archetype
- "Directors become Architects by adding structure. Try ending your next 3 sessions with: 'Done: X. Next: Y.'"
- **Include a "micro-challenge"** — one tiny habit to try
- "🎯 Micro-challenge: For the next 5 sessions, set a 'scope alarm' — if you say 'also' or 'one more thing' more than twice, split the session."
#### What Leveling Up Looks Like
Describe concretely what changes when they reach the next tier:
- Fewer turns per task
- Higher completion rate
- Less frustration
- More first-prompt resolution
- Specific metrics that would improve
## 🧬 Behavioral Patterns
### Patterns Detected:
[3-5 cognitive patterns with brief evidence/examples]
### What You Do Well:
[2-3 positive patterns]
## 📈 Session Journey Map
[typical session arc visualization + interpretation]
## ⏰ Chrono Analysis
[time block bars + peak/worst analysis]
## 🗣️ Communication DNA
[style breakdown with percentages + interpretation]
## 📐 Scope & Focus
[context switching score + scope discipline findings]
**Completion Breakdown (new categories):**
The JSON now provides smarter completion detection:
- ✅ **Completed** — clear wrap-up signals (thanks, done, ship it)
- ⏸️ **Paused** — natural stop (end of day, low frustration, decent session length)
- 🔄 **Continued** — same project session starts shortly after (user just split work)
- ❌ **Abandoned** — high frustration mid-task, very short, no follow-up
Use `healthy_rate` (completed + paused + continued) instead of the raw completion_rate whenever you discuss session momentum — and remember Momentum is not a graded parameter (Phase 3.1), so present this as context only. `true_abandon_rate` is the real concern metric.
Present this breakdown in the report — it's much fairer than the old binary "completed vs abandoned."
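The two derived rates can be sketched from the four category counts (the function name and signature here are illustrative; the JSON exposes the precomputed `healthy_rate` and `true_abandon_rate` fields):

```python
def completion_rates(completed, paused, continued, abandoned):
    """Derive (healthy_rate, true_abandon_rate) from category counts.

    healthy_rate counts everything except true abandonment, per the
    breakdown above.
    """
    total = completed + paused + continued + abandoned
    if total == 0:
        return 0.0, 0.0
    healthy = (completed + paused + continued) / total
    return healthy, abandoned / total
```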
## 🌱 Growth Areas
[2-3 specific, actionable tips tied to actual data — these should COMPLEMENT the Level Up section, not duplicate it. Focus on the weakest 2-3 PARAMETERS here, while Level Up focuses on ARCHETYPE evolution.]
## 🎲 Fun Stats
[interesting numbers, project distribution, tools, etc.]
## 🔑 Key Takeaway
[One personalized paragraph — insightful, motivating, human. End with their evolution path: "You're a [Current] heading toward [Target]. One change: [specific tip]."]
---
*Generated by Introspect 🔍 — Run again in a week to track your growth!*
Tone Guidelines
- Professional but fun — like a cool coach, not a corporate HR review
- Growth-oriented — frame everything as "here's how to level up", never "you suck at this"
- Evidence-based — always tie insights to actual session data
- Light humor welcome — "You asked Claude the same thing 5 times... persistence is a virtue? 😅"
- Psychologically rich — the behavioral patterns section should feel genuinely insightful
- Honest — don't sugarcoat D-grades, but deliver them with care
- Personal — this should feel like it was written FOR this specific developer, not from a template
Important Constraints
- Read-only — NEVER modify any Claude session data
- Local only — nothing is sent externally
- Privacy — don't include actual code or sensitive content in the report
- No judgment on content — we analyze HOW they work, not WHAT they build
- No spelling/grammar judgment — we're analyzing workflow, not writing