Install
openclaw skills install @tommot2/autoresearch-pilotGuide for setting up and running Karpathy's autoresearch — autonomous AI-driven LLM training experiments. Helps write program.md, interpret results, and optimize configs for different GPU sizes. Use when: (1) setting up autoresearch, (2) writing or improving program.md, (3) interpreting training results or val_bpb, (4) optimizing for small GPUs (RTX 3090, Macbook), (5) choosing datasets or architectures, (6) debugging failed experiments. Homepage: https://clawhub.ai/skills/autoresearch-pilot
openclaw skills install @tommot2/autoresearch-pilotInstall: clawhub install autoresearch-pilot
Your co-pilot for Karpathy's autoresearch — autonomous AI-driven LLM training experiments on a single GPU.
Detect from user's message language. Default: English.
Autoresearch lets an AI agent modify train.py, run 5-minute experiments, check if val_bpb improved, and iterate. This skill helps you set it up, write optimal program.md, and interpret results.
| File | Role | Modified by |
|---|---|---|
prepare.py | Data prep, tokenizer, utilities | Never (fixed) |
train.py | Model, optimizer, training loop | The AI agent |
program.md | Instructions for the AI agent | You (the human) |
Walk the user through these steps when they want to start:
uv package managergit clone https://github.com/karpathy/autoresearchuv sync inside the repouv run prepare.py (one-time, ~2 min)uv run train.py (should complete in ~5 min)When the user has a smaller GPU, suggest these prepare.py changes:
vocab_size to 4096 or 2048 (or 256 for byte-level)MAX_SEQ_LEN to 256DEPTH to 4 in train.pyWINDOW_PATTERN of "L" onlyTOTAL_BATCH_SIZE to 2**14When the user asks for help with program.md, help them define:
Example structure for program.md:
When the user shares experiment logs:
| Metric | Good | Bad |
|---|---|---|
| val_bpb decreasing | Model is learning | Check for bugs |
| val_bpb plateaued | May need architecture change | Normal for small models |
| Training loss << val loss | Overfitting | Increase regularization |
| NaN loss | Learning rate too high or instability | Lower LR, check gradients |
| User says | Action |
|---|---|
| "set up autoresearch" | Walk through setup steps |
| "help me write program.md" | Draft research instructions |
| "my val_bpb is X" | Evaluate and suggest next steps |
| "optimize for small GPU" | Suggest parameter changes |
| "what should I try next" | Analyze recent experiments, propose new direction |
Install the full suite:
clawhub install autoresearch-pilot setup-doctor context-brief model-pilot