Install
openclaw skills install batch-file-processorParallel batch processing of large file sets using sub-agents (summarize, analyze, extract, transform). Use when performing the same operation across many files in a directory, such as generating file indexes/summaries, batch content analysis, bulk information extraction, or format conversion. Triggers: batch process, file index, directory summary, bulk analyze, summarize files. NOT for: single file operations (just read it directly), fewer than 5 files (manual is faster).
openclaw skills install batch-file-processorProcess large numbers of files in parallel using sub-agents, avoiding main agent context overflow.
find <directory> -type f -name "*.md" | sort
Split into batches of 2-4 files each (3 is optimal).
One sub-agent per batch. Task template:
Read the following files completely and generate a brief summary (under 50 words) for each.
1. /path/to/file1.md
2. /path/to/file2.md
3. /path/to/file3.md
Return ONLY a JSON array:
[{"file": "relative/path/file1.md", "summary": "..."},...]
Key parameters:
mode: "run" (one-shot task)runTimeoutSeconds: 120 (increase to 180 for large files)label: descriptive label, e.g. idx-project-batch1Sub-agents push results on completion. Use sessions_yield to wait and collect incrementally.
Once all results are in, the main agent compiles the final deliverable (index file, report, etc.).
| Mistake | Consequence |
|---|---|
head -20 to skim file headers | Poor summary quality, key information missed |
| One sub-agent processes entire directory | Context overflow, timeout failure |
| Main agent reads all files sequentially | Context window exhausted, later files unreadable |
| One sub-agent per large directory | Large directories timeout, small ones waste capacity |
70 files → 25 sub-agents (3 files each) → parallel execution → completed in 5 minutes → high accuracy summaries
Generate a brief summary (under 50 words) for each file.
Extract the following fields from each file: project name, budget, key contacts, risks.
Return JSON: [{"file": "...", "project": "...", "budget": "...", "contacts": [...], "risks": [...]}]
Classify each file by checking for these topics: security, compliance, migration.
Return JSON: [{"file": "...", "has_security": true/false, "has_compliance": true/false, "has_migration": true/false}]
Analyze each source file: count lines, list imports/dependencies, identify main functions.
Return JSON: [{"file": "...", "lines": N, "imports": [...], "main_functions": [...]}]