Execution Harness

v1.1.0

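Navigation entry for the agent execution reliability system. It matches when a user asks why an agent stopped, how to keep an agent from exiting early, what the execution harness is, which 40 patterns exist, what the 6-axis architecture is, or how to configure hooks. It does not perform concrete operations itself; each sub-skill handles its own.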

by @lanyasheng

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for lanyasheng/execution-harness.

Install the skill "Execution Harness" (lanyasheng/execution-harness) from ClawHub.
Skill page: https://clawhub.ai/lanyasheng/execution-harness
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

openclaw skills install execution-harness

ClawHub CLI

npx clawhub@latest install execution-harness

Security Scan

Capability signals: Requires OAuth token
These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.

VirusTotal: Benign
OpenClaw: Benign (medium confidence)

Purpose & Capability
The declared purpose is a 'navigation entry' for execution reliability patterns and explicitly says it 'does not execute operations' itself. The package contains many bash hook scripts and tests and the SKILL.md gives example settings that call those scripts (e.g., bash execution-loop/scripts/ralph-stop-hook.sh). This is explainable (the hub points users to sub-skills and shipped scripts), but there's a mild mismatch between the 'not execute' wording and the fact that the repo includes runnable scripts and example configs that will execute them if the user wires them into ~/.claude/settings.json. Review the scripts before enabling.
Instruction Scope
Runtime instructions and examples direct agents (or operator's config) to run local bash scripts and to read/write session/transcript files under the user's home (e.g., ~/.openclaw/shared-context/sessions, .working-state, ~/.claude/settings.json). The scripts operate on local filesystem artifacts, tmux panes, run linters/tests, and manipulate state (git stash, write handoff files). They do not reference external endpoints or ask for secrets in the metadata, but they do have broad local-file access — expected for this purpose, but something to audit before enabling.
Install Mechanism
No install spec is provided (instruction-only metadata), which minimizes automatic remote code installation. The bundle includes scripts and tests but no automated download-from-untrusted-URL step. The user would need to place/clone these files and add hooks manually or via their agent config; that manual step is the security control point.
Credentials
The skill declares no required environment variables, credentials, or config paths. The included scripts expect runtime-provided variables (e.g., TRANSCRIPT_PATH, NC_SESSION, COMPACTION_EXTRACT_INTERVAL) typical for hooks; they do not request API keys or tokens. No unrelated service credentials are demanded.
Persistence & Privilege
The skill is not always-enabled and does not request elevated platform privileges, but the SKILL.md explicitly recommends editing ~/.claude/settings.json to add hooks that will run the shipped scripts. Enabling those hooks gives the scripts persistent operational presence (they'll run on hook events) and lets them write into user home directories (sessions, handoffs, working-state). This is expected for a hooks-based harness, but it increases the blast radius — inspect scripts and back up configs before enabling.
Assessment
This bundle is a coherent 'execution harness' and primarily operates on local files and hooks. Before installing or enabling the hooks, do the following:
1) Inspect the included bash scripts (scripts/*) to ensure they do only what you expect. Pay attention to any 'git', 'rm', or tmux commands and to where files are written (HOME paths).
2) Back up your ~/.claude/settings.json and any repositories the harness might touch.
3) Run the scripts in a sandbox or container (or under a test HOME) to observe behavior.
4) Start by enabling only non-destructive hooks (e.g., read-only token-budget or context-usage) and review outputs before enabling hooks that run git stash, checkpoint/rollback, or send input to tmux.
5) If you run the test suite (pytest), do so in an isolated environment.
6) Only enable hooks if you trust the source or have audited the code. Although there are no external endpoints or requested credentials in the metadata, scripts with filesystem and tmux access can still disrupt local workflows if misconfigured.
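Steps 1 and 2 can be partly mechanized: back up the settings file, then list every command its hooks would run so each script can be read before it is enabled. The following is a minimal Python sketch, assuming the settings layout used in this skill's minimal configuration (event name, then a list of groups, each with a `hooks` list). The names `backup_settings` and `list_hook_commands` are illustrative and are not part of the harness.

```python
import shutil
from pathlib import Path


def backup_settings(path: Path) -> Path:
    """Copy the settings file next to itself with a .bak suffix; return the copy's path."""
    backup = path.with_name(path.name + ".bak")
    shutil.copy2(path, backup)
    return backup


def list_hook_commands(settings: dict) -> list[tuple[str, str]]:
    """Return (event, command) pairs for every command-type hook in a settings dict."""
    found = []
    for event, groups in settings.get("hooks", {}).items():
        for group in groups:
            for hook in group.get("hooks", []):
                if hook.get("type") == "command":
                    found.append((event, hook.get("command", "")))
    return found
```

A typical use would be to call `backup_settings(Path.home() / ".claude" / "settings.json")`, load the file with `json.loads`, and print the pairs from `list_hook_commands` as a checklist of scripts to audit.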


latest: vk970cf1jwma296pd7jb8canmxx84h7bd | 132 downloads | 0 stars | 4 versions | Updated 2w ago | v1.1.0 | MIT-0

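Execution Harness: Navigation Entry

40 patterns across 6 axes that help a Claude Code agent finish the job. It is not a framework and makes no model calls; it only manages hooks and scripts.

This skill does not perform operations itself. It helps you find which sub-skill to use.

What problem are you facing?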

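| Problem | Skill to use | Key pattern |
|---|---|---|
| Agent stops halfway through a task | execution-loop | 1.1 Ralph Stop Hook: blocks premature exit, with 5 safety valves |
| Says "should work" but never ran the tests | execution-loop | 1.2 Doubt Gate: detects speculative language and requires verification |
| cargo build retried 12 times | tool-governance | 2.1 Tool Error Escalation: hint after 3 failures, block after 5 |
| rm -rf destroyed uncommitted code | tool-governance | 2.3 Checkpoint + Rollback: automatic git stash |
| Design decisions forgotten after compaction | context-memory | 3.1 Handoff Documents: decisions written to disk |
| Still reading large files when context is nearly full | context-memory | 3.4 Token Budget: direct reads blocked at 80%+ |
| tmux hangs after rate limiting | error-recovery | 5.1 Rate Limit Recovery: cron scan and recovery |
| 5 agents editing the same file | multi-agent | 4.3 File Claim and Lock: exclusive lock with a 10 min TTL |
| Committed code that does not compile | quality-verification | 6.4 Test-Before-Commit: runs tests before commit |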
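To make the Tool Error Escalation thresholds concrete, here is a minimal Python sketch of the decision logic. Only the two thresholds (3 and 5) come from the table. Counting failures per command string and resetting on success are assumptions, and the class `ErrorEscalator` is hypothetical rather than the shipped `tool-error-advisor.sh`.

```python
from collections import Counter

HINT_AFTER = 3   # from the table: hint once a command has failed 3 times
BLOCK_AFTER = 5  # from the table: block once it has failed 5 times


class ErrorEscalator:
    """Track repeated failures per command and escalate the response."""

    def __init__(self) -> None:
        self.failures = Counter()

    def record_failure(self, command: str) -> str:
        """Count one more failure and return 'allow', 'hint', or 'block'."""
        self.failures[command] += 1
        count = self.failures[command]
        if count >= BLOCK_AFTER:
            return "block"
        if count >= HINT_AFTER:
            return "hint"
        return "allow"

    def record_success(self, command: str) -> None:
        """Assumption: a successful run clears the failure streak."""
        self.failures.pop(command, None)
```

With this logic a `cargo build` loop would get a hint on the third and fourth failure and be blocked on the fifth, well before reaching the 12 retries described in the table.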

6-Axis Quick Reference

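| # | Skill | Patterns | Core capabilities |
|---|---|---|---|
| 1 | execution-loop | 7 | Ralph Stop Hook, Doubt Gate, Drift Re-anchoring |
| 2 | tool-governance | 6 | Error escalation, permission veto, destructive-command interception |
| 3 | context-memory | 8 | Handoff documents, compaction rescue, Token Budget |
| 4 | multi-agent | 6 | Coordinator/Fork/Swarm, file locks, blind-review separation |
| 5 | error-recovery | 7 | Rate-limit recovery, crash recovery, model downgrade suggestions |
| 6 | quality-verification | 6 | Post-edit lint, pre-commit tests, session metrics |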
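The Doubt Gate idea (detect speculative language, then require verification) can be sketched in a few lines of Python. The phrase list, the function names, and the `verdict` return shape are all assumptions made for illustration; the shipped `doubt-gate.sh` may match different patterns and report its result differently.

```python
import re

# Hypothetical phrase list; not taken from the harness.
SPECULATIVE = re.compile(
    r"\b(should work|should be fine|probably|i think|most likely|presumably)\b",
    re.IGNORECASE,
)


def find_speculation(message: str) -> list[str]:
    """Return the speculative phrases found in a message, lowercased."""
    return [match.group(0).lower() for match in SPECULATIVE.finditer(message)]


def doubt_gate(message: str) -> dict:
    """Ask for verification when the final message contains unverified claims."""
    hits = find_speculation(message)
    if hits:
        return {
            "verdict": "verify",
            "reason": "Speculative wording found: " + ", ".join(hits)
            + ". Run the tests and report the actual result.",
        }
    return {"verdict": "pass", "reason": ""}
```

A phrase list like this is cheap but blunt: it will flag a quoted "probably" as readily as a real guess, so any real gate needs a way to be overridden.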

Minimal Configuration

Three hooks solve the three most common problems:

// ~/.claude/settings.json
{
  "hooks": {
    "Stop": [{"hooks": [
      {"type": "command", "command": "bash execution-loop/scripts/ralph-stop-hook.sh"},
      {"type": "command", "command": "bash execution-loop/scripts/doubt-gate.sh"}
    ]}],
    "PreToolUse": [{"hooks": [
      {"type": "command", "command": "bash tool-governance/scripts/tool-error-advisor.sh"}
    ]}],
    "PostToolUse": [{"matcher": "Write|Edit|MultiEdit", "hooks": [
      {"type": "command", "command": "bash quality-verification/scripts/post-edit-check.sh"}
    ]}]
  }
}
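If ~/.claude/settings.json already contains hooks, pasting the block above over the file would discard them. Below is a minimal Python sketch of a non-destructive merge, assuming both inputs are already parsed into dicts with the same layout as the configuration above. `merge_hooks` is a hypothetical helper, not something the harness ships.

```python
import copy


def merge_hooks(existing: dict, addition: dict) -> dict:
    """Return a copy of `existing` with the hook groups from `addition` appended.

    Other settings keys are preserved, the inputs are not modified, and a
    group that is already present is not added a second time.
    """
    merged = copy.deepcopy(existing)
    hooks = merged.setdefault("hooks", {})
    for event, groups in addition.get("hooks", {}).items():
        target = hooks.setdefault(event, [])
        for group in groups:
            if group not in target:
                target.append(copy.deepcopy(group))
    return merged
```

Because identical groups are skipped, running the merge twice gives the same result as running it once, which makes it safe to re-apply after an update.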

Repository

GitHub: lanyasheng/execution-harness | 90 tests | Dependencies: bash, jq, python3, pytest
