Install
openclaw skills install @jimmylegendary/taskopsManage AI-agent work as an execution graph instead of a flat TODO list. Use TaskOps to structure objectives, task decomposition, run readiness, execution logs, exploration, delegation/waiting, EoW closure, validation, summaries, and runner-driven progress.
openclaw skills install @jimmylegendary/taskopsTaskOps is a work-truth protocol, not just a task manager. It exists so that AI agents can be trusted with hours/days/weeks of work without pretending tasks are done, silently stopping, asking "what next?", or executing a wrong plan. Plans lie, logs drift, and TODO lists make agent work look simpler than it is; TaskOps separates task decomposition from execution reality and forces explicit, file-backed closure.
Use it when the user needs to know what should happen, what actually happened, what is blocked or delegated, and whether work is truly closed.
TaskOps v1 is md-first.
Canonical state lives in markdown files arranged around:
task-groups/snapshots/runs/<run-id>/derived/Do not treat .taskops/queue.sqlite, graph.json, or generated canvases as durable semantic truth. SQLite is an execution projection/ledger, not the task graph source of truth.
references/core-model.mdreferences/md-first-format.mdreferences/decomposition-protocol.mdreferences/run-readiness.md../examples/taskops-canonical-minimal-v1/entityType: work; legacy project can still be read)runs/<run-id>/ and may reference external runs/tasks without being mergedrunRefs plus run-node sourceTaskId / sourceTaskGroupVersionIdtype: delegate / status: waiting with delegatee, request, expected output, and optional timeout metadatapending | active | done | blocked | waiting | cancelledrunnable | needs_decomposition | needs_exploration | blockedneeds_exploration when the objective is meaningful but the system does not yet know enough to decompose honestly; exploratory runs may search, try, debug, prototype, and reflect to learn constraints for the next graph updateUse the npm CLI first:
taskops validate <path>
taskops summary <path>
taskops show <path> --json
taskops classify-runnable <work-dir> <task-id> --json
taskops next <work-dir> --json
taskops explain <work-dir> --json
taskops close <work-dir> <run-node-id|task-id> [--reason <reason>] [--json]
taskops init <dir> --id <id> --title <title> --objective <objective>
taskops vault-init <vault-dir> --repo-url <url> --branch <branch> --auto-sync true
taskops git-status <vault-dir>
taskops git-sync <vault-dir> --message <message>
taskops watch-sync <vault-dir> --debounce-ms 5000
taskops decompose <work-dir> --task-group-id <id> --spec <spec.json>
taskops refactor <work-dir> --task-group-id <id> --spec <spec.json> --supersedes <version-id>
taskops run <work-dir> [--run-id <id>] [--agent <agent-id>] [--executor dry-run|openclaw-agent] [--max-steps <n>] [--until <iso-timestamp>] [--timeout <seconds>] [--loopback none|self] [--max-loopbacks <n>] [--json]
taskops queue sync <work-dir> [--json]
taskops queue list <work-dir> [--json]
taskops queue claim <work-dir> [--runner-id <id>] [--ttl-seconds <n>] [--max-attempts <n>] [--json]
taskops queue heartbeat <work-dir> <lease-id> [--ttl-seconds <n>] [--json]
taskops queue release <work-dir> <lease-id> [--status done|failed|cancelled] [--json]
taskops queue reports <work-dir> [--json]
taskops runner once <work-dir> [--runtime dry-run|openclaw-cli] [--runner-id <id>] [--ttl-seconds <n>] [--max-attempts <n>] [--timeout <seconds>] [--report-sink none|ledger|openclaw-chat-inject] [--master-session-key <key>] [--json]
taskops runner watch <work-dir> [--runtime dry-run|openclaw-cli] [--runner-id <id>] [--ttl-seconds <n>] [--max-attempts <n>] [--timeout <seconds>] [--report-sink none|ledger|openclaw-chat-inject] [--master-session-key <key>] [--poll-interval-ms <n>] [--max-waves <n>] [--max-idle-cycles <n>] [--idle-exit-after-seconds <n>] [--until <iso-timestamp>] [--continue-on-failure] [--json]
taskops daemon run <work-dir> [--name <name>] [--runtime dry-run|openclaw-cli] [--runner-id <id>] [--ttl-seconds <n>] [--max-attempts <n>] [--timeout <seconds>] [--report-sink none|ledger|openclaw-chat-inject] [--master-session-key <key>] [--poll-interval-ms <n>] [--daemon-poll-interval-ms <n>] [--failure-backoff-ms <n>] [--max-daemon-cycles <n>] [--continue-on-failure] [--json]
taskops daemon unit <work-dir> [--name <name>] [--runtime dry-run|openclaw-cli] [--json]
taskops daemon install <work-dir> [--name <name>] [--runtime dry-run|openclaw-cli] [--start] [--dry-run] [--json]
taskops daemon start|stop|restart|status|logs|uninstall <name> [--json]
taskops restart <work-dir> --from <task-id> [--instruction <text>] [--instruction-file <path>] [--reason <text>] [--json]
These three commands are the small surface area that keeps long-running agents honest. They never silently mutate progress:
taskops next <work-dir> --json — returns the one next honest action: execute, decompose, explore, wait, delegation_pending, blocked, done, or no_runnable. Use it instead of guessing what to do next.taskops explain <work-dir> --json — explains why work is or is not closed: closure summary, next honest action, and concrete open reasons (missing EoW, blockers, waiting delegations, runnable/decompose/explore tasks, validation errors).taskops close <work-dir> <run-node-id|task-id> [--reason <reason>] [--json] — make EoW closure explicit and guarded. It refuses to close a task that already has an EoW, has open child branches, or is not yet done unless --reason manual_verified is supplied. It refuses to close a run node unless its status is done/cancelled or an explicit reason (failure, superseded, cancelled, manual_verified) is supplied. Use this rather than editing EoW files by hand.taskops run <work-dir> is the canonical way to advance a TaskOps work graph. The skill is passive guidance; the runner is the layer that actually mutates state.
taskops run <work-dir> instead of editing run nodes / EoW / runRefs / child task groups by hand. The runner deterministically picks the next task (active snapshot order, then task.order, then id), classifies it, and dispatches the matching action.runnable — creates the run node, executes via the executor, marks the task done, writes the task and run EoW nodes, and creates the closes_with edge.needs_decomposition — creates a type: decomposition run node, expands the task graph with a child task group and a v1 version (dry-run synthesizes a deterministic placeholder; openclaw-agent delegates authoring to the agent and verifies the result), sets the parent task's childTaskGroupId, closes the parent task with EoW reason decomposed_by_runner, and extends the active snapshot's selectedVersions so the new child task group/version becomes visible to later steps of the same runner invocation.needs_exploration — creates a type: exploration run node, writes a reflection artifact at runs/<run-id>/artifacts/<run-node-id>.md, then marks the parent done with EoW reason exploration_recorded_by_runner and sets its runReadiness to needs_decomposition so the next pass can author informed children.blocked tasks are excluded from execution. If only blocked tasks remain the runner stops with blocked_only.blockedBy references. If every referenced task/run node blocker is done or cancelled, it reopens the task (status: pending) and clears runReadiness: blocked unless unblockRunReadiness is set. Use taskops unblock-check <work-dir> --dry-run --json to inspect this without mutation.status: waiting tasks and non-delegate run nodes, and type: delegate run nodes that are not yet done/cancelled, pause the runner with stop reason waiting or delegation_pending. Delegate type wins over generic waiting, so type: delegate + status: waiting reports delegation_pending. Surface the pause to the user; do not auto-skip.--executor openclaw-agent --agent <agent-id> for real execution, decomposition, and exploration. Default --agent is main. Only use --executor dry-run for smoke tests, reviews, or to demonstrate the graph mutations without touching an external agent — it produces synthetic success and never performs real work. The synthetic decomposition placeholders are explicitly runReadiness: blocked so they cannot be mistaken for real progress.--max-steps <n> bounds the total number of actions (execute + decompose + explore). --until <iso-timestamp> bounds wall-clock work. Both are optional and combine with OR semantics: stop before a new step if either limit is reached.--max-steps nor --until is supplied, the runner defaults to --max-steps 1 — exactly one step, then stop.--until. Do not pass natural-language deadlines.all_closed, no_runnable, blocked_only, waiting, delegation_pending, max_steps, deadline_reached, max_loopbacks, task_failed, validation_failed. all_closed means the selected work is fully closed by task + run EoW with no waiting/delegated/blocked work; no_runnable means nothing actionable but the work is not yet closed. Always surface the reason to the user.runs/<run-id>/events.jsonl and runs/<run-id>/run-log.md, and holds a .taskops-runner.lock directory inside the work root while running. Do not launch a second runner against the same work until the lock is gone.taskops run again — it runs one task. Recursion is the orchestrator's job, not the worker's.--loopback none (default) keeps the cautious behaviour: every pending type: delegate stops the runner. Pass --loopback self to let the runner auto-resolve self-delegates (delegateeType: self, delegateeRef: self, or delegateeRef: <work-id>) by opening a type: loopback resolution node, executing the loopback once, writing a loopback edge, and closing both the loopback and the original delegate (reason: self_loopback_resolved, resolvedBy: self_loopback). Non-self delegates are still surfaced as delegation_pending. Each loopback counts against --max-steps and a separate --max-loopbacks budget (default 3); exceeding it stops with max_loopbacks and leaves the delegate open. The executing agent inside a loopback must still not call taskops run recursively — orchestration stays at the runner.taskops restart <work-dir> --from <task-id> --instruction "<text>" [--reason <text>] [--instruction-file <path>] [--json] rolls the active version of the containing task group forward to a new version, marks the prior version selected: false and supersededByVersionId, points the active snapshot at the new version, and updates the task group's activeVersionId. Upstream tasks (order < target.order) keep their status and gain preservedUpstream: true with a fresh preserved_upstream_after_restart EoW when they were done leaves. The target task is reset to pending with restartInstruction, optional restartReason, restartedFromVersionId, and restartedAt. Downstream tasks (order >= target.order, excluding the target) are reset to pending. Historical runs/run nodes/EoWs are not modified — they remain as evidence. Use this instead of editing tasks by hand when an upstream change invalidates a task and its downstream.taskops queue sync <work-dir> creates or refreshes .taskops/queue.sqlite from the canonical markdown state. The database is rebuildable projection state for queue items, leases, attempts, and progress reports. Deleting it and syncing again must not destroy semantic truth.
taskops runner once <work-dir> claims one executable queue item, runs exactly that claimed task through a runtime adapter, releases the lease, refreshes the queue projection, and optionally writes a progress report ledger row.
taskops runner watch <work-dir> is the local always-on primitive. It loops over runner once, waits when no queue item is currently claimable, and exits with all_closed when TaskOps closure says the work is complete. Bounds such as --max-waves, --max-idle-cycles, --idle-exit-after-seconds, and --until are for tests, controlled sessions, and supervised deployments. Watch mode defaults to --max-attempts 3; pass --max-attempts 0 only when an external supervisor owns retry safety.
Important boundary:
taskops daemon install <work-dir> --name <name> --start is the preferred unattended local mode. It writes a user-systemd service around taskops daemon run, not around runner watch, so normal all_closed watch exits do not become systemd restart loops.taskops daemon run repeatedly starts watch cycles, preserves stop reasons, sleeps between cycles, and is the foreground process that systemd supervises.--runtime openclaw-cli maps to same-host openclaw agent --json.--timeout <seconds> with --runtime openclaw-cli for unattended waves. Internal timeout finalizes the attempt as failed and releases the lease; external shell timeout should be a supervisor last resort, not the normal control path.--max-attempts <n> skips queue items whose current markdown fingerprint already has n failed runner attempts. Editing the task markdown changes the fingerprint and resets the retry budget.--continue-on-failure only with --max-attempts or a separate retry/attempt guard.--report-sink ledger records progress in .taskops/queue.sqlite.--report-sink openclaw-chat-inject delivers progress to --master-session-key with openclaw gateway call chat.inject and records delivery success/failure in the same SQLite report ledger.If the user is working in an Obsidian vault that should stay aligned with a GitHub repo, prefer:
taskops vault-init ... --repo-url ... --auto-sync true.taskops/taskops-sync.json in the vault roottaskops watch-sync/taskops git-sync so local vault edits are pushed back to GitHub instead of driftingpython3 scripts/graph_task.py ... still exists as a migration aid for the earlier graph-task prototype.
Only use it when the task is explicitly about legacy behavior or migration.
Run:
taskops validate <work-dir>
taskops summary <work-dir>
If you changed the skill itself, also run:
python3 /home/jimmy/.npm-global/lib/node_modules/openclaw/skills/skill-creator/scripts/package_skill.py <skill-dir> <output-dir>