buildwright

Autonomous development workflow with multi-agent Claw Architecture. Single-agent mode for simple features; multi-agent mode decomposes cross-domain work into...

MIT-0 · Free to use, modify, and redistribute. No attribution required.
1 · 378 · 0 current installs · 0 all-time installs
byRaunak Kathuria@raunakkathuria
MIT-0
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
The skill is an autonomous development pipeline that commits, pushes, and opens PRs; requiring git, gh, and a GITHUB_TOKEN with repo scope matches that stated purpose. Optional scanners (semgrep, gitleaks, trufflehog) are reasonable for the declared security-scan steps.
Instruction Scope
SKILL.md explicitly instructs the agent to read the repository, .buildwright prompt files, generate specs, run tests/lints/security scans, and commit/push via gh. Reading the codebase and those workspace files is expected for feature implementation and review, but the agent will have broad read/write access to the repository it runs in — review which repository is used and what files live there before running.
Install Mechanism
Instruction-only skill with no install steps or external downloads; lowest install risk because it doesn't write or execute bundled code. It relies on existing system binaries (git, gh).
Credentials
Only one credential (GITHUB_TOKEN) is required and is the declared primary credential; this matches the need to create branches, push commits, and open PRs. The SKILL.md recommends using a fine-grained token scoped to a single repository, which is appropriate. BUILDWRIGHT_AUTO_APPROVE is non-secret and not required.
Persistence & Privilege
The skill is not marked always:true and does not request modifications to other skills or system-wide settings. It can run autonomously (normal for skills) and will create commits/PRs if allowed — this is expected behavior for a CI/agent that automates development actions.
Assessment
This skill appears coherent, but it will be able to read and modify whatever repository you run it in and use the provided GITHUB_TOKEN to push branches and create PRs. Before enabling it in a production repo: 1) prefer a fine‑grained GitHub token limited to a single repository and only the needed permissions (Contents read/write, Pull Requests read/write); 2) test the skill in a sandbox repository first with BUILDWRIGHT_AUTO_APPROVE unset so it stops for manual approval; 3) inspect any .buildwright/agents/ prompt files in your repo to confirm they contain only review prompts (no secrets or unexpected instructions); 4) store the token securely and rotate/revoke it if you stop using the skill; 5) monitor created branches and PRs until you’re comfortable with its behavior.

Like a lobster shell, security has layers — review code before you run it.

Current versionv0.0.9
Download zip
latestvk978d1h89arrjj714gngt2wfq9837z15

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Runtime requirements

Binsgit, gh
EnvGITHUB_TOKEN
Primary envGITHUB_TOKEN

SKILL.md

Buildwright

Spec-driven autonomous development. Humans approve intent; agents handle everything else.

What this skill does

When activated, Buildwright directs the agent to:

  1. Read your codebase and steering documents
  2. Write a one-page spec (docs/specs/[feature]/spec.md)
  3. Stop for human approval — unless BUILDWRIGHT_AUTO_APPROVE=true
  4. Implement the feature with TDD
  5. Run quality gates: typecheck, lint, test, build
  6. Run optional security scans (if semgrep / gitleaks / trufflehog are installed)
  7. Run a Staff Engineer prompt-based code review
  8. Commit, push, and open a PR via gh

Requirements

Credentials (required)

CredentialPurposeScopeHow to provide
GITHUB_TOKENPush commits and open PRs via ghrepo scope (read/write)export GITHUB_TOKEN=ghp_... or configure in OpenClaw config under skills.entries.buildwright.apiKey

The token must have repo scope to push branches and create pull requests. For minimal privilege, use a fine-grained personal access token scoped to a single repository with "Contents: Read and write" and "Pull requests: Read and write" permissions.

Alternatively, if you use SSH for git push, the GITHUB_TOKEN is still needed for gh pr create. You can use gh auth login to authenticate the GitHub CLI separately.

Binaries (required)

BinaryPurpose
gitCommits and pushes
ghOpens PRs via GitHub CLI

Optional tools

BinaryPurpose
semgrepSAST security scan
gitleaks / trufflehogSecrets detection

Agent Personas (prompt-based, no binaries)

Staff Engineer and Security Engineer are prompt-engineering personas — instructions loaded from .buildwright/agents/ files in the workspace. They are not external tools or binaries. The agent adopts these personas to review specs and code using defined criteria and confidence thresholds. These files contain only prompt instructions and review checklists — no secrets or credentials.

Configuration

BUILDWRIGHT_AUTO_APPROVE (optional, not a credential)

This is an optional boolean flag that controls whether the agent waits for human approval at the spec stage. It is not a secret and not declared in requires.env because it is not required to run the skill.

ValueBehavior
Not setInteractive (default) — stops and waits for "approved" before building
falseInteractive — same as default
trueAutonomous — commits spec to git (audit trail) and proceeds without waiting

Recommendation for first use: Leave BUILDWRIGHT_AUTO_APPROVE unset until you have reviewed a few specs and are comfortable with the workflow. Start with interactive mode in a sandbox repository to observe behavior before enabling autonomous commits and PRs.

Commands

/bw-new-feature <description>

Full pipeline for new features. Auto-detects greenfield vs existing projects.

/bw-new-feature "Add OAuth2 login"

Flow: Detect (greenfield or existing?) → Research → Spec → Staff Engineer validates → Human approves → TDD build → Verify → Security scan → Code review → PR

Artifacts produced:

  • docs/specs/[feature]/research.md — what the agent found in your codebase
  • docs/specs/[feature]/spec.md — implementation plan with approaches considered

/bw-claw <feature>

Multi-agent pipeline using the Claw Architecture. Architect decomposes the feature into domain-specific claw tasks (UI, API, DB), defines interface contracts, and coordinates execution.

/bw-claw "Add profile photo upload for team members"

Flow: Architect analyzes → Decomposes into claw tasks → Defines interface contract → Claws execute per domain (TDD) → Architect integrates → Buildwright quality gates → PR

Best for: Features that cross domain boundaries (e.g., need DB schema + API endpoint + UI component).

Artifacts produced:

  • docs/specs/[feature]/claw-plan.md — decomposition plan with interface contracts
  • docs/specs/[feature]/claw-[domain].md — per-claw execution report

/bw-quick <task>

Fast path for bug fixes and small tasks (<2 hrs). No spec, no approval step. Runs security scan and code review on the changed diff before committing.

Flow: Understand → Research → TDD → Verify → Security scan → Code review → Commit

/bw-quick "Fix the login timeout bug"

/bw-ship [message]

Quality pipeline for existing work: verify → security → review → PR.

/bw-ship "feat(auth): add OAuth2 support"

/bw-verify

Quick checks only: typecheck → lint → test → build.


/bw-analyse

Analyse an existing codebase and write structured docs to .buildwright/codebase/. Creates docs from scratch if missing; auto-refreshes existing docs when BUILDWRIGHT_AUTO_APPROVE=true (only asks in interactive mode). Creates tech.md from template if it doesn't exist, then populates it with the discovered stack and commands. Run this first on any brownfield project to give every subsequent session real context.

/bw-analyse

Produces: STACK.md, ARCHITECTURE.md, CONVENTIONS.md, CONCERNS.md under .buildwright/codebase/.


/bw-plan <question or task file>

Research a question or topic and produce a written deliverable — no implementation, no commits. Use when someone asks a question or needs an analysis, plan, or report before (or instead of) writing code.

/bw-plan "what are the performance risks in this Flutter app?"
/bw-plan "plan a migration from monolith to microservices"
/bw-plan tasks/flutter-perf-review.md

Flow: Understand question/task → Clarify if needed → Research (read code + run read-only tools) → Synthesize findings → Write deliverable → Summarize

Accepts two invocation styles:

  • Inline question — describe the question or topic directly; the agent infers scope and writes to docs/plans/<slug>/<date>/plan.md
  • Task file — a structured .md file with Inputs, Rules, Research Areas, and Outputs blocks; the agent parses and executes it exactly

Use /bw-plan when you want: a performance review, architecture decision record, migration plan, technology evaluation, static analysis report, or any "research this and give me a written output" task.

Use /bw-new-feature instead when you want the plan executed (research + spec + implement + ship).

Artifacts produced: whatever the task specifies; at minimum plan.md in the output directory.

Hard constraints: never modifies source files, never commits or creates PRs, every finding must cite evidence.


/bw-help

Show all available commands.


Failure Behavior

If any gate fails after retries, the agent commits completed work, pushes, and opens a PR with a structured failure report. It does not leave orphaned branches or silent failures.

Retry Policy

GateRetriesRationale
Verify (typecheck, lint, test, build)2xFixable by the agent
Security scanNoneRequires human judgment
Code reviewNoneArchitectural decisions need humans

Security Considerations

This skill performs autonomous code changes, commits, and pull requests. Understand what it does before enabling it on repositories with sensitive or production code.

What the skill reads: Your repository source code, .buildwright/agents/ persona files (prompt instructions only, no secrets), and .buildwright/steering/ context files.

What the skill writes: Spec files under docs/specs/, source code changes, git commits on feature branches, and pull requests via gh.

What the skill does NOT do: It does not modify .env files, access secrets stores, run destructive git operations (force push, reset), or merge PRs. All changes go to feature branches with PRs for human review.

Recommended setup for first use:

  1. Start with a fork or sandbox repository, not production code
  2. Leave BUILDWRIGHT_AUTO_APPROVE unset (interactive mode) to review specs before builds
  3. Use a fine-grained GitHub token scoped to a single repository with minimal permissions
  4. Rotate tokens regularly and revoke when no longer needed
  5. Review generated PRs before merging — the skill creates PRs, it does not merge them

More Information

Full documentation, source code, and setup instructions: https://github.com/raunakkathuria/buildwright

Files

1 total
Select a file
Select a file to preview.

Comments

Loading comments…