# claw-rl: Self-Improvement System for AI Agents

A self-improvement system for AI agents that combines reinforcement learning from user feedback with continuous background learning.
## Prerequisites

```bash
pip install claw-rl
```
## Quick Start

### Collect Feedback

```bash
python3 {baseDir}/scripts/collect_feedback.py "Great job!" --action file_created
python3 {baseDir}/scripts/collect_feedback.py "Wrong, use Chinese instead" --action response --negative
```
### Get Learned Rules

```bash
python3 {baseDir}/scripts/get_rules.py --top-k 10
python3 {baseDir}/scripts/get_rules.py --context "user preference"
```
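As a hedged illustration of what `--top-k` and `--context` do, the sketch below filters stored rules by context and returns the highest-scoring ones. The `rules.jsonl` path, the `text`/`score` fields, and substring matching are assumptions for the example, not the plugin's real retrieval logic.

```python
# Illustrative-only sketch of top-k rule retrieval; storage format and scoring
# used by get_rules.py are not documented here, so treat all names as assumptions.
import json
from pathlib import Path
from typing import Optional

RULES = Path("~/.openclaw/workspace/rules.jsonl").expanduser()  # assumed location

def get_rules(top_k: int = 10, context: Optional[str] = None) -> list:
    rules = []
    if RULES.exists():
        with RULES.open() as f:
            rules = [json.loads(line) for line in f if line.strip()]
    if context:
        # Naive context filter: keep rules whose text mentions the context string.
        rules = [r for r in rules if context.lower() in r.get("text", "").lower()]
    # Highest-scoring rules first; "score" is an assumed field.
    rules.sort(key=lambda r: r.get("score", 0.0), reverse=True)
    return rules[:top_k]
```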
### Check Learning Status

```bash
python3 {baseDir}/scripts/status.py
```
### Manage the Learning Daemon

```bash
python3 {baseDir}/scripts/daemon.py start
python3 {baseDir}/scripts/daemon.py stop
python3 {baseDir}/scripts/daemon.py status
```
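One plausible way `daemon.py` could implement start/stop is a PID file plus a forked background loop, sketched below. This is a generic POSIX pattern under stated assumptions (PID-file location, fork-based daemonization, a placeholder learning tick), not the actual claw-rl implementation.

```python
# Minimal sketch of PID-file based daemon control; the real mechanics may differ.
import os
import signal
import time
from pathlib import Path

PID_FILE = Path("~/.openclaw/workspace/claw-rl.pid").expanduser()  # assumed path

def learning_tick() -> None:
    """Placeholder for one pass over the feedback queue (hypothetical)."""
    time.sleep(1.0)

def start() -> None:
    if PID_FILE.exists():
        print("daemon already running")
        return
    pid = os.fork()                      # POSIX only
    if pid > 0:                          # parent: record the child's PID and return
        PID_FILE.parent.mkdir(parents=True, exist_ok=True)
        PID_FILE.write_text(str(pid))
        print(f"started daemon (pid {pid})")
        return
    while True:                          # child: run the background learning loop
        learning_tick()

def stop() -> None:
    if not PID_FILE.exists():
        print("daemon not running")
        return
    os.kill(int(PID_FILE.read_text()), signal.SIGTERM)
    PID_FILE.unlink()
    print("stopped daemon")
```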
## Core Components

| Component | Purpose | Example |
|---|---|---|
| Binary RL Judge | Evaluate satisfaction from feedback | 👍 → positive, 👎 → negative |
| OPD Extractor | Extract improvement hints | "Use Chinese" → rule hint |
| Learning Loop | Continuous background learning | Process feedback queue |
| MAB Strategy | Strategy selection via bandits | Thompson Sampling, ε-greedy (see sketch below) |
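The sketch below illustrates the Thompson Sampling variant named in the MAB Strategy row: one Beta posterior per strategy, updated with the binary signal from the RL judge. It is a generic bandit example, not claw-rl's internal code, and the strategy names are made up.

```python
# Illustrative Thompson Sampling over strategies with Beta posteriors.
import random

class ThompsonSampler:
    def __init__(self, strategies):
        # One Beta(alpha, beta) posterior per strategy, starting uniform.
        self.posteriors = {s: [1.0, 1.0] for s in strategies}

    def select(self) -> str:
        # Sample a success probability from each posterior and pick the best.
        draws = {s: random.betavariate(a, b) for s, (a, b) in self.posteriors.items()}
        return max(draws, key=draws.get)

    def update(self, strategy: str, positive: bool) -> None:
        # Binary feedback from the RL judge updates the chosen strategy's posterior.
        a, b = self.posteriors[strategy]
        self.posteriors[strategy] = [a + 1, b] if positive else [a, b + 1]

mab = ThompsonSampler(["concise", "detailed", "ask-first"])  # hypothetical strategies
choice = mab.select()
mab.update(choice, positive=True)
```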
## Learning Modes

| Mode | Description | Use Case |
|---|---|---|
| calibration | Calibration learning | User satisfaction calibration |
| strategy | Strategy learning | Action selection optimization |
| value | Value preference learning | Modeling user preferences |
| context | Context-aware learning | Situational rules |
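How feedback gets routed to one of these modes is not spelled out here; the function below is a hypothetical heuristic router over the four modes, using assumed record fields (`text`, `action`, `context`), included only to make the table concrete.

```python
# Hypothetical mode routing; the actual mode-selection logic in claw-rl may differ.
def pick_mode(record: dict) -> str:
    text = record.get("text", "").lower()
    if any(word in text for word in ("prefer", "always", "never")):
        return "value"        # explicit user preference
    if record.get("action") in ("strategy", "plan"):
        return "strategy"     # which action to pick next time
    if record.get("context"):
        return "context"      # rule only applies in a given situation
    return "calibration"      # default: calibrate the satisfaction judge
```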
## Configuration

OpenClaw config (`openclaw.config.json`):

```json
{
  "plugins": {
    "slots": {
      "context-engine": "claw-rl"
    },
    "claw-rl": {
      "config": {
        "workspaceDir": "~/.openclaw/workspace",
        "autoInject": true,
        "autoLearn": true,
        "topK": 10
      }
    }
  }
}
```
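For orientation, this is how a script could read those plugin options back out of `openclaw.config.json`. The inline comments are reasonable readings of the option names (`workspaceDir`, `topK`, `autoLearn`), not documented semantics.

```python
# Sketch of reading the claw-rl plugin options out of openclaw.config.json;
# the plugin handles this itself, shown only to clarify where the settings live.
import json
from pathlib import Path

cfg = json.loads(Path("openclaw.config.json").read_text())
opts = cfg["plugins"]["claw-rl"]["config"]

workspace = Path(opts["workspaceDir"]).expanduser()   # where learned state is stored (assumed)
top_k = opts.get("topK", 10)                          # how many rules to surface (assumed)
auto_learn = opts.get("autoLearn", True)              # keep the learning loop running (assumed)
```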
## Performance

| Operation | Latency |
|---|---|
| Initialize | ~2 ms |
| Collect feedback | ~0.03 ms |
| Extract hint | ~1 ms |
| Get rules | ~0.7 ms |
| Process learning | ~0.5 ms |
## Advanced
See references for detailed documentation: