Smart Web Monitor (智能网页监控)

v1.0.0

Smart web monitor with AI-powered matching. Unlike keyword/regex monitors, this skill uses the agent's own LLM reasoning to evaluate whether a web page match...

⭐ 0· 121·0 current·0 all-time

bySkywalker326@skywalker-lili

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for skywalker-lili/jclaw-smart-web-monitor.

Previewing Install & Setup.

Prompt PreviewInstall & Setup

Install the skill "Smart Web Monitor (智能网页监控)" (skywalker-lili/jclaw-smart-web-monitor) from ClawHub.
Skill page: https://clawhub.ai/skywalker-lili/jclaw-smart-web-monitor
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install jclaw-smart-web-monitor

ClawHub CLI

Package manager switcher

npx clawhub@latest install jclaw-smart-web-monitor

Security Scan

VirusTotal

Benign

View report →

OpenClaw

Benign

high confidence

✓

Purpose & Capability

Name/description match the implementation: scripts fetch webpages, extract text, run simple local matchers and emit JSON for LLM evaluation. No unrelated credentials, binaries, or surprising privileges are requested.

ℹ

Instruction Scope

SKILL.md instructs the cron/agent to run fetch and have the agent perform LLM-based matching on extracted page text. This is expected, but it means the agent will process (and depending on platform settings, send) the full extracted page text to the model and/or to announce channels (example uses Discord announcements). That behaviour is functionally needed but increases data exposure risk — monitor contents may include sensitive information if you point it at internal or authenticated pages.

✓

Install Mechanism

No install spec; this is an instruction + script bundle only. All files are included in the skill package and nothing is downloaded or executed from external URLs during install.

✓

Credentials

The skill declares no required environment variables or credentials. The code does allow optional per-URL 'headers' in monitor configs — those could carry credentials if you add them, so review monitor JSON before enabling.

✓

Persistence & Privilege

always:false (no force-inclusion). The skill stores monitor JSON and report files under its own skill directory and does not modify other skills or global agent configs. Cron integration relies on platform cron features but that is expected.

Assessment

This skill appears to do what it says: fetch pages, extract text, and let the agent (LLM) decide matches. Before enabling or scheduling monitors: 1) Review each monitor JSON in the monitors/ directory — don't add internal-only or authenticated URLs unless you understand the exposure. 2) Check any 'headers' fields in monitor configs to ensure you are not embedding API keys, cookies, or Authorization headers that could be leaked. 3) Remember LLM matching runs in the agent context: extracted page text may be sent to your configured model provider or posted to notification channels (the SKILL.md examples use Discord announce); make sure those destinations and your model usage policy are acceptable. 4) If you need strict data isolation, avoid LLM match type and use local keyword/regex/css/jsonpath matching only. 5) Optionally run the scripts manually (fetch/run) to inspect outputs before adding cron jobs that announce results to users or channels.

Like a lobster shell, security has layers — review code before you run it.

latestvk971nt08m0cezxdakryz2ggp5h83w5g0

121downloads

0stars

1versions

Updated 4w ago

v1.0.0

MIT-0

Smart Web Monitor (智能网页监控)

Watch any web page for changes — with AI-powered understanding. Instead of rigid keyword matching, describe what you're looking for in natural language and let the agent decide.

Why "Smart"?

普通监控	Smart Web Monitor
关键词匹配：`risk manager`	自然语言：`"是否有风控岗位？排除实习"`
静态正则：`risk.{0,20}manager`	Agent 理解上下文语义
匹配即通知	匹配后 Agent 可以总结、分析、过滤
一次判断	Agent 持续推理，理解页面结构变化

Quick Start

Create a Monitor

# Simple: keyword match
python3 scripts/monitor.py create \
  --name "hk-risk-jobs" \
  --url "https://careers.example.com/risk" \
  --match-type keyword \
  --match-value "risk manager,hong kong,compliance"

# Smart: LLM match (agent evaluates with natural language understanding)
python3 scripts/monitor.py create \
  --name "hk-fintech-news" \
  --url "https://36kr.com" \
  --match-type llm \
  --match-value "是否有香港金融科技、虚拟银行相关新闻？排除广告。"

Run (keyword/regex/css/jsonpath)

python3 scripts/monitor.py run --event hk-risk-jobs
# → auto-matches, auto-pauses on match

Fetch for LLM (extract text only)

python3 scripts/monitor.py fetch --event hk-fintech-news
# → outputs JSON with page text for LLM agent to evaluate

After Match — Pause & Resume

python3 scripts/monitor.py resume --event xxx   # 继续
python3 scripts/monitor.py pause --event xxx    # 暂停
python3 scripts/monitor.py disable --event xxx  # 停用
python3 scripts/monitor.py enable --event xxx   # 启用

Match Types

Type	How It Works	Smart?	Example
`keyword`	Comma-separated keywords (case-insensitive)	❌	`risk manager,hong kong`
`regex`	Regular expression pattern	❌	`risk.{0,20}manager`
`css`	CSS selector — matches if elements exist	❌	`.job-listing .title`
`jsonpath`	JSONPath for API responses	❌	`$.data.jobs[*].title`
`llm`	🤖 Agent evaluates with LLM reasoning	✅	`是否有风控岗位？排除实习`

🤖 LLM Match (Smart Matching — 核心功能)

This is what makes the monitor "smart". Instead of rigid pattern matching, the agent reads the page and thinks.

How it works:

monitor.py fetch — fetches page, extracts clean text → outputs JSON
Cron agent — reads the text, evaluates using its own LLM reasoning
Agent decides: does this match the condition?
If matched → notifies you with a summary → auto-pauses (won't spam)
If not matched → silently skips

This enables natural language conditions that would be impossible with regex:

"是否有新的职位空缺？排除已见过的"
"页面上有没有提到 Series B 或以上的融资？"
"是否出现了与 AI 安全相关的文章？排除广告和推广内容"

State Machine

active ──(match found)──→ paused ──(resume)──→ active
   │                                            ↑
   └──(disable)──→ disabled ──(enable)──────────┘

Monitor Config Format

{
  "name": "hk-risk-jobs",
  "description": "Hong Kong risk management jobs",
  "urls": [{"url": "https://careers.example.com/risk", "label": "Example"}],
  "match": {
    "type": "llm",
    "value": "是否有香港风控或合规岗位？排除实习。"
  },
  "interval": 7200,
  "state": "active",
  "enabled": true
}

Cron Integration

For keyword/regex/css/jsonpath monitors:

openclaw cron add \
  --name "Monitor: hk-risk-jobs" \
  --cron "0 */2 * * *" \
  --tz "Asia/Hong_Kong" \
  --session isolated \
  --message "Run monitor: cd /home/node/.openclaw/workspace/skills/web-monitor && python3 scripts/monitor.py run --event hk-risk-jobs" \
  --announce --channel discord --to "user:YOUR_ID" --light-context

For LLM monitors:

openclaw cron add \
  --name "Monitor: hk-fintech-news" \
  --cron "0 */3 * * *" \
  --tz "Asia/Hong_Kong" \
  --session isolated \
  --message 'You are a web monitor agent. Run: cd /home/node/.openclaw/workspace/skills/web-monitor && python3 scripts/monitor.py fetch --event hk-fintech-news. Read the output JSON. For each URL, evaluate if the page content matches the condition: "是否有香港金融科技相关新闻？排除广告。" If matched: output a brief summary of what matched. If not matched: say "no match". If matched, also run: python3 scripts/monitor.py pause --event hk-fintech-news' \
  --announce --channel discord --to "user:YOUR_ID" --light-context

All Commands

Command	Usage
`create`	Create new monitor
`run --event X`	Run with built-in matching
`fetch --event X`	Fetch page text (for LLM)
`list`	Show all monitors
`show --event X`	Show config
`resume/pause/disable/enable`	State management
`delete --event X`	Delete monitor
`add-url`	Add URL to monitor

Comments

Loading comments...