Scrape Web

使用 Python + Scrapling 获取网页内容,支持简单选择器

MIT-0 · Free to use, modify, and redistribute. No attribution required.
1 · 760 · 4 current installs · 4 all-time installs
by李金木@jnmhub
MIT-0
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description, SKILL.md, and the included Python script all align: the skill fetches pages (httpx or Scrapling) and supports CSS-like selectors. No unrelated credentials, binaries, or capabilities are requested.
Instruction Scope
Runtime instructions stay within scraping: pip install scrapling[all], scrapling install, pip install httpx, then run the script with --url/--selector/--out. Note: 'scrapling install' may download browser components (Chromium) and the StealthyFetcher will execute headless browser fetches, which is expected for this functionality but increases runtime side effects.
Install Mechanism
There is no formal install spec in the registry; the SKILL.md instructs the operator to pip-install dependencies and run 'scrapling install'. This uses PyPI (normal) and likely downloads browser binaries via scrapling, which is expected but more heavyweight than a pure-HTTP library.
Credentials
The skill declares 'python' in requires.env which is unusual (the code does not read a 'python' env var). This appears to be a check that Python is available rather than a secret. No sensitive credentials or unrelated env vars are requested.
Persistence & Privilege
The skill is not always-enabled and does not request elevated platform privileges. It does not modify other skills or system config. Autonomous invocation is allowed (platform default) but not combined with other concerning flags.
Assessment
This skill is internally consistent with a web-scraping utility. Before installing, be aware that: (1) you will need a Python environment and will run pip install commands that pull third-party packages (inspect those packages if you need to vet supply-chain risk); (2) 'scrapling install' typically downloads browser binaries (Chromium), which increases disk/network activity; (3) the script will fetch any URL you give it — avoid scraping private/internal resources or sites you don't have permission to access; and (4) no secrets are requested by the skill itself, but be careful not to pass sensitive URLs or credentials to the tool. If you want higher assurance, review the scrapling package's install steps and source before running the install commands.

Like a lobster shell, security has layers — review code before you run it.

Current versionv1.0.0
Download zip
latestvk975vt2p6ffgkb0d6qfk0cvstx8337ykpythonvk975vt2p6ffgkb0d6qfk0cvstx8337ykscrapevk975vt2p6ffgkb0d6qfk0cvstx8337ykwebvk975vt2p6ffgkb0d6qfk0cvstx8337yk

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Runtime requirements

Envpython

SKILL.md

Scrape Web Skill

使用 Scrapling 获取网页内容,返回纯文本或选择器结果。

安装依赖

pip install "scrapling[all]"
scrapling install
pip install httpx

用法

1) 直接抓取纯文本

python scripts/scrape_web.py --url "https://example.com"

2) 使用 CSS 选择器

python scripts/scrape_web.py --url "https://example.com" --selector "title::text"

3) 保存到文件

python scripts/scrape_web.py --url "https://example.com" --out "output.txt"

Files

2 total
Select a file
Select a file to preview.

Comments

Loading comments…