Web Scraping Tool Selection Strategy

v1.0.0

如何选择合适的网页抓取工具进行数据采集。当用户提到网页抓取、数据采集、爬虫、自动化测试、浏览器自动化、网站监控、竞品分析、价格监控、评论抓取、社交媒体数据分析、电商数据采集、小红书/知乎/京东/淘宝/1688抓取、结构化数据提取、反爬绕过、浏览器复用、API抓取、实时数据监控等场景时使用此技能。包含opencli...

0· 134·0 current·0 all-time
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
The skill's name and description match the instructions: it is a tool-selection strategy between opencli and playwright-cli. It does not request unrelated credentials or binaries. Minor inconsistency: SKILL.md references companion scripts/files (e.g., scripts/web_scraping_validator, references/platform_mapping_table) that are not present in the file manifest—this is a documentation/packaging omission but does not imply malicious behavior.
Instruction Scope
Instructions stay on-topic (how to choose and invoke opencli/playwright-cli). They explicitly recommend reusing logged-in Chrome browser state to access post-login data and to bypass anti-bot measures; while coherent for the stated purpose, this step can expose private account data if performed automatically or without care. The skill does not instruct the agent to read arbitrary system files or exfiltrate data to external endpoints, but following its guidance requires elevated access to a browser profile/session outside the skill's own control.
Install Mechanism
No install spec and no code files to execute — instruction-only skill. This minimizes surface area: nothing is downloaded or written by the skill itself.
Credentials
The skill declares no required env vars or credentials (proportional). However it implicitly depends on user-managed credentials/sessions (logged-in browser state and site accounts). That dependence is reasonable for the guidance given, but users should not hand over browser profiles, cookies, or credentials to untrusted agents.
Persistence & Privilege
The skill is not always-enabled and makes no requests to modify other skills or system configuration. Autonomous invocation is allowed by platform default but the skill does not request elevated persistent privileges.
Assessment
This skill is a coherent, instruction-only guide for choosing opencli vs playwright-cli. Before using it: 1) Verify you will manually install and review opencli/playwright-cli from official sources (don’t run unknown installers). 2) Be cautious about reusing logged-in browser state — don’t give an agent access to your browser profile, cookies, or passwords unless you explicitly trust the environment; doing so can expose private account data. 3) The SKILL.md mentions companion scripts that aren’t bundled here—inspect any such scripts before running. 4) Ensure your scraping activities comply with target sites’ terms of service and applicable laws. 5) Prefer manual review and least-privilege testing (use throwaway accounts or isolated browser profiles) when validating the recommended commands.

Like a lobster shell, security has layers — review code before you run it.

latestvk978xy0z4szed2me7sssa9xgr584h9bq

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Runtime requirements

🕷️ Clawdis

Comments