微信公众号文章抓取 (Jeff版)

v1.0.0

微信公众号文章抓取工具。将微信公众号文章转换为 Markdown 格式,支持图片本地下载。当用户提到抓取微信公众号文章、提取公众号内容、爬取微信文章时触发。

0· 105·0 current·0 all-time
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description match the code and scripts: the package fetches rendered WeChat pages (camoufox), parses content, converts to Markdown, and optionally downloads images. The use of a stealth browser (camoufox) and an MCP server is coherent for anti-detection scraping and AI integration.
Instruction Scope
SKILL.md instructs the agent to run the included CLI or MCP server and to write outputs into an output directory. The runtime instructions and code operate on article URLs and local output paths only; they do not ask the agent to read unrelated files, environment variables, or exfiltrate data to unexpected endpoints.
Install Mechanism
The registry contains no formal install spec (instruction-only), but the repository includes requirements.txt and uses camoufox which will auto-download a browser binary on first run. Running the skill will therefore pull packages (pip) and possibly large browser artifacts from the internet — review and run in a controlled environment if you do not trust those origins.
Credentials
The skill declares no required environment variables or credentials and the code does not access secrets. Network access is required to fetch WeChat pages and images (expected for a scraper). There are minor dependency-list mismatches (SKILL.md lists aiohttp while requirements.txt lists mcp), but nothing requesting unrelated credentials.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or system-wide configs. It exposes an MCP server (stdio transport) so an AI client can invoke its tools; autonomous invocation is the platform default and not a concern by itself. Outputs are written to the provided output directory (and debug HTML when parsing fails).
Assessment
This skill appears to do what it says: fetch WeChat article pages in a headless browser, convert HTML to Markdown, and download images into an output folder. Before installing or running: - Be aware it performs live web requests and will download a camoufox browser binary on first run — run it in an isolated environment if you want to limit risk. - You may need to pip-install requirements (requirements.txt) before running; confirm dependency sources. - Running the MCP server exposes conversion tools to any MCP-capable client (over stdio by default); that lets an AI invoke scraping and write files — only enable the server for trusted clients. - If you need stricter assurance, review the camoufox package and its download behavior (where the browser binary comes from) and run the tool manually on a sample URL in a safe workspace first.

Like a lobster shell, security has layers — review code before you run it.

latestvk97e5746wagvahdvtq400yhx6n838ccn

License

MIT-0
Free to use, modify, and redistribute. No attribution required.

Comments