Wechat Mp Fetch

v1.1.1

WeChat Official Account article fetcher — extracts title, body text, and final URL from mp.weixin.qq.com links via Playwright. 微信公众号文章抓取工具,提取标题、正文、原始URL,支持重定...

1· 223·0 current·0 all-time
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description match the code and files: the scripts use Playwright to render mp.weixin.qq.com pages and extract title/content/URL. Required dependencies (playwright, commander) are consistent with this purpose.
Instruction Scope
SKILL.md instructs local npm install and running the included scripts; runtime instructions only navigate to the provided URL, wait for #js_content, extract text, and print formatted output. The instructions do not read unrelated files or environment variables nor send data to external endpoints.
Install Mechanism
There is no install spec in the skill bundle (instruction-only), but SKILL.md instructs users to run npm install and npx playwright install chromium. This is expected for a Playwright-based tool but requires downloading Playwright and Chromium binaries (large network download). package-lock.json points to an npm mirror (registry.npmmirror.com) which is a registry mirror — expected but worth noting.
Credentials
The skill declares no required env vars, primary credential, or config paths and the code does not access environment secrets. No credential or unrelated service access is requested.
Persistence & Privilege
Skill does not request permanent/always-on privileges (always:false). It is user-invocable and may be invoked autonomously by the agent by default, which is normal for skills; it does not modify other skills or system-wide configs.
Assessment
This skill appears coherent and does what it claims: a local Playwright scraper for mp.weixin.qq.com articles. Before installing, be aware it requires Node.js >=18, will download Playwright and a Chromium browser (large download), and will load/execute page JavaScript from the sites you target (normal for a renderer but worth considering if running in a sensitive environment). Only provide article URLs you trust and avoid supplying any credentials; consider running the tool in an isolated container or VM if you are concerned about running arbitrary webpage content. The package-lock points to an npm mirror (registry.npmmirror.com) for dependency resolution — not inherently malicious but verify your expected registry if you require strict provenance.

Like a lobster shell, security has layers — review code before you run it.

latestvk979vgf6t20cenppj5bbzc1b5x84ke52
223downloads
1stars
4versions
Updated 1w ago
v1.1.1
MIT-0

📖 WeChat Article Fetch | 微信公众号文章抓取

Extract article title, body text, and original URL from WeChat Official Account links (mp.weixin.qq.com)


✨ Features | 功能

  • 🎯 Title Extraction — Extracts article title from rendered page
  • 📝 Body Text — Extracts clean text content from #js_content
  • 🔗 URL Resolution — Handles redirects, returns final canonical URL
  • 🌐 Full Rendering — Uses Playwright/Chromium for JS-heavy pages
  • 🔒 Privacy First — 100% local, no data uploaded anywhere

🚀 Quick Start

Installation

cd <skill-path>
npm install
npx playwright install chromium

Usage

node scripts/wx-article-fetch.js "https://mp.weixin.qq.com/s/xxxxx"

Output

{
  "success": true,
  "title": "文章标题",
  "content": "正文内容...",
  "url": "https://mp.weixin.qq.com/s/xxxxx"
}

💡 Usage Scenarios | 使用场景

ScenarioDescription场景
📚 Content ArchivalSave articles for offline reading文章离线保存
📝 Note-takingConvert articles to notes文章转笔记
🔍 ResearchBatch collect article content批量采集资料
✍️ Writing ReferenceExtract key info for writing写作素材收集
🔄 Content RepurposingExtract text for rewriting内容再创作

⚠️ Known Limitations | 已知限制

IssueDescription
🔐 Login RequiredSome articles require WeChat login
💰 Paid ContentPaywalled articles cannot be fetched
🔒 Private AccountsPrivate official accounts inaccessible
🖼️ ImagesCurrently extracts text only; images keep original URLs

🔧 Technical Details | 技术细节

  • Rendering Engine: Playwright + headless Chromium
  • Content Selector: #js_content container
  • No External APIs: All processing is 100% local
  • Browser Mode: Headless, no UI, no state leakage

🛡️ Security & Privacy

  • 100% Local — All operations run in local browser, no external server
  • No Login Required — No WeChat credentials needed
  • No Data Storage — Content exists only in caller's session
  • No Tracking — No analytics, no telemetry, no third-party deps

📁 Project Structure

wechat-mp-fetch/
├── _meta.json
├── SKILL.md
├── package.json
├── package-lock.json
└── scripts/
    └── wx-article-fetch.js    # Main script

🔗 Links

Comments

Loading comments...