zt-web-fetcher

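Use when you need to fetch web page content (search result pages, blog posts, documentation, etc.). The skill uses URL-to-Markdown services to convert a web page into readable text. Trigger scenarios: the user says "look this up for me", "take a look at this link", "get information about xx", or otherwise needs information pulled from the web.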

MIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotal: Benign
OpenClaw: Benign (high confidence)
Purpose & Capability
Name/description match the SKILL.md: the skill converts webpages to Markdown using r.jina.ai, markdown.new, and defuddle.md, and suggests a Scrapling fallback if those fail. It requests no credentials, binaries, or config paths, which is consistent with its stated purpose.
Instruction Scope
Instructions stay within the scraping/formatting scope, but they direct the agent to fetch webpages via third-party URL-to-Markdown services (r.jina.ai, markdown.new, defuddle.md). Those services will receive the target URL (and will fetch page content), so private pages or sensitive URLs would be exposed to third parties. The SKILL.md also suggests 'pip install scrapling' as a fallback—this would cause code to be installed/executed in the runtime environment and should be vetted before use. There is a minor internal inconsistency in which service is 'preferred' vs the ordered try sequence.
Install Mechanism
There is no formal install spec (instruction-only), which is low risk. The fallback recommendation to 'pip install scrapling' points to a GitHub project; installing that package would execute third-party code from a repo (moderate risk) and should be reviewed before installation. The primary flows (using URL-to-Markdown endpoints) do not require installing code.
Credentials
No environment variables, credentials, or config paths are requested. The skill does not ask for unrelated secrets or high-privilege access.
Persistence & Privilege
always is false and there is no install-time modification or persistent configuration. The skill does not request elevated or permanent presence beyond normal invocation.
Assessment
This skill is coherent for fetching and converting webpages, but be aware of two practical risks before using it: (1) The URL-to-Markdown services (r.jina.ai, markdown.new, defuddle.md) will fetch the given URL themselves, so do not send private, authenticated, or sensitive URLs you don't want a third party to see. (2) The suggested fallback ('pip install scrapling' from GitHub) would run third-party code in your environment—review that repository and prefer installing only vetted packages. If you need to fetch pages behind login or that contain sensitive data, do not rely on these public converters; instead use a controlled, audited crawler or provide the content directly.
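
The first risk above can be partly checked before a URL is handed to a public converter. The following is a minimal sketch, not part of the skill itself: the function name `safe_for_public_converter` and the `.local` / `.internal` suffix heuristics are assumptions made for illustration. It cannot detect secret tokens in query strings or pages that require login.

```python
import ipaddress
from urllib.parse import urlsplit


def safe_for_public_converter(url: str) -> bool:
    """Heuristic (hypothetical helper): False when the URL looks private
    or carries embedded credentials, True otherwise."""
    parts = urlsplit(url)
    # Only absolute http(s) URLs are candidates at all.
    if parts.scheme not in ("http", "https") or not parts.hostname:
        return False
    # user:password@host style URLs would leak credentials to the converter.
    if parts.username or parts.password:
        return False
    host = parts.hostname.lower()
    # Assumed naming conventions for machine-local or intranet hosts.
    if host == "localhost" or host.endswith((".local", ".internal")):
        return False
    try:
        address = ipaddress.ip_address(host)
    except ValueError:
        # A regular hostname; this sketch does no further checks.
        return True
    # Literal IPs must be publicly routable (rejects 10.x, 192.168.x, 127.x, ...).
    return address.is_global
```

A caller would skip the converters, and supply the content directly, whenever this returns False.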

Current version: v1.0.0
Tags: fetch, latest, markdown, scraper, web

SKILL.md

Web Page Fetching

When you need to fetch web page content, try the following in order:

Preferred approach: URL-to-Markdown services

  1. markdown.new/ (recommended)

    • Usage: prepend https://markdown.new/ to the URL
    • Example: https://markdown.new/https://example.com
    • Suited to websites supported by Cloudflare
  2. r.jina.ai/ (fallback)

    • Usage: prepend https://r.jina.ai/ to the URL
    • Example: https://r.jina.ai/https://example.com
    • Good compatibility
  3. defuddle.md/ (fallback)

    • Usage: prepend https://defuddle.md/ to the URL
    • Example: https://defuddle.md/https://example.com
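
All three services share the same convention: the service prefix followed by the full target URL. A minimal sketch of that construction is shown here; the helper name `to_converter_url` and the http(s) validation step are assumptions added for illustration, not something the skill prescribes.

```python
from urllib.parse import urlsplit

# Converter prefixes from the list above, in the order they are listed.
CONVERTER_PREFIXES = (
    "https://markdown.new/",
    "https://r.jina.ai/",
    "https://defuddle.md/",
)


def to_converter_url(prefix: str, target_url: str) -> str:
    """Return the service prefix followed by the full target URL."""
    parts = urlsplit(target_url)
    # Assumed sanity check: the target must be an absolute http(s) URL.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"expected an absolute http(s) URL, got {target_url!r}")
    # Normalize so exactly one slash separates the prefix from the target.
    return prefix.rstrip("/") + "/" + target_url
```

For example, `to_converter_url("https://r.jina.ai/", "https://example.com")` gives `https://r.jina.ai/https://example.com`, matching the example in the list.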

Fallback approach: scraping tools

If none of the services above can fetch the page, try Scrapling ('pip install scrapling').

Workflow

  1. First try r.jina.ai/{url} (most stable)
  2. If that fails, try markdown.new/{url}
  3. If that also fails, try defuddle.md/{url}
  4. Only if all of them fail, consider Scrapling
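
The first three steps of this workflow can be sketched as a simple fallback loop. This is an illustrative sketch only: the names `http_get` and `fetch_markdown`, the timeout, the User-Agent string, and the rule that an empty response counts as a failure are all assumptions, and the Scrapling step is left to the caller.

```python
import urllib.request
from typing import Callable, Optional

# Order taken from the workflow steps above.
FALLBACK_ORDER = (
    "https://r.jina.ai/",
    "https://markdown.new/",
    "https://defuddle.md/",
)


def http_get(url: str, timeout: float = 20.0) -> str:
    """Fetch a URL and return its body as text (raises on HTTP or network errors)."""
    request = urllib.request.Request(url, headers={"User-Agent": "zt-web-fetcher-sketch"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def fetch_markdown(target_url: str, get: Callable[[str], str] = http_get) -> Optional[str]:
    """Try each converter in order; return the first non-empty result, else None."""
    for prefix in FALLBACK_ORDER:
        try:
            text = get(prefix + target_url)
        except Exception:
            # Any failure (HTTP error, timeout, DNS) moves on to the next service.
            continue
        if text.strip():
            return text
    # All converters failed; the caller may then consider Scrapling (step 4).
    return None
```

The `get` parameter is injectable so the loop can be exercised without network access, for instance by passing a stub that raises for the first service and returns text for the second.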

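Notes

  • No search API needs to be configured
  • These services convert web pages to plain text / Markdown
  • Works for most static web pages
  • May not work for pages that require login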
