Web Scraper Jina

Bypass Cloudflare and scrape any website using the r.jina.ai API. Works on sites with strong protection, such as Truth Social and sites behind Cloudflare Turnstile.

MIT-0 · Free to use, modify, and redistribute. No attribution required.

⭐ 0 · 641 · 5 current installs · 5 all-time installs

by @itonlyforfun-AI

Security Scan

VirusTotal: Benign

OpenClaw: Suspicious (medium confidence)

ℹ

Purpose & Capability

The name/description match the SKILL.md: the skill simply tells the agent to prepend https://r.jina.ai/ to target URLs to retrieve content. The functionality claimed (scraping protected sites) is entirely delegated to the third-party r.jina.ai service; the skill itself has no code, installs, or extra credentials.

Instruction Scope

The instructions explicitly advise bypassing Cloudflare, Turnstile, and other protections and list targeted sites (Truth Social, etc.). While the instructions do not request local files, credentials, or system access, they directly instruct circumvention of access controls and encourage potentially terms-violating or illegal scraping behavior.

✓

Install Mechanism

Instruction-only skill with no install spec and no code files — nothing is written to disk by the skill itself, which is low technical risk in terms of install mechanism.

✓

Credentials

No environment variables, credentials, or config paths are requested — the skill does not ask for secrets or unrelated permissions.

✓

Persistence & Privilege

No elevated privileges or always-on behavior requested (always: false). The skill does not attempt to modify other skills or system settings.

What to consider before installing

This skill is essentially a short how-to that tells the agent to use a third-party scraping proxy (r.jina.ai) to fetch pages, including ones protected by anti-bot measures. Before installing, consider:

1) Provenance: the publisher and homepage are unknown, which increases trust risk.
2) Legality and ToS: intentionally circumventing protections can violate site terms of service or laws in some jurisdictions.
3) Privacy: fetched content may include private data or trigger rate-limiting/blocks on your account.
4) Dependency on a third-party proxy: your requests go through r.jina.ai, so review their terms and privacy policy.
5) Safer alternatives: prefer official APIs, site-provided feeds, or getting explicit permission.

If you still want to use it, avoid supplying credentials, limit use to public content you have permission to access, and test in a controlled environment. If you need higher assurance, ask the publisher for provenance or request a version that uses a maintainer-trusted backend or official APIs.

Current version: v1.0.1

latest: vk9798jmv77dq16npdy48qvwchd82ccs9

License

MIT-0

Terms: https://spdx.org/licenses/MIT-0.html

SKILL.md

Web Scraper using r.jina.ai

Bypass Cloudflare and scrape any website using the free r.jina.ai API.

Features

Bypass Cloudflare, Turnstile, and other protections
Works on Truth Social, Bitget, and other protected sites
Returns clean Markdown content
Free to use

Usage

Simply prepend https://r.jina.ai/ to any URL:

https://r.jina.ai/https://truthsocial.com/@realDonaldTrump
https://r.jina.ai/https://bitget.com/events/poolx
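The prefixing rule above can be sketched as a small helper. This is a minimal illustration, not part of the skill: the names READER_PREFIX and reader_url are hypothetical, and the check for an absolute http(s) URL is an assumption added here so that a bare hostname fails early instead of producing a malformed request.

```python
from urllib.parse import urlsplit

# Prefix taken from the Usage section; the constant name is hypothetical.
READER_PREFIX = "https://r.jina.ai/"


def reader_url(target: str) -> str:
    """Return the r.jina.ai URL for an absolute http(s) target URL."""
    target = target.strip()
    parts = urlsplit(target)
    # Assumption: only absolute http(s) URLs are accepted by this helper.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"expected an absolute http(s) URL, got {target!r}")
    return READER_PREFIX + target
```

For example, reader_url("https://bitget.com/events/poolx") yields the second URL listed above, while reader_url("example.com") raises ValueError.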

Examples

Get Trump Truth Social posts:

curl -s "https://r.jina.ai/https://truthsocial.com/@realDonaldTrump"

Get any protected page:

curl -s "https://r.jina.ai/https://example.com"

In Code

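import requests

def scrape(url):
    # Prepend the r.jina.ai prefix to the target URL and return the response body.
    response = requests.get(f"https://r.jina.ai/{url}", timeout=30)
    response.raise_for_status()  # raise on HTTP errors instead of returning an error page
    return response.text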
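When several pages are fetched in a row, a pause between requests may help avoid rate limiting. The sketch below is one possible way to do that and is not part of the skill: scrape_many, fetch_text, and the delay_seconds default are hypothetical. It uses the standard library's urllib.request rather than requests to stay dependency-free, and whether the service accepts that client's default headers is an assumption.

```python
import time
import urllib.request
from typing import Callable, Dict, Iterable

# Prefix taken from the Usage section; the constant name is hypothetical.
READER_PREFIX = "https://r.jina.ai/"


def fetch_text(reader_url: str) -> str:
    """Fetch one URL and decode the body as UTF-8 (hypothetical helper)."""
    with urllib.request.urlopen(reader_url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def scrape_many(
    urls: Iterable[str],
    fetch: Callable[[str], str] = fetch_text,
    delay_seconds: float = 1.0,  # assumed pause; tune to the service's limits
) -> Dict[str, str]:
    """Fetch each target through the prefix, sleeping between requests."""
    results: Dict[str, str] = {}
    for index, url in enumerate(urls):
        if index > 0:
            time.sleep(delay_seconds)  # pause before every request after the first
        results[url] = fetch(READER_PREFIX + url)
    return results
```

Passing a different fetch callable makes it possible to swap in requests, or a stub for offline testing, without changing the loop.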

Use Cases

Scrape Truth Social, Gab, Gettr
Bypass Cloudflare-protected sites
Extract content from news articles
Monitor competitors

Files

1 total
