Yandex Archive Scraper
v1.0.0Search and extract data from Yandex.Archive (Яндекс.Архив) — metric books, newspapers, directories. Bypasses bot protection via Scrapling.
⭐ 0· 39·0 current·0 all-time
byFlo@flobo3
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
OpenClaw
Benign
high confidencePurpose & Capability
Name/description (Yandex.Archive scraping, bypassing bot protection) align with the included Python scripts and declared dependencies (scrapling, playwright, etc.). The code only targets Yandex.Archive URLs and extracts site-specific JSON/HTML.
Instruction Scope
SKILL.md and README instruct installing the listed Python packages and using StealthyFetcher to fetch archive pages. The runtime instructions and scripts stay focused on constructing search URLs, fetching pages, and parsing results; they do not read unrelated files or environment variables.
Install Mechanism
The package is instruction-first and contains code files but no formal install spec. README suggests pip installing several packages and running 'playwright install chromium' — this will download browser binaries and execute third-party code (scrapling, browserforge). That is expected for a scraper but increases runtime footprint and risk from third-party packages.
Credentials
The skill requests no environment variables, no credentials, and accesses no system config paths. The lack of secret access is proportionate to a public-web scraping task.
Persistence & Privilege
Skill does not request always:true, does not attempt to modify other skills or agent-wide settings, and requires no persistent credentials. Autonomous invocation is allowed (platform default) but not combined with additional privileges.
Assessment
This skill appears internally consistent with its stated purpose: it fetches and parses Yandex.Archive pages and uses Scrapling/Playwright to avoid bot protections. Before installing, consider that: (1) bypassing bot protection may violate Yandex's terms of service or local law — ensure you have the right to scrape the target; (2) installing Playwright will download browser binaries and the listed Python packages (third-party code) which will run on your system — audit or sandbox the environment and verify package sources (PyPI project pages, authors); (3) run this skill in an isolated environment (container/VM) if you are concerned about third-party dependencies; and (4) no secrets are required by the skill itself, but if you modify it to integrate other services, re-evaluate requested credentials. If you want, I can list the exact package pages to review or suggest safer alternatives (site APIs, manual downloads, or permissioned data access).Like a lobster shell, security has layers — review code before you run it.
latestvk972tx8bx13vn889fk42dtpeq18434rb
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
