Data Harvester Pro
Batch web scraping for competitor analysis, price monitoring and market research
MIT-0 · Free to use, modify, and redistribute. No attribution required.
⭐ 0 · 213 · 0 current installs · 0 all-time installs
MIT-0
Security Scan
OpenClaw
Suspicious
medium confidencePurpose & Capability
Name/description promise batch web scraping, real-time stock data, competitor scraping across platforms and export capabilities, but the Python code contains only hard-coded mock data and local file reads; there are no network requests, no scraping logic, and no browser automation or scraping dependencies implemented. The declared required binary (python3) is proportional, but the functionality advertised is not implemented in the code.
Instruction Scope
SKILL.md instructs use of browser automation, CSS selectors, configurable rate limiting and exporting (openpyxl installation), but the runtime instructions and script do not perform those actions — they merely read a file and print sample/mock outputs. This mismatch could lead an agent or user to try to install/execute additional tools or to expect network access that the packaged code does not contain. SKILL.md also contains unicode-control-chars (prompt-injection) signal.
Install Mechanism
No install spec is provided and only python3 is required; nothing is downloaded or extracted. This lowers installation risk. However, the README suggests external installs (openpyxl, browser automation) that are not supplied by the package.
Credentials
The skill requests no environment variables, no credentials, and no config paths. Given the claimed functionality it might normally need API keys for target services, but none are requested — this is inconsistent with advertised live scraping but not an immediate credential risk.
Persistence & Privilege
The skill is not always-enabled and does not request elevated persistence. It contains no code that modifies other skill configs or system-wide settings.
Scan Findings in Context
[unicode-control-chars] unexpected: Control/unicode manipulation characters were detected in SKILL.md. This is not expected for a normal scraper README and could be an attempt at prompt injection or to influence parsing; review the SKILL.md raw content for invisible characters before trusting automatic processing.
What to consider before installing
This skill appears to be an incomplete or placeholder scraper: it advertises live web scraping, cross-platform competitor comparisons and exports, but the shipped Python implements only mocked data and local file reading. Before installing or using it: 1) don't provide any credentials—none are requested but the advertised features would normally need API keys or browser automation; 2) inspect the SKILL.md raw text for hidden/control characters (the scanner flagged unicode-control-chars); 3) review and run the Python file in a sandbox to verify behavior (it currently makes no network calls); 4) expect to need to manually install browser automation tools (Selenium/Playwright) and libraries like openpyxl if you want the advertised features—ask the author for a clear implementation or sources for scraping logic; 5) if you plan to run real scraping, do so in an isolated environment and confirm legal/robots.txt compliance. Given the mismatches, treat this skill as untrusted until the author provides a coherent implementation and removes suspicious hidden characters.Like a lobster shell, security has layers — review code before you run it.
Current versionv1.0.0
Download zipbatchcompetitorcrawlerdatalatest
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
Runtime requirements
🧑🌾 Clawdis
Binspython3
SKILL.md
🧑🌾 Data Harvester - Batch Scraping & Competitor Analysis
Professional-grade batch web scraping tool for competitor monitoring, price tracking, and market research.
Features
- Batch URL Scraping: Process multiple URLs from file
- Competitor Comparison: Compare products across platforms
- Stock Data: Real-time stock quotes and analysis
- Sector Trends: Hot sectors and market trends
- News Aggregation: Collect news by keywords
Commands
| Command | Description | Example |
|---|---|---|
stock <code> | Query stock data | /data-harvester stock 600519 |
block | Hot market sectors | /data-harvester block |
fund | Money flow tracking | /data-harvester fund |
news <keyword> | News search | /data-harvester news AI |
compare <product> | Competitor comparison | /data-harvester compare laptop |
batch <file> | Batch URL list | /data-harvester batch urls.txt |
export <file> <format> | Export data | /data-harvester export data.json excel |
Usage Examples
Stock Query
/data-harvester stock 000001
Returns: Real-time stock price, change %, volume
Competitor Comparison
/data-harvester compare "wireless earphones"
Returns: Price comparison across Taobao, JD, Pinduoduo
Sector Analysis
/data-harvester block
Returns: Top 7hot sectors with leader stocks
Fund Flow
/data-harvester fund
Returns: Top 5 stocks with main fund inflow
Batch Processing
Create a text file with one URL per line:
https://example1.com
https://example2.com
https://example3.com
Then run:
/data-harvester batch urls.txt
Technical Details
- Language: Python 3
- Output Formats: JSON, CSV, Excel
- Rate Limiting: Configurable delay between requests
Legal Notice
- Respect target website's robots.txt
- Use reasonable request intervals (3-5 seconds)
- For educational/research purposes only
- Commercial use requires proper authorization
Files
2 totalSelect a file
Select a file to preview.
Comments
Loading comments…
