{"skill":{"slug":"afrexai-web-scraping-engine","displayName":"Web Scraping & Data Extraction Engine","summary":"Complete web scraping methodology — legal compliance, architecture design, anti-detection, data pipelines, and production operations. Use when building scrap...","tags":{"automation":"1.0.0","crawling":"1.0.0","data":"1.0.0","extraction":"1.0.0","latest":"1.0.0","pipeline":"1.0.0","proxy":"1.0.0","scraping":"1.0.0","web":"1.0.0"},"stats":{"comments":0,"downloads":1410,"installsAllTime":5,"installsCurrent":5,"stars":0,"versions":1},"createdAt":1771759763570,"updatedAt":1777525321068},"latestVersion":{"version":"1.0.0","createdAt":1771759763570,"changelog":"Initial release of the Web Scraping & Data Extraction Engine:\n\n- Provides a comprehensive methodology covering legal compliance, scraper architecture, anti-detection techniques, data pipelines, and operational best practices.\n- Includes a quick health check scoring system to assess the production readiness of scraping projects.\n- Offers detailed legal guidance, decision rules, and risk assessment based on current regulations and case law.\n- Presents an architecture decision matrix and decision tree for optimal tool selection based on site complexity and anti-bot measures.\n- Shares request engineering best practices, including header rotation, rate limiting, and retry strategies.\n- Gives practical guidance for data extraction, error handling, monitoring, storage, and scheduling for scalable web data collection.","license":null},"metadata":null,"owner":{"handle":"1kalin","userId":"s17e1q0nx23qnh4n429zzqc05x83hvsw","displayName":"1kalin","image":"https://avatars.githubusercontent.com/u/15705344?v=4"},"moderation":{"isSuspicious":true,"isMalwareBlocked":false,"verdict":"suspicious","reasonCodes":["suspicious.llm_suspicious","suspicious.vt_suspicious"],"summary":"Detected: suspicious.llm_suspicious, suspicious.vt_suspicious","engineVersion":"v2.4.5","updatedAt":1777525321068}}