Install
openclaw skills install markdown-new-crawlUse `https://markdown.new/crawl/{target_url}` endpoints to recursively crawl a site section and return markdowns. Trigger this skill when the user asks for multi-page extraction, whole-docs crawl, link-depth crawling, or job-based crawl polling from a URL. Prefer local terminal access (`curl`) with `/crawl`, `/crawl/status/{jobId}`, and `/crawl/{url}` before other browsing methods.
openclaw skills install markdown-new-crawlUse markdown.new/crawl for multi-page crawling and async Markdown generation.
curl (or any suitable alternative tools).POST /crawl to get a job ID.GET /crawl/status/{jobId} until crawl completes.?format=json only when structured output is needed.limit, depth, subdomain/external toggles, include/exclude patterns).500 pages per job./crawl fails (network, timeout, blocked host), fall back to another method and state the fallback.curl -X POST "https://markdown.new/crawl" \
-H "Content-Type: application/json" \
-d '{"url":"https://docs.example.com","limit":50}'
curl "https://markdown.new/crawl/status/<jobId>"
curl "https://markdown.new/crawl/status/<jobId>?format=json"
curl "https://markdown.new/crawl/https://docs.example.com"
curl -X DELETE "https://markdown.new/crawl/status/<jobId>"
POST /crawl: create async crawl job and return job ID.GET /crawl/status/{jobId}: return crawl output and status.DELETE /crawl/status/{jobId}: cancel a running crawl; completed pages remain available.GET /crawl/{url}: browser-style shortcut that starts crawl and returns tracking page.url (required): crawl starting URL.limit: max pages, 1-500 (default 500).depth: max link depth, 1-10 (default 5).render: enable JS rendering for SPA pages.source: URL discovery strategy (all, sitemaps, links).maxAge: max cache age seconds (0-604800, default 86400).modifiedSince: UNIX timestamp; crawl pages modified after this time.includeExternalLinks: include cross-domain links.includeSubdomains: include subdomains.includePatterns / excludePatterns: wildcard URL filtering.GET /crawl/status/{jobId} returns concatenated Markdown by default.?format=json for per-page structured records.?retain_images=true to keep them.limit values for targeted extraction to reduce cost and runtime.