PLS URL to Markdown

Fetch a URL and convert its web page content into clean Markdown for research, documentation, or knowledge base creation.

Audits

Pass

Install

openclaw skills install pls-url-to-markdown

URL to Markdown Converter

Fetches URLs and converts web pages to clean Markdown.

Quick Start

Python Method (markdownify)

pip install requests beautifulsoup4 markdownify

python3 -c "... fetching and converting URL ..."

CLI Tools (html2text, pandoc)

curl -s URL | html2text
wget -q -O - URL | pandoc -f html -t markdown

Full Extraction Script

import requests
from bs4 import BeautifulSoup
from markdownify import markdownify as md

def url_to_markdown(url, output_file=None):
    # ... fetch, parse, convert logic ...
    pass

Content Extraction Patterns

Extract Article Body

def extract_article(html):
    soup = BeautifulSoup(html, 'html.parser')
    article = soup.find('article') or soup.find('main')
    return md(str(article)) if article else None

Preserve Code Blocks

def preserve_code(html):
    # ... logic to wrap code in ``` ...
    pass

CLI Usage

python url_to_markdown.py URL -o output.md

Error Handling

def safe_fetch(url, retries=3):
    # ... retry logic ...
    pass