Install
openclaw skills install markitdown-skill
OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), HTML, YouTube.
Note: This skill provides documentation and a batch script. The actual conversion is done by the markitdown CLI/library, installed via pip.
Use markitdown for:
# Convert file to markdown
markitdown document.pdf -o output.md
# Convert URL
markitdown https://example.com/docs -o docs.md
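The same two commands can be driven from a script. The following is a minimal Python sketch, assuming the `markitdown` executable is on `PATH`; it uses only the `-o` flag shown above, and the helper names `build_command` and `convert` are illustrative, not part of MarkItDown or this skill.

```python
import subprocess


def build_command(source, output):
    """Return the argv list for `markitdown <source> -o <output>`."""
    return ["markitdown", str(source), "-o", str(output)]


def convert(source, output):
    """Run the CLI; raises CalledProcessError on a non-zero exit status."""
    subprocess.run(build_command(source, output), check=True)
```

Calling `convert("document.pdf", "output.md")` or `convert("https://example.com/docs", "docs.md")` would then mirror the commands above. Passing an argument list rather than a shell string avoids quoting problems with file names that contain spaces.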
| Format | Features |
|---|---|
| PDF | Text extraction, structure |
| Word (.docx) | Headings, lists, tables |
| PowerPoint | Slides, text |
| Excel | Tables, sheets |
| Images | OCR + EXIF metadata |
| Audio | Speech transcription |
| HTML | Structure preservation |
| YouTube | Video transcription |
The skill requires Microsoft's markitdown CLI:
pip install 'markitdown[all]'
Or install specific formats only:
pip install 'markitdown[pdf,docx,pptx]'
markitdown https://github.com/user/repo/blob/main/README.md -o readme.md
markitdown document.pdf -o document.md
# Using included script
python ~/.openclaw/skills/markitdown/scripts/batch_convert.py docs/*.pdf -o markdown/ -v
# Or shell loop
for file in docs/*.pdf; do
  markitdown "$file" -o "${file%.pdf}.md"
done
from markitdown import MarkItDown
md = MarkItDown()
result = md.convert("document.pdf")
print(result.text_content)
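The Python API can also be looped over many files. The sketch below is one possible shape and is not the skill's `batch_convert.py`: the helper names are hypothetical, and the converter is passed in as a callable so that a single failing file does not abort the run. It assumes `result.text_content` holds the Markdown, as in the snippet above.

```python
from pathlib import Path


def output_path_for(source, out_dir):
    """Map e.g. docs/report.pdf to <out_dir>/report.md."""
    return Path(out_dir) / (Path(source).stem + ".md")


def convert_all(sources, out_dir, convert):
    """Convert each source with `convert` (path -> Markdown text).

    Returns (written, failed): the output paths that were written, and
    (source, error message) pairs for the files that raised.
    """
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    written, failed = [], []
    for source in sources:
        try:
            text = convert(str(source))
        except Exception as exc:  # keep going; report the failure afterwards
            failed.append((str(source), str(exc)))
            continue
        dest = output_path_for(source, out_dir)
        dest.write_text(text, encoding="utf-8")
        written.append(dest)
    return written, failed


def markitdown_converter():
    """Build a converter backed by MarkItDown (imported lazily)."""
    from markitdown import MarkItDown

    md = MarkItDown()
    return lambda path: md.convert(path).text_content
```

A call such as `convert_all(sorted(Path("docs").glob("*.pdf")), "markdown", markitdown_converter())` would write one `.md` file per PDF and return the failures for later inspection. Note that files sharing a stem (for example `a/report.pdf` and `b/report.pdf`) would overwrite each other in this sketch.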
pip install 'markitdown[all]'
# Ubuntu/Debian
sudo apt-get install tesseract-ocr
# macOS
brew install tesseract
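After installing, it can help to confirm that both executables are actually on `PATH`. The following is a small sketch using only the Python standard library; `missing_tools` is a hypothetical helper, and the tool names are taken from the install commands above.

```python
import shutil


def missing_tools(tools=("markitdown", "tesseract")):
    """Return the names from `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    print("Missing: " + ", ".join(missing) if missing else "All tools found.")
```

This only checks that the commands can be located; it does not verify versions or that optional Python dependencies are installed.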
| Component | Source |
|---|---|
| markitdown CLI | Microsoft's pip package |
| markitdown Python API | Microsoft's pip package |
| scripts/batch_convert.py | This skill (utility) |
| Documentation | This skill |