Office → Markdown Skill
Pass. Audited by VirusTotal on May 7, 2026.
Overview
Type: OpenClaw Skill
Name: office-to-markdown
Version: 1.0.1
The office-to-markdown skill is a well-documented tool for converting PDF, DOCX, PPTX, XLSX, and CSV documents into Markdown. It uses a two-pass strategy for text extraction and optional OCR via the Anthropic Vision API; the SKILL.md instructions explicitly require user confirmation before OCR is enabled. The Python scripts (e.g., pdf-to-md.py, docx-to-md.py) manage their own dependencies by installing pinned versions into an isolated temporary directory (/tmp/office_md_deps/). The external network calls to api.anthropic.com are strictly aligned with the stated purpose of document processing.
Findings (0)
Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.
Running the skill may download and install Python packages from the package ecosystem in order to perform conversions.
The converter installs third-party packages at runtime into a temp directory. The packages are pinned and the behavior is disclosed, so this is a supply-chain note rather than a concern.
subprocess.run([sys.executable, "-m", "pip", "install", "--quiet", "--target", str(_DEP_DIR), "pdfplumber==0.11.4", "pymupdf==1.24.14"], check=True)
Run it in a trusted environment, keep network/package policy in mind, and prefer preinstalled or hash-verified dependencies if stricter supply-chain control is needed.
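Where stricter supply-chain control is needed, the pinned install quoted above could be swapped for a hash-checked one. The following is a minimal sketch under stated assumptions, not the skill's actual code: the `build_install_cmd` helper and the requirements file name are hypothetical, while `--require-hashes`, `--no-deps`, `--target`, and `-r` are standard pip options.

```python
import sys
from pathlib import Path

# The report names /tmp/office_md_deps/ as the skill's dependency directory;
# reusing it here is an assumption made for illustration.
DEP_DIR = Path("/tmp/office_md_deps")


def build_install_cmd(requirements: Path, target: Path = DEP_DIR) -> list[str]:
    """Build a pip command that only accepts packages matching recorded hashes.

    Hypothetical helper: it returns the argument list and does not run pip.
    """
    return [
        sys.executable, "-m", "pip", "install",
        "--quiet",
        "--require-hashes",  # reject any package whose hash is not in the file
        "--no-deps",         # install only what the file lists (include transitive deps there)
        "--target", str(target),
        "-r", str(requirements),
    ]


if __name__ == "__main__":
    # Print instead of executing, so the sketch has no side effects.
    print(" ".join(build_install_cmd(Path("requirements.txt"))))
```

In this setup, each line of the requirements file would pair a pinned version with its sha256 digest, so a tampered or substituted package fails the install rather than being loaded by the converter.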
If vision extraction is approved, scanned pages or embedded images from the document may leave the local workspace and be sent to Anthropic for processing.
The skill may transmit document images to an external provider for OCR. This is clearly disclosed and consent-gated, but the documents could contain sensitive information.
For scanned or image-only content, pages are sent to Anthropic's vision API. **Always ask the user for confirmation before enabling vision**
Only approve vision extraction for documents you are comfortable sending to Anthropic; otherwise use the text-only output and leave image-only pages skipped.
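The consent gate described above can be pictured as a simple routing decision made before any network call. This is an illustrative sketch only: the `Page` type and `plan_pages` function are hypothetical names, not taken from the skill, and the code makes no API requests.

```python
from dataclasses import dataclass


@dataclass
class Page:
    number: int
    text: str  # text recovered by local extraction; empty for image-only pages


def plan_pages(pages: list[Page], vision_approved: bool) -> dict[str, list[int]]:
    """Sort page numbers into local text, vision OCR candidates, and skipped pages."""
    plan: dict[str, list[int]] = {"text": [], "vision": [], "skipped": []}
    for page in pages:
        if page.text.strip():
            plan["text"].append(page.number)     # handled locally, never uploaded
        elif vision_approved:
            plan["vision"].append(page.number)   # would leave the local workspace
        else:
            plan["skipped"].append(page.number)  # stays local, no OCR attempted
    return plan


if __name__ == "__main__":
    pages = [Page(1, "Quarterly report"), Page(2, ""), Page(3, "   ")]
    print(plan_pages(pages, vision_approved=False))
```

With `vision_approved=False`, image-only pages land in `skipped` and nothing is queued for upload, which matches the text-only fallback recommended above.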
