Office → Markdown Skill

PassAudited by VirusTotal on May 7, 2026.

Overview

Type: OpenClaw Skill Name: office-to-markdown Version: 1.0.1 The office-to-markdown skill is a well-documented tool for converting various document formats (PDF, DOCX, PPTX, XLSX, CSV) into Markdown. It uses a two-pass strategy for text extraction and optional OCR via the Anthropic Vision API, which explicitly requires user confirmation as per the SKILL.md instructions. The Python scripts (e.g., pdf-to-md.py, docx-to-md.py) manage their own dependencies by installing pinned versions into an isolated temporary directory (/tmp/office_md_deps/), and the external network calls to api.anthropic.com are strictly aligned with the stated purpose of document processing.

Findings (0)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

Note

ASI04: Agentic Supply Chain Vulnerabilities

What this means

Running the skill may download and install Python packages from the package ecosystem in order to perform conversions.

Why it was flagged

The converter installs third-party packages at runtime into a temp directory. The packages are pinned and the behavior is disclosed, so this is a supply-chain note rather than a concern.

Skill content

subprocess.run([sys.executable, "-m", "pip", "install", "--quiet", "--target", str(_DEP_DIR), "pdfplumber==0.11.4", "pymupdf==1.24.14"], check=True)

Recommendation

Run it in a trusted environment, keep network/package policy in mind, and prefer preinstalled or hash-verified dependencies if stricter supply-chain control is needed.

Note

ASI07: Insecure Inter-Agent Communication

What this means

If vision extraction is approved, scanned pages or embedded images from the document may leave the local workspace and be sent to Anthropic for processing.

Why it was flagged

The skill may transmit document images to an external provider for OCR. This is clearly disclosed and consent-gated, but the documents could contain sensitive information.

Skill content

For scanned or image-only content, pages are sent to Anthropic's vision API. **Always ask the user for confirmation before enabling vision**

Recommendation

Only approve vision extraction for documents you are comfortable sending to Anthropic; otherwise use the text-only output and leave image-only pages skipped.