Pymupdf Pdf Parser Clawdbot Skill Hardened

PassAudited by ClawScan on Apr 29, 2026.

Overview

The skill is internally consistent: it performs local PDF parsing with PyMuPDF, requests no credentials or unusual installs, and its instructions and code align with the stated purpose.

This skill appears to do exactly what it says, but check these practical points before use: - Inspect the scripts locally (you already have the source); the code does not perform network I/O or spawn shells. - Install PyMuPDF from the official source (pip install pymupdf) and be mindful of Nix libstdc++ notes if on NixOS. - Outputs are written as cleartext under the outroot directory and will overwrite existing files with the same path; ensure you choose an appropriate outroot and back up any important outputs. - The script uses pdf_path.stem for output folder names — different PDFs with identical basenames (from different directories) will produce the same per-document name under a shared outroot, which can cause confusion or collisions. - The SKILL.md guardrails forbid sending parsed content to external endpoints and require confirmation for bulk processing; those are documentation-level protections — if you or another automated agent instructs the skill to exfiltrate data, that would be a user action outside the code itself. Run the skill in a trusted environment when parsing sensitive documents.