pdf-extract-skill
Analysis
This skill is coherently focused on local PDF extraction, with reasonable cautions needed around installing the external PDF tool, handling sensitive PDF outputs, and keeping the optional hybrid backend local.
Findings (3)
Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.
Checks for instructions or behavior that redirect the agent, misuse tools, execute unexpected code, cascade across systems, exploit user trust, or continue outside the intended task.
Source: unknown; Homepage: none; No install spec — this is an instruction-only skill; Required binaries: java, python3, opendataloader-pdf
The skill depends on external local binaries/packages, but the registry metadata does not provide a source or install specification, so users need to verify the package before installing or running it.
Checks for exposed credentials, poisoned memory or context, unclear communication boundaries, or sensitive data that could leave the user's control.
opendataloader-pdf-hybrid --port 5002
Hybrid/OCR mode starts a backend listener on a local port; this is disclosed and purpose-aligned, but sensitive PDFs should only be sent to a backend that is locally bound and trusted.
RAG and LLM-ready outputs (json + markdown).
The skill is designed to transform PDF contents into reusable RAG/LLM context files, which may preserve sensitive document text and metadata.
