habib-pdf-to-json
Analysis
This is a coherent PDF extraction instruction skill, but users should install its PDF/OCR dependencies carefully and run it only on documents they intend to process.
Findings (3)
Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.
Checks for instructions or behavior that redirect the agent, misuse tools, execute unexpected code, cascade across systems, exploit user trust, or continue outside the intended task.
pip install pdfplumber pandas openpyxl ... pip install pytesseract pdf2image ... pip install pypdf
The skill asks users to install unpinned third-party PDF/OCR Python packages. This is expected for a PDF extraction skill, but it changes the local Python environment and depends on package-source trust.
with pdfplumber.open("construction_spec.pdf") as pdf: ... df.to_excel("extracted_data.xlsx", index=False)The examples read local PDF files and write extracted data to local output files. This is central to the skill's purpose, but users should be aware that document contents may be copied into new files.
"ownerId": "kn75fhjxn1jz5xbgd9ggj0nrtd80q1dz", "slug": "pdf-to-structured"
The bundled metadata uses a different owner ID and slug than the registry metadata supplied for the review. This is a provenance/identity inconsistency, although no hidden code or harmful behavior is shown.
