Pdf Parser Agent

v1.0.1

Parses local PDF files into structured Markdown and JSON using opendataloader-pdf for deterministic, local document content extraction.

by Ezequiel Techera @trshdesigns
License: MIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotal: Benign
OpenClaw: Benign (high confidence)
Purpose & Capability
Name and description match the included script and SKILL.md: the skill runs a local Python-based converter (opendataloader-pdf) on local PDF files. The only minor oddity is an included package.json (Node metadata) despite this being a Python script; this appears cosmetic and does not contradict the stated purpose.
Instruction Scope
SKILL.md instructs the agent to run a local script against local PDFs and references a single dependency (opendataloader-pdf). The runtime instructions do not request unrelated files, environment variables, or external endpoints. The script does append the user's site-packages path to sys.path to locate a --user pip installation. This is reasonable for a dependency lookup, but it means the script will import whichever copy of opendataloader-pdf is installed in the user's site-packages directory.
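The sys.path behavior described above can be illustrated with a minimal sketch. This is not the skill's actual code; the helper name add_user_site_to_path is hypothetical, and only the standard-library site and sys modules are used.

```python
import site
import sys


def add_user_site_to_path() -> str:
    """Append the per-user site-packages directory to sys.path if it is missing.

    Hypothetical helper for illustration; the skill's script may do this differently.
    """
    user_site = site.getusersitepackages()
    if user_site not in sys.path:
        # Appending (rather than inserting at the front) keeps locations
        # that are already on sys.path ahead of the user site.
        sys.path.append(user_site)
    return user_site


user_site = add_user_site_to_path()
print(f"user site-packages: {user_site}")
```

Anything importable from that directory becomes visible to the script after this call, which is why the contents of the user site matter.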
Install Mechanism
No install spec is provided by the skill (instruction-only), so nothing is downloaded or written by the skill itself. Dependency installation is left to the user (pip install --user opendataloader-pdf). This is low-risk for the skill bundle, though the external Python package remains a separate trust decision.
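Since installation is left to the user, it can help to confirm what is actually installed before running the skill. The following is a small sketch using the standard-library importlib.metadata; the helper name installed_version is hypothetical, and the distribution name is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


found = installed_version("opendataloader-pdf")
if found is None:
    print("opendataloader-pdf is not installed in this environment")
else:
    print(f"opendataloader-pdf {found} is installed")
```

Recording the reported version makes it easier to audit a specific release of the dependency.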
Credentials
The skill declares no environment variables, credentials, or config paths and its code does not read secrets. It only reads a user-supplied local file path (validated to be inside the current workspace) and writes output to a specified directory — which is proportionate to the stated function.
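A workspace containment check of the kind described above could look like the sketch below. It is an assumption about the general technique, not a copy of the skill's validation; the function name resolve_inside_workspace is hypothetical.

```python
from pathlib import Path


def resolve_inside_workspace(candidate: str, workspace: Path) -> Path:
    """Resolve a user-supplied path and reject it if it escapes the workspace.

    Resolving first collapses ".." segments and follows symlinks, so the
    comparison is made on the real location rather than the raw string.
    """
    root = workspace.resolve()
    target = (root / candidate).resolve()
    try:
        target.relative_to(root)
    except ValueError:
        raise ValueError(f"{candidate!r} is outside the workspace {root}") from None
    return target


# Example: a file directly inside the current working directory is accepted.
print(resolve_inside_workspace("report.pdf", Path.cwd()))
```

An absolute path or a "../" traversal that lands outside the workspace raises ValueError instead of being processed.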
Persistence & Privilege
The skill does not request persistent or elevated presence (always:false). It does not modify other skills or system-wide agent settings. Autonomous invocation is allowed by default but is not combined with any broad credential access or unusual privileges.
Assessment
This skill appears to do what it says: convert local PDFs using the opendataloader-pdf package. Before installing or running it:
- Inspect and vet the external dependency (opendataloader-pdf) you will pip-install; that package will execute on your machine and is the primary runtime risk. Prefer to install it in a fresh virtualenv rather than system-wide.
- Note the script adds your user-site-packages to sys.path, so whatever is installed there will be imported. If you share an environment, ensure no untrusted packages are present in user-site.
- The script validates that input files are inside the current working directory; still run it in a controlled workspace to avoid accidental processing of sensitive files.
- The included package.json is unexpected for a Python-only skill but appears harmless; it may be leftover metadata.
- If you need stronger isolation, run this tool in a container or VM and audit opendataloader-pdf's behavior (it may spawn Java or other subprocesses according to the tests/notes).
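To act on the first two points, one option is to check whether the interpreter is running inside a virtual environment and where a module would be loaded from. The sketch below uses only the standard library; the helper names are hypothetical, and the import name opendataloader_pdf is an assumption based on the distribution name.

```python
import importlib.util
import sys
from typing import Optional


def in_virtualenv() -> bool:
    """True when the interpreter runs inside a venv-style environment."""
    return sys.prefix != sys.base_prefix


def module_origin(module_name: str) -> Optional[str]:
    """Return the file a top-level module would be loaded from, without importing it."""
    spec = importlib.util.find_spec(module_name)
    return spec.origin if spec is not None else None


print(f"inside a virtualenv: {in_virtualenv()}")
# "opendataloader_pdf" is the assumed import name of the dependency.
print(f"dependency would load from: {module_origin('opendataloader_pdf')}")
```

If the reported origin points at the user site rather than the virtualenv, the dependency being run is not the one that was freshly installed and vetted.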

Like a lobster shell, security has layers — review code before you run it.

latest: vk973zr9vj7m02r7e5dkqfpzg7n839826
