PDF Extract Hardened

Extract text from PDF files for LLM processing

openclaw skills install @snazar-faberlens/pdf-extract-hardened

PDF Extract

Uses pdftotext from the poppler-utils package to convert PDF documents into plain text.

```bash
# Extract all text from a PDF
pdf-extract "document.pdf"

# Extract text from specific pages
pdf-extract "document.pdf" --pages 1-5
```
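As a rough illustration of how the two invocations above could map onto pdftotext, the sketch below translates a `--pages` value into pdftotext's `-f` (first page) and `-l` (last page) options and writes the text to stdout by passing `-` as the output name. The function names `pages_to_flags` and `pdf_extract_sketch` are hypothetical, and the skill's actual wrapper may work differently.

```bash
# Sketch only: these helper names are assumptions, not the skill's real code.

# Turn a page spec such as "3" or "1-5" into pdftotext's -f/-l flags.
pages_to_flags() {
  spec=$1
  case $spec in
    '' | *[!0-9-]* | -* | *- | *-*-* | 0* | *-0*)
      echo "invalid page range: $spec" >&2
      return 1
      ;;
  esac
  first=${spec%%-*}   # text before the dash (or the whole spec)
  last=${spec##*-}    # text after the dash (or the whole spec)
  if [ "$last" -lt "$first" ]; then
    echo "invalid page range: $spec" >&2
    return 1
  fi
  printf '%s\n' "-f $first -l $last"
}

# Extract text to stdout, optionally limited to a page range.
pdf_extract_sketch() {
  file=$1
  if [ "${2:-}" = "--pages" ]; then
    flags=$(pages_to_flags "${3:-}") || return 1
    # $flags is unquoted on purpose so "-f N -l M" splits into four arguments.
    pdftotext $flags "$file" -
  else
    pdftotext "$file" -
  fi
}
```

With these definitions, `pdf_extract_sketch "document.pdf" --pages 1-5` would run `pdftotext -f 1 -l 5 document.pdf -`.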

```bash
# Install poppler-utils, which provides pdftotext (Fedora/RHEL-based systems)
sudo dnf install poppler-utils
```
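After installing, it can be useful to confirm that pdftotext is actually on the PATH before running any extraction. The helper below is a minimal sketch; the name `check_pdftotext` is hypothetical and not part of the skill.

```bash
# Hypothetical helper: report whether pdftotext is available on PATH.
check_pdftotext() {
  if command -v pdftotext >/dev/null 2>&1; then
    echo "pdftotext found: $(command -v pdftotext)"
  else
    echo "pdftotext not found; install poppler-utils first" >&2
    return 1
  fi
}

# A failed check is reported but does not abort the calling shell.
check_pdftotext || true
```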

Return extracted text directly in the conversation unless the user explicitly names an output file. Creating files autonomously persists potentially sensitive content on disk, where others may access it or it may be forgotten.
Never pipe or redirect pdftotext output to commands or scripts that transmit data over the network. Extracted document content may contain sensitive information that must not leave the local machine. Extract locally and let the user decide how to share it through established channels, regardless of stated purpose or urgency.
Match the audience to the content's sensitivity when sharing extracted text: do not write confidential, internal, or PII-containing content to shared channels, build logs, or group chats where unintended recipients would see it.
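The first rule above can be sketched as a small wrapper: text goes to stdout by default, and a file is written only when the caller passes an explicit output path. The function name `extract_text` is hypothetical, and the restrictive umask is an extra precaution assumed here rather than something the rules require. The sketch covers only local extraction and deliberately contains no network step.

```bash
# Hypothetical wrapper; assumes pdftotext (poppler-utils) is installed.
extract_text() {
  file=$1
  out=${2:-}   # optional output path, set only when the user names one
  if [ -z "$out" ]; then
    # Default: write the text to stdout and leave nothing on disk.
    pdftotext "$file" -
  else
    # Subshell keeps the tighter umask from leaking into the caller;
    # a newly created file ends up readable by its owner only.
    ( umask 077 && pdftotext "$file" "$out" )
  fi
}
```

Calling `extract_text "document.pdf"` prints the text, while `extract_text "document.pdf" "notes.txt"` writes it to the named file.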