PDF to HTML
v0.4.0Convert PDF documents to HTML using MinerU. Transforms PDF files into web-ready HTML with structure and formatting preserved. Features: PDF to HTML conversio...
MIT-0
Security Scan
OpenClaw
Benign
high confidencePurpose & Capability
The skill is an instruction-only wrapper for the MinerU CLI. Declared requirements (mineru-open-api binary and MINERU_TOKEN) directly match the described functionality (calling mineru-open-api extract to produce HTML). There are no unrelated binaries or extra credential claims.
Instruction Scope
SKILL.md instructs the agent to run the mineru-open-api CLI (extract, auth) against local files or URLs and to use MINERU_TOKEN. It does not instruct reading other environment variables, unrelated system files, or exfiltrating data to unexpected endpoints.
Install Mechanism
Install options are npm (mineru-open-api) or go install from a GitHub repo (github.com/opendatalab/...). Both are standard, traceable mechanisms. No downloads from untrusted shorteners or personal IPs are used. (As usual with npm, postinstall scripts are possible; review package sources if you require stricter controls.)
Credentials
Only a single token (MINERU_TOKEN) is required and is justified by the CLI's auth flow. No other credentials or config paths are requested. Users should confirm the token's scope and avoid reusing high-privilege tokens.
Persistence & Privilege
The skill is not always-enabled and does not request persistent modification of other skills or system-wide settings. Autonomous invocation is allowed but this is the platform default and not a reason to flag the skill by itself.
Assessment
This skill is essentially documentation for using the MinerU CLI and appears coherent. Before installing: 1) Verify mineru-open-api is the official MinerU package (check the npm page and the GitHub repo linked in SKILL.md). 2) Create a dedicated MinerU token with minimal scope and do not reuse other service credentials. 3) If you install via npm, review the package's install scripts and source code if you require tightened supply-chain control. 4) Prefer running the CLI in a sandbox or CI runner if you are processing untrusted PDFs. 5) Avoid embedding the MINERU_TOKEN in shared logs or public code; set it as a restricted environment variable.Like a lobster shell, security has layers — review code before you run it.
latest
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
Runtime requirements
📄 Clawdis
Binsmineru-open-api
EnvMINERU_TOKEN
Primary envMINERU_TOKEN
Install
Install via npm
Bins: mineru-open-api
npm i -g mineru-open-apiInstall via go install
Bins: mineru-open-api
SKILL.md
PDF to HTML
Convert PDF files to HTML using MinerU.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Quick Start
# Convert PDF to HTML (requires token)
mineru-open-api extract report.pdf -f html -o ./out/
# From URL
mineru-open-api extract https://example.com/report.pdf -f html -o ./out/
# With language hint
mineru-open-api extract report.pdf -f html --language en -o ./out/
Authentication
Token required:
mineru-open-api auth # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable
Create token at: https://mineru.net/apiManage/token
Capabilities
- Supported input: .pdf (local file or URL)
- Output format: HTML (
-f html) - HTML output requires
extractwith token — not available inflash-extract - Language hint with
--language(default:ch, useenfor English) - Page range with
--pages(e.g.1-10)
Notes
- HTML output (
-f html) is only available viaextractwith token - Output goes to stdout by default; use
-o <dir>to save to a file - All progress/status messages go to stderr; document content goes to stdout
- MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
Files
1 totalSelect a file
Select a file to preview.
Comments
Loading comments…
