PDF to HTML

v0.4.0

Convert PDF documents to HTML using MinerU. Transforms PDF files into web-ready HTML with structure and formatting preserved. Features: PDF to HTML conversio...

MIT-0
License: MIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
VirusTotal: Benign
OpenClaw: Benign (high confidence)
Purpose & Capability
The skill is an instruction-only wrapper for the MinerU CLI. Declared requirements (mineru-open-api binary and MINERU_TOKEN) directly match the described functionality (calling mineru-open-api extract to produce HTML). There are no unrelated binaries or extra credential claims.
Instruction Scope
SKILL.md instructs the agent to run the mineru-open-api CLI (extract, auth) against local files or URLs and to use MINERU_TOKEN. It does not instruct reading other environment variables, unrelated system files, or exfiltrating data to unexpected endpoints.
Install Mechanism
Install options are npm (mineru-open-api) or go install from a GitHub repo (github.com/opendatalab/...). Both are standard, traceable mechanisms. No downloads from untrusted shorteners or personal IPs are used. (As usual with npm, postinstall scripts are possible; review package sources if you require stricter controls.)
Credentials
Only a single token (MINERU_TOKEN) is required and is justified by the CLI's auth flow. No other credentials or config paths are requested. Users should confirm the token's scope and avoid reusing high-privilege tokens.
Persistence & Privilege
The skill is not always-enabled and does not request persistent modification of other skills or system-wide settings. Autonomous invocation is allowed, but this is the platform default and not, by itself, a reason to flag the skill.
Assessment
This skill is essentially documentation for using the MinerU CLI and appears coherent. Before installing:
1) Verify mineru-open-api is the official MinerU package (check the npm page and the GitHub repo linked in SKILL.md).
2) Create a dedicated MinerU token with minimal scope and do not reuse other service credentials.
3) If you install via npm, review the package's install scripts and source code if you require tightened supply-chain control.
4) Prefer running the CLI in a sandbox or CI runner if you are processing untrusted PDFs.
5) Avoid embedding the MINERU_TOKEN in shared logs or public code; set it as a restricted environment variable.


Runtime requirements

📄 Clawdis
Bins: mineru-open-api
Env: MINERU_TOKEN
Primary env: MINERU_TOKEN

Install

Install via npm
Bins: mineru-open-api
npm i -g mineru-open-api
Install via go install
Bins: mineru-open-api

SKILL.md

PDF to HTML

Convert PDF files to HTML using MinerU.

Install

npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Quick Start

# Convert PDF to HTML (requires token)
mineru-open-api extract report.pdf -f html -o ./out/

# From URL
mineru-open-api extract https://example.com/report.pdf -f html -o ./out/

# With language hint
mineru-open-api extract report.pdf -f html --language en -o ./out/
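
For more than one document, the single-file command above can be wrapped in a loop. The following is a minimal sketch, not part of the skill itself: convert_dir and the MINERU_BIN override are hypothetical names introduced here so the loop can be dry-run with a stand-in command, and giving each PDF its own output folder is an assumption about a convenient layout, not documented CLI behavior.

```shell
#!/usr/bin/env bash
# Sketch: convert every PDF in a directory to HTML, one output folder per PDF.
# convert_dir and MINERU_BIN are hypothetical; MINERU_BIN defaults to the real CLI.
convert_dir() {
  local src_dir="$1" out_root="$2" pdf name
  for pdf in "$src_dir"/*.pdf; do
    [ -e "$pdf" ] || continue            # glob matched nothing
    name="$(basename "$pdf" .pdf)"
    mkdir -p "$out_root/$name"
    # Same invocation as in Quick Start, applied to each file in turn.
    "${MINERU_BIN:-mineru-open-api}" extract "$pdf" -f html -o "$out_root/$name/" \
      || echo "failed: $pdf" >&2         # report the failure and keep going
  done
}
```

Calling convert_dir ./pdfs ./out would process each file in ./pdfs; setting MINERU_BIN=echo first prints the commands instead of running them.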

Authentication

Token required:

mineru-open-api auth             # Interactive token setup
export MINERU_TOKEN="your-token" # Or via environment variable

Create token at: https://mineru.net/apiManage/token
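
To keep the token out of shell history and shared scripts, it can be loaded from a private file before running the CLI. This is a sketch under stated assumptions: load_mineru_token is a hypothetical helper, and the default path ~/.config/mineru/token is chosen for illustration only; it is not a location the CLI is known to read.

```shell
# Sketch: populate MINERU_TOKEN from a private file if it is not already set.
# load_mineru_token and the default path are hypothetical, for illustration only.
load_mineru_token() {
  token_file="${1:-$HOME/.config/mineru/token}"
  if [ -n "${MINERU_TOKEN:-}" ]; then
    return 0                             # already provided by the environment
  fi
  if [ ! -r "$token_file" ]; then
    echo "MINERU_TOKEN is unset and $token_file is not readable" >&2
    return 1
  fi
  # Strip whitespace and newlines so a trailing newline does not end up in the token.
  MINERU_TOKEN="$(tr -d '[:space:]' < "$token_file")"
  export MINERU_TOKEN
}
```

Restricting the file with chmod 600 and calling load_mineru_token at the top of a script fails early with a clear message when no token is available.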

Capabilities

  • Supported input: .pdf (local file or URL)
  • Output format: HTML (-f html)
  • HTML output requires extract with token — not available in flash-extract
  • Language hint with --language (default: ch, use en for English)
  • Page range with --pages (e.g. 1-10)
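
The options listed above can be sketched as one call. This assumes --pages and --language may be combined in a single extract invocation, which the source does not state explicitly; extract_pages and MINERU_BIN are hypothetical names, and the en default here is this sketch's choice, not the CLI default (ch).

```shell
# Sketch: convert only a page range to HTML with an explicit language hint.
# extract_pages and MINERU_BIN are hypothetical; MINERU_BIN defaults to the real CLI.
extract_pages() {
  pdf="$1"                    # local path or URL of the PDF
  pages="$2"                  # page range, e.g. 1-10
  lang="${3:-en}"             # language hint; this sketch defaults to en
  out_dir="${4:-./out/}"      # output directory
  "${MINERU_BIN:-mineru-open-api}" extract "$pdf" -f html \
    --language "$lang" --pages "$pages" -o "$out_dir"
}
```

For example, extract_pages report.pdf 1-10 would request the first ten pages of report.pdf as HTML in ./out/.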

Notes

  • HTML output (-f html) is only available via extract with token
  • Output goes to stdout by default; use -o <dir> to save it to a directory instead
  • All progress/status messages go to stderr; document content goes to stdout
  • MinerU is open-source by OpenDataLab (Shanghai AI Lab): https://github.com/opendatalab/MinerU
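
Because document content and progress messages use separate streams, they can be redirected independently when -o is omitted. A minimal sketch, assuming the stdout/stderr split described in the notes above; html_to_file and MINERU_BIN are hypothetical names introduced for illustration.

```shell
# Sketch: capture the HTML from stdout in one file and progress output in another.
# html_to_file and MINERU_BIN are hypothetical; MINERU_BIN defaults to the real CLI.
html_to_file() {
  pdf="$1"; html_out="$2"; log_out="$3"
  # No -o flag, so the document is written to stdout; stderr carries status messages.
  "${MINERU_BIN:-mineru-open-api}" extract "$pdf" -f html > "$html_out" 2> "$log_out"
}
```

For example, html_to_file report.pdf report.html progress.log keeps the HTML free of status lines, which also makes it safe to pipe the output into another tool.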
