Skill flagged — suspicious patterns detected

ClawHub Security flagged this skill as suspicious. Review the scan results before using.

MinerU zero-setup document extraction — convert PDFs, images, Word, and PowerPoint to Markdown instantly. No login, no token, no configuration. Just run and get results

v0.2.1



Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for mineru-extract/mineru-fast-extract.

Install the skill "MinerU zero-setup document extraction — convert PDFs, images, Word, and PowerPoint to Markdown instantly. No login, no token, no configuration. Just run and get results" (mineru-extract/mineru-fast-extract) from ClawHub.
Skill page: https://clawhub.ai/mineru-extract/mineru-fast-extract
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Required binaries: mineru-open-api
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI


openclaw skills install mineru-fast-extract

ClawHub CLI


npx clawhub@latest install mineru-fast-extract
Security Scan
VirusTotal: Benign
OpenClaw: Suspicious (medium confidence)
Purpose & Capability
Name and description match the requested artifact: the skill only requires a single CLI binary (mineru-open-api) and its CLI commands in SKILL.md correspond to document extraction (PDF/Word/PPT/Image → Markdown). Requiring a mineru-open-api binary is coherent with the stated purpose.
Instruction Scope
Runtime instructions only tell the agent to invoke mineru-open-api flash-extract on local files or URLs. The SKILL.md does not disclose whether the CLI does all processing locally or uploads files to mineru.net or other remote services. The ability to pass remote URLs and the lack of any network/disclosure text is a scope concern because it affects confidentiality of documents the user provides.
Install Mechanism
Installers are npm (public registry) and go install from a GitHub module — these are common and traceable install routes. However, installing an npm package or a Go binary executes third‑party code on disk; that is expected but carries moderate risk. No direct downloads from unknown servers or URL shorteners were used.
Credentials
The skill declares no environment variables or credentials (which is consistent with 'no token' messaging). However, because the SKILL.md omits whether the CLI performs remote calls, there is an implicit risk that sensitive documents could be transmitted off‑device without requiring a token. The absence of any declared network endpoint or privacy statement is a proportionality/clarity concern.
Persistence & Privilege
The skill does not set always: true, does not request persistent system configuration or access to other skills' configs, and uses only a single CLI tool. No elevated presence or special privileges are requested.
What to consider before installing
This skill appears to be a thin wrapper around a third‑party CLI (mineru-open-api). Before installing or using it on sensitive documents:
  1. Inspect the mineru-open-api package and the referenced GitHub repo (check maintainers, recent commits, license, and whether processing is local).
  2. Prefer running the CLI in a sandbox/container or on a throwaway VM for initial tests.
  3. Test with non‑sensitive documents to observe network activity (e.g., monitor DNS/HTTP) to confirm whether files are uploaded.
  4. If you need on‑device-only processing, verify the implementation explicitly states local processing or choose a well‑audited open‑source tool.
If you cannot verify the binary's behavior, avoid using it with private or confidential documents.

Like a lobster shell, security has layers — review code before you run it.

Runtime requirements

Clawdis
Bins: mineru-open-api

Install

Install via npm
Bins: mineru-open-api
npm i -g mineru-open-api
Install via go install
Bins: mineru-open-api
License: MIT-0

Fast Document Extraction with mineru-open-api

Zero-setup, instant document parsing — no login, no token, no configuration needed. Supports tables and formulas (LaTeX).

Installation

npm install -g mineru-open-api

Or via Go (macOS/Linux):

go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest

Verify installation

mineru-open-api version

Quick start

mineru-open-api flash-extract report.pdf                     # PDF → Markdown (instant!)
mineru-open-api flash-extract report.pdf -o ./out/           # Save to the ./out/ directory
mineru-open-api flash-extract resume.docx                    # Word → Markdown
mineru-open-api flash-extract slides.pptx                    # PowerPoint → Markdown
mineru-open-api flash-extract photo.png                      # Image → Markdown (OCR)
mineru-open-api flash-extract https://example.com/doc.pdf    # URL → Markdown

Supported input formats

| Format | Supported |
|---|---|
| PDF (.pdf) | Yes |
| Images (.png, .jpg, .jpeg, .jp2, .webp, .gif, .bmp) | Yes |
| Word (.docx) | Yes |
| PowerPoint (.pptx) | Yes |
| URLs (remote files) | Yes |

Command: flash-extract

mineru-open-api flash-extract <file-or-url> [flags]

Flags

| Flag | Short | Default | Description |
|---|---|---|---|
| --output | -o | (stdout) | Output path (file or directory) |
| --language | | ch | Document language |
| --pages | | (all) | Page range, e.g. 1-10 |
| --timeout | | 900 | Timeout in seconds |

Supported --language values

Values are organized by script/language family — each value covers all languages in its group.

Standalone language packs

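| Value | Included languages | Notes |
|---|---|---|
| ch | Chinese, English, Chinese Traditional | Chinese and English (default) |
| ch_server | Chinese, English, Chinese Traditional, Japanese | Traditional Chinese, handwriting |
| en | English | English only |
| japan | Chinese, English, Chinese Traditional, Japanese | Primarily Japanese |
| korean | Korean, English | Korean |
| chinese_cht | Chinese, English, Chinese Traditional, Japanese | Primarily Traditional Chinese |
| ta | Tamil, English | Tamil |
| te | Telugu, English | Telugu |
| ka | Kannada | Kannada |
| el | Greek, English | Greek |
| th | Thai, English | Thai |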

Language family packs

| Value | Script/Family | Included languages |
|---|---|---|
| latin | Latin script | French, German, Afrikaans, Italian, Spanish, Bosnian, Portuguese, Czech, Welsh, Danish, Estonian, Irish, Croatian, Uzbek, Hungarian, Serbian (Latin), Indonesian, Occitan, Icelandic, Lithuanian, Maori, Malay, Dutch, Norwegian, Polish, Slovak, Slovenian, Albanian, Swedish, Swahili, Tagalog, Turkish, Latin, Azerbaijani, Kurdish, Latvian, Maltese, Pali, Romanian, Vietnamese, Finnish, Basque, Galician, Luxembourgish, Romansh, Catalan, Quechua |
| arabic | Arabic script | Arabic, Persian, Uyghur, Urdu, Pashto, Kurdish, Sindhi, Balochi, English |
| cyrillic | Cyrillic script | Russian, Belarusian, Ukrainian, Serbian (Cyrillic), Bulgarian, Mongolian, Abkhazian, Adyghe, Kabardian, Avar, Dargin, Ingush, Chechen, Lak, Lezgin, Tabasaran, Kazakh, Kyrgyz, Tajik, Macedonian, Tatar, Chuvash, Bashkir, Malian, Moldovan, Udmurt, Komi, Ossetian, Buryat, Kalmyk, Tuvan, Sakha, Karakalpak, English |
| east_slavic | East Slavic | Russian, Belarusian, Ukrainian, English |
| devanagari | Devanagari script | Hindi, Marathi, Nepali, Bihari, Maithili, Angika, Bhojpuri, Magahi, Santali, Newari, Konkani, Sanskrit, Haryanvi, English |

Examples

mineru-open-api flash-extract report.pdf
mineru-open-api flash-extract report.pdf -o ./out/
mineru-open-api flash-extract report.pdf --language en
mineru-open-api flash-extract report.pdf --language latin
mineru-open-api flash-extract report.pdf --pages "1-5"
mineru-open-api flash-extract contract.docx -o ./out/
mineru-open-api flash-extract presentation.pptx -o ./out/
mineru-open-api flash-extract scan.jpg --language ch

Output behavior

  • No -o flag: result goes to stdout; status/progress messages go to stderr
  • With -o flag: result saved to file/directory; progress messages on stderr
  • Markdown output includes extracted images saved alongside the .md file
  • Tables are converted to Markdown tables
  • Formulas are converted to LaTeX format (inline $...$ and block $$...$$)
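
Because the result and the progress messages travel on separate streams, they can be captured separately. The following is a minimal sketch of that idea, not part of the CLI: `extract_split` is a hypothetical helper name, and `MINERU_BIN` is an assumed override variable introduced here only so the command can be swapped out.

```shell
# Sketch only: extract_split is a hypothetical helper, not part of mineru-open-api.
# MINERU_BIN is an assumed override; it defaults to the real binary name.
extract_split() {
  src=$1      # local file or URL
  md_out=$2   # file that receives the Markdown result (stdout)
  log_out=$3  # file that receives status/progress messages (stderr)
  "${MINERU_BIN:-mineru-open-api}" flash-extract "$src" >"$md_out" 2>"$log_out"
}

# Usage:
#   extract_split "report 01.pdf" report.md extract.log
```

Keeping stderr in its own file means the Markdown stays clean while the progress log is still available if the run fails.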

Agent guidelines

When using this skill on behalf of the user:

  • Always use flash-extract for any input — whether it's a local file or a URL (e.g. https://cdn-mineru.openxlab.org.cn/demo/example.pdf). Do NOT assume a URL means "web page". flash-extract handles URLs to document files directly.
  • Quote file paths that contain spaces or special characters with double quotes. Example: mineru-open-api flash-extract "report 01.pdf".
  • Don't run commands blindly on errors — explain the exit code and troubleshooting steps instead of re-running the command.
  • Installation questions (e.g. "how do I install mineru") should be answered with the install instructions above.

Default output directory

When the user does NOT specify -o, generate a default output directory:

~/MinerU-Skill/<name>_<hash>/
  • <name>: derived from the source, then sanitized (replace spaces and shell-unsafe characters with _, collapse consecutive _).
    • For URLs: last path segment (e.g. https://arxiv.org/pdf/2509.22186 → 2509.22186)
    • For local files: filename without extension (e.g. report.pdf → report)
  • <hash>: first 6 characters of MD5 hash of the full original source.
echo -n "source" | md5sum | cut -c1-6   # Linux
echo -n "source" | md5 | cut -c1-6      # macOS

When the user specifies -o: use the user's path as-is.
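
The naming rule above can be sketched as a single shell function. This is an illustration under stated assumptions, not the skill's own implementation: `default_out_dir` is a hypothetical name, and "shell-unsafe characters" is interpreted here as anything outside letters, digits, `.`, `_`, and `-`.

```shell
# Sketch only: default_out_dir is a hypothetical helper name.
default_out_dir() {
  src=$1
  # Drop one trailing slash, then keep the last path segment
  name=${src%/}
  name=${name##*/}
  # Local files lose their extension; URLs keep the whole last segment
  case $src in
    http://*|https://*) ;;
    *) name=${name%.*} ;;
  esac
  # Assumption: every character outside [A-Za-z0-9._-] is treated as unsafe.
  # Replace those with _ and collapse consecutive _ into one.
  name=$(printf '%s' "$name" | tr -c 'A-Za-z0-9._-' '_' | tr -s '_')
  # First 6 characters of the MD5 of the full original source
  if command -v md5sum >/dev/null 2>&1; then
    hash=$(printf '%s' "$src" | md5sum | cut -c1-6)   # Linux
  else
    hash=$(printf '%s' "$src" | md5 | cut -c1-6)      # macOS
  fi
  printf '%s/MinerU-Skill/%s_%s/\n' "$HOME" "$name" "$hash"
}

# Usage:
#   out=$(default_out_dir "my report.pdf")
#   mineru-open-api flash-extract "my report.pdf" -o "$out"
```

`printf '%s'` is used instead of `echo -n` because it behaves the same way across shells; the hash is still taken over the source string with no trailing newline.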

Skill upgrade = CLI upgrade

When the user asks to upgrade this skill, re-install the CLI first:

npm install -g mineru-open-api@latest

Exit codes

| Code | Meaning | Recovery |
|---|---|---|
| 0 | Success | |
| 1 | General API or unknown error | Check network; retry; use --verbose |
| 2 | Invalid parameters / usage error | Check command syntax and flag values |
| 4 | File too large or page limit exceeded | Try a smaller file or fewer pages |
| 5 | Extraction failed | Document may be corrupted or unsupported |
| 6 | Timeout | Increase with --timeout |
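
An agent that explains failures rather than re-running the command could map the exit status to the recovery hints above. A minimal sketch follows; `explain_exit` is a hypothetical helper, and its messages simply restate the table.

```shell
# Sketch only: explain_exit is a hypothetical helper; messages restate the exit code table.
explain_exit() {
  case $1 in
    0) echo "Success" ;;
    1) echo "General API or unknown error: check network, retry, use --verbose" ;;
    2) echo "Invalid parameters or usage error: check command syntax and flag values" ;;
    4) echo "File too large or page limit exceeded: try a smaller file or fewer pages" ;;
    5) echo "Extraction failed: document may be corrupted or unsupported" ;;
    6) echo "Timeout: increase with --timeout" ;;
    *) echo "Undocumented exit code $1" ;;   # codes not listed in the table
  esac
}

# Usage:
#   mineru-open-api flash-extract "report 01.pdf" -o ./out/
#   explain_exit $?
```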

Troubleshooting

  • Timeout on large files: Increase with --timeout 1600
  • Extraction quality is poor: Try specifying --language to match the document language
  • HTTP 429: Rate limit hit. Wait a few minutes and retry.
