Skill flagged — suspicious patterns detected

ClawHub Security flagged this skill as suspicious. Review the scan results before using.

mar-document-pro

v1.0.0

文档处理技能 - 让 AI 能够读取、解析、提取 PDF、DOCX、PPT 等文档的关键信息,并通过 SkillBoss API Hub 进行 AI 智能分析与摘要。当用户要求分析文档、提取内容、总结报告时触发此技能。

0· 55·0 current·0 all-time

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for marjoriebroad/mar-document-pro.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "mar-document-pro" (marjoriebroad/mar-document-pro) from ClawHub.
Skill page: https://clawhub.ai/marjoriebroad/mar-document-pro
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install mar-document-pro

ClawHub CLI

Package manager switcher

npx clawhub@latest install mar-document-pro
Security Scan
Capability signals
Requires sensitive credentials
These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.
VirusTotalVirusTotal
Pending
View report →
OpenClawOpenClaw
Suspicious
medium confidence
!
Purpose & Capability
The SKILL.md describes a document-processing skill that extracts text/tables and calls the SkillBoss API — this matches the name/description. However, the registry metadata declared no required env vars or binaries while the SKILL.md requires SKILLBOSS_API_KEY and references multiple Python libraries (pdfplumber, PyPDF2, python-docx, python-pptx, openpyxl). The skill will thus fail or behave unexpectedly unless a Python runtime and those packages are available; that mismatch is incoherent.
Instruction Scope
Instructions are explicit about reading local files (PDF/DOCX/PPTX/XLSX), extracting contents, then sending the extracted text to https://api.heybossai.com/v1 via requests with the SKILLBOSS_API_KEY. This is consistent with the described purpose, but it means full document contents (potentially including sensitive data) are transmitted to a third-party service — a privacy/exfiltration risk inherent to the workflow and worth the user's consideration.
Install Mechanism
There is no install spec (instruction-only), which is lower supply-chain risk. However, the instructions rely on multiple Python packages but do not declare them or provide installation steps. That omission is an operational inconsistency (runtime dependencies are undeclared).
!
Credentials
SKILL.md requires SKILLBOSS_API_KEY (used to call the external API), which is proportionate to the stated third-party AI analysis. But the registry metadata does not list this required environment variable or mark a primary credential — a clear mismatch. The single API key requested is reasonable for function, but the metadata omission reduces transparency.
Persistence & Privilege
The skill is not always-enabled and does not request elevated system persistence or modify other skills. It is user-invocable and can be invoked autonomously per platform defaults; nothing in the package requests unusual privileges.
What to consider before installing
Before installing or enabling this skill, consider the following: - Metadata mismatch: the registry metadata does not declare the SKILLBOSS_API_KEY that the SKILL.md requires, nor does it list the Python packages the code examples use. Ask the publisher to correct metadata and provide a manifest or install steps (pip requirements) so you know what will run. - Data privacy: the workflow extracts full document text and sends it to api.heybossai.com. Do not allow the skill to process sensitive, confidential, or regulated documents unless you trust SkillBoss and have reviewed their privacy/security policies. - Runtime requirements: this is an instruction-only skill that assumes Python and multiple libraries are available. If you enable it, ensure it's run in a controlled environment with the required packages installed (or request the skill author add an install spec). - Least privilege: supply a dedicated API key with limited scope for SkillBoss rather than reuse broad credentials, and rotate/revoke the key if you stop using the skill. - Ask for clarification: request that the author update the registry to declare SKILLBOSS_API_KEY as primaryEnv, list required packages (requirements.txt), and state expected runtime (Python version) and any data retention behavior for the external API. Given the metadata/instruction inconsistencies and the external data transmission, proceed only after those questions are answered and you are comfortable with the privacy trade-offs.

Like a lobster shell, security has layers — review code before you run it.

latestvk97cyvywq820ws4jw3v2rgbmyd85e5b8
55downloads
0stars
1versions
Updated 4d ago
v1.0.0
MIT-0

Document Pro - 文档处理技能

概述

赋予 AI 强大的文档处理能力:

  • PDF 读取与提取
  • Word 文档解析
  • PowerPoint 提取
  • Excel 数据提取
  • 文档格式转换

触发场景

  1. 用户发送文档并要求"分析"、"总结"
  2. 用户要求"提取文档内容"
  3. 用户要求"转换成 PDF"
  4. 用户询问文档中的具体信息
  5. 用户要求"从报告/论文中提取要点"

支持的格式

格式读取写入工具
PDFpdfplumber, PyPDF2
DOCXpython-docx
PPTXpython-pptx
XLSXopenpyxl
TXT内置
Markdown内置

工具使用

PDF 处理

# 提取文本
import pdfplumber

with pdfplumber.open("document.pdf") as pdf:
    for page in pdf.pages:
        text = page.extract_text()
        print(text)

# 提取表格
with pdfplumber.open("document.pdf") as pdf:
    table = pdf.pages[0].extract_tables()

Word 文档

from docx import Document

doc = Document("document.docx")
for para in doc.paragraphs:
    print(para.text)

# 提取表格
for table in doc.tables:
    for row in table.rows:
        print([cell.text for cell in row.cells])

PowerPoint

from pptx import Presentation

prs = Presentation("presentation.pptx")
for slide in prs.slides:
    for shape in slide.shapes:
        if shape.has_text_frame:
            print(shape.text)

AI 分析(通过 SkillBoss API Hub)

提取文档内容后,使用 SkillBoss API Hub 进行 AI 分析(摘要、关键词提取等):

import requests, os

SKILLBOSS_API_KEY = os.environ["SKILLBOSS_API_KEY"]
API_BASE = "https://api.heybossai.com/v1"

def pilot(body: dict) -> dict:
    r = requests.post(
        f"{API_BASE}/pilot",
        headers={"Authorization": f"Bearer {SKILLBOSS_API_KEY}", "Content-Type": "application/json"},
        json=body,
        timeout=60,
    )
    return r.json()

# 提取文档文本后,调用 SkillBoss API Hub 进行摘要分析
def analyze_document(doc_text: str) -> str:
    result = pilot({
        "type": "chat",
        "inputs": {
            "messages": [
                {"role": "system", "content": "你是文档分析助手,请提取要点并生成结构化摘要。"},
                {"role": "user", "content": f"请分析以下文档内容:\n\n{doc_text}"}
            ]
        },
        "prefer": "balanced"
    })
    return result["result"]["choices"][0]["message"]["content"]

工作流

1. 识别文档类型 → 选择正确的工具
2. 读取内容 → 提取文本、表格、图片
3. 分析信息 → 通过 SkillBoss API Hub 理解结构、提取要点
4. 总结呈现 → 用中文总结给用户

进阶功能

文档摘要

  • 提取文档主要观点
  • 生成简短摘要
  • 列出关键要点

表格处理

  • 识别表格结构
  • 提取表格数据
  • 转换为 CSV/Excel

关键词提取

  • 找出重要名词/术语
  • 识别主题
  • 提取关键信息

环境变量

SKILLBOSS_API_KEY=<your_skillboss_api_key>

输出格式

向用户呈现文档时:

  • 文档类型和页数
  • 主要内容摘要
  • 关键要点(3-5条)
  • 建议的后续操作

限制

  • 扫描版 PDF 需要 OCR
  • 复杂格式可能丢失
  • 图片/图表无法完全理解

Comments

Loading comments...