PPT 视觉复刻
v1.4.1将PPT截图或信息图转换为可编辑的PPTX文件。此技能基于Images2Slides论文(arXiv:2602.07645)实现, 利用视觉-语言模型(VLM)进行区域理解,通过坐标映射算法将像素坐标转换为PPTX坐标。 支持复杂形状降级处理策略(custGeom/渐变/透明等无法直接还原的形状转为PNG嵌入)。...
⭐ 0· 101·0 current·0 all-time
byAllegro@allergro
MIT-0
Download zip
LicenseMIT-0 · Free to use, modify, and redistribute. No attribution required.
Security Scan
OpenClaw
Benign
high confidencePurpose & Capability
Name/description align with included code (coordinate_mapper.js, ppt_generator.js, complex_shape.py) and the declared runtime needs (pptxgenjs, Pillow, filesystem, network). There are no unrelated environment variables, binaries, or config paths requested that would be disproportionate to the conversion task.
Instruction Scope
SKILL.md explicitly instructs sending images to an external VLM service for region analysis (OpenClaw image tool / MiniMax / GPT-4V). The code files themselves do not contain network calls; the network usage is part of the architecture described in the instructions. The instructions also require writing temporary PNGs and output PPTX files and include optional unzipping of original PPTX for XML extraction—these are proportional but mean image data and (optionally) PPTX contents will be transmitted/processed.
Install Mechanism
There is no automated install spec (instruction-only install), which is lowest-risk. The SKILL.md asks users to install pptxgenjs via npm and Pillow via pip for optional PNG rendering; these are standard, known packages. No remote archive downloads or obscure installers are present.
Credentials
The skill does not declare any required environment variables or credentials, which is consistent with shipping as agent-side code that uses the platform's image/VLM tooling. However, runtime image analysis requires a configured VLM endpoint (implicit credentials/auth are expected to be provided by the platform or agent configuration). Users should confirm where images will be sent and what credentials the agent/platform uses.
Persistence & Privilege
The skill does not request always:true or any special persistent privileges. It reads and writes temporary files in the filesystem (explicitly declared) and does not modify other skills or global agent config.
Assessment
This skill appears coherent for converting PPT screenshots to editable PPTX: it includes the JS/Python code you'd expect and warns that images are sent to an external VLM. Before installing, ensure you (1) do not process sensitive/confidential slides because images are transmitted to the configured VLM endpoint, (2) verify which VLM/service the agent will use and that its privacy/security posture is acceptable, (3) install the optional dependencies (npm pptxgenjs and Python Pillow) in a controlled environment, and (4) allow filesystem read/write only in a safe temp directory. If you need assurance about where data is sent, ask the skill author how the VLM endpoint is configured or inspect the agent/platform image tool settings.Like a lobster shell, security has layers — review code before you run it.
latestvk97f1e2zrv4ab9sj071vkghqzh83nxzg
License
MIT-0
Free to use, modify, and redistribute. No attribution required.
