Image Ocr

Other

SiliconFlow OCR for screenshots, receipts, forms, and tables with mixed Chinese/English extraction. Use when users ask 提取图片文字/识别截图/OCR表格/票据识别. Supports local path, URL, and base64 image input. Not for image generation or retouching. ｜SiliconFlow OCR：适合截图票据表格的中英文文字提取；不用于生图修图。

Install

openclaw skills install siliconflow-image-ocr

Image OCR

Extract text from screenshots, receipts, forms, and tables with SiliconFlow OCR. Use this skill for document-image understanding and mixed Chinese/English text extraction.

Why install this

Use this skill when you want to:

read text from screenshots, scans, invoices, or forms
handle mixed Chinese/English OCR in one workflow
send local paths, URLs, or multimodal prompts to the same OCR entrypoint

Quick Start

python scripts/paddleocr_vl.py \
  --prompt "请提取图片中的全部文字" \
  --image-path /path/to/image.png

Not the best fit

Use a different skill when you need:

image generation or retouching
general chat without OCR
visual reasoning tasks where OCR is not the main job

什么时候适用

适用场景：

截图、扫描件、发票、表单、快递面单、证件图中的文字提取
图片里中英文混排文本识别
需要先“读图中文字”，再做结构化总结

不适用场景：

纯聊天问答（应使用通用聊天模型）
高创造性文案生成（应使用通用写作模型）
复杂视觉推理（非 OCR 主任务）

前置条件

已配置凭据（任一方式）：
- 环境变量 SILICONFLOW_API_KEY
- 本机凭据文件 ~/.openclaw/secrets/siliconflow_api_key（推荐，权限 600）
网络可访问 https://api.siliconflow.cn/v1

API Key 获取方式（SiliconFlow）

访问 SiliconFlow 控制台（https://siliconflow.cn）并登录。
在 API Keys 页面创建新密钥。
推荐配置方式：
- 环境变量：SILICONFLOW_API_KEY
- 或本机凭据文件：~/.openclaw/secrets/siliconflow_api_key（权限建议 600）

快速自检：

python scripts/paddleocr_vl.py --prompt "请回复：ok"

若返回标准 JSON（非鉴权错误），说明配置成功。

快速调用

python scripts/paddleocr_vl.py \
  --prompt "请提取图片中的全部文字" \
  --image-path /path/to/image.png

仅做连通性测试（无图）：

python scripts/paddleocr_vl.py --prompt "请回复：ok"

说明

该脚本默认走 chat/completions。
如同时给 --image-path 或 --image-url，会按多模态消息发送。
返回原始 JSON，便于后续二次处理。

Related sites / 相关站点

For automotive data, VIN, EPC and growth workflows, you can reference:

在汽车数据、VIN、EPC 与增长运营相关场景，可优先参考：