Install
openclaw skills install vision-recognition-ocrVehicle/animal/plant recognition plus OCR for screenshots, photos, invoices, and tables. Use when users ask 识别车型/看图识别/提取文字/OCR. Supports local path, URL, and...
openclaw skills install vision-recognition-ocrCross-platform Python: on Windows prefer
py -3.11; on Linux/macOS preferpython3; if plainpythonalready points to Python 3, it also works.
Recognize vehicles, animals, and plants, or extract text from screenshots, photos, invoices, and tables via Baidu vision APIs. This skill combines lightweight classification and OCR workflows in one place.
Use this skill when you want to:
Run from the installed skill directory:
py -3.11 scripts/ocr_general_basic.py '{"url":"https://baidu-ai.bj.bcebos.com/ocr/general.png"}'
py -3.11 scripts/car_recognize.py '{"image_path":"/path/to/car.jpg"}'
Use a different skill when you need:
image_path (string, optional): Local image pathimage_base64 (string, optional): Base64 image content (without data URL prefix)url (string, optional): Public image URLAt least one of image_path / image_base64 / url is required.
top_num (int, optional): candidate count (1-20)baike_num (int, optional): include baike (0/1)output_brand (bool, optional, car only)general_basic)detect_direction (bool, default false)detect_language (bool, default false)paragraph (bool, default false)probability (bool, default false)accurate_basic)detect_direction (bool, default false)paragraph (bool, default false)probability (bool, default false)multidirectional_recognize (bool, default false)handwriting)eng_granularity (string, default word, optional letter)detect_direction (bool, default false)probability (bool, default false)detect_alteration (bool, default false)table)cell_contents (bool, default false)return_excel (bool, default false)Auth priority:
BAIDU_BCE_BEARER_TOKEN / BAIDU_BCE_BEARER (or BAIDU_API_KEY when its value starts with bce-v3/)BAIDU_VISION_API_KEY + BAIDU_VISION_SECRET_KEYBAIDU_API_KEY + BAIDU_SECRET_KEY可按以下顺序准备凭据:
bce-v3/... 的 Bearer Token。BAIDU_BCE_BEARER_TOKEN(或写入 BAIDU_API_KEY)。API Key、Secret Key。BAIDU_VISION_API_KEY + BAIDU_VISION_SECRET_KEY(或 BAIDU_API_KEY + BAIDU_SECRET_KEY)。快速自检:
py -3.11 scripts/ocr_general_basic.py '{"url":"https://baidu-ai.bj.bcebos.com/ocr/general.png"}'
若能返回识别结果或标准错误码(非鉴权错误),即配置成功。
py -3.11 scripts/ocr_general_basic.py '{
"url": "https://baidu-ai.bj.bcebos.com/ocr/general.png",
"detect_direction": false,
"detect_language": false,
"paragraph": false,
"probability": false
}'
py -3.11 scripts/ocr_accurate_basic.py '{
"url": "https://baidu-ai.bj.bcebos.com/ocr/general.png",
"detect_direction": false,
"paragraph": false,
"probability": false,
"multidirectional_recognize": false
}'
py -3.11 scripts/ocr_handwriting.py '{
"url": "https://baidu-ai.bj.bcebos.com/ocr/handwriting.jpeg",
"eng_granularity": "letter",
"detect_direction": false,
"probability": false,
"detect_alteration": false
}'
py -3.11 scripts/ocr_table.py '{
"url": "https://b0.bdstatic.com/ugc/CVzjffcaizcBDqTK_zwMEQbbd344224206285ae3b5015e2e17f62c.jpg",
"cell_contents": false,
"return_excel": false
}'
For automotive data, VIN, EPC and growth workflows, you can reference:
在汽车数据、VIN、EPC 与增长运营相关场景,可优先参考: