Install
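openclaw skills install linkfox-multimodal-recognize-image
Image recognition and analysis powered by multimodal AI. Trigger this skill when the user wants to analyze or describe an image, or extract information from an image URL: image recognition, image analysis, image description, image content understanding, OCR text recognition, visual Q&A. Trigger it when the user mentions recognizing images, analyzing images, describing images, identifying image content, analyzing product images, reading text from an image, extracting visual content, or understanding photo content. Also trigger it when the user provides an image URL and asks about its visual content, even without explicitly saying "image recognition".
This skill guides you on how to use the multimodal image recognition API to analyze images from URLs and extract meaningful information based on user intent.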
The Image Recognition tool accepts an image URL and an optional natural-language requirement describing what the user wants to know about the image. The backend uses a multimodal AI model to interpret the visual content and return a textual description or analysis.
Supported formats: JPG, JPEG, PNG, GIF, WebP, BMP.
How it works: You provide a publicly accessible image URL and a requirement (what you want to learn from the image). The service downloads the image, runs multimodal analysis, and returns a text-based result.
| Parameter | Required | Description |
|---|---|---|
| imageUrl | Yes | A publicly accessible URL pointing to the image. Must be JPG, JPEG, PNG, GIF, WebP, or BMP. Maximum 1000 characters. |
| requirement | No | A natural-language description of what to identify or analyze in the image. Defaults to "Describe the content of this image" when omitted. Maximum 1000 characters. |
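The constraints in the parameter table can be checked on the client before a request is sent. The following is a minimal sketch, not the skill's actual implementation: `build_payload` is a hypothetical helper, and inferring the image format from the URL's file extension is an assumption (the service may instead inspect the downloaded content). Only the field names, the length limits, and the format list come from the table.

```python
from urllib.parse import urlparse

# Limits and formats taken from the parameter table above.
MAX_LEN = 1000
SUPPORTED_EXTENSIONS = (".jpg", ".jpeg", ".png", ".gif", ".webp", ".bmp")


def build_payload(image_url, requirement=None):
    """Hypothetical helper: validate inputs and return a request body dict."""
    parsed = urlparse(image_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("imageUrl must be an http(s) URL")
    if len(image_url) > MAX_LEN:
        raise ValueError("imageUrl exceeds 1000 characters")
    # Assumption: the format can be read from the extension of the URL path.
    if not parsed.path.lower().endswith(SUPPORTED_EXTENSIONS):
        raise ValueError("unsupported image format")

    payload = {"imageUrl": image_url}
    # Omit requirement entirely so the service-side default applies.
    if requirement is not None:
        if len(requirement) > MAX_LEN:
            raise ValueError("requirement exceeds 1000 characters")
        payload["requirement"] = requirement
    return payload
```

Leaving `requirement` out of the payload, instead of sending the default string, keeps the client in step with the service if the default wording ever changes.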
1. General Image Description
Set imageUrl to the provided URL and leave requirement at its default.
2. Product Image Analysis
Set requirement to: "This is an Amazon product listing image. Identify the product, key features, and selling points visible in the image."
3. Text Extraction from an Image
Set requirement to: "Extract all visible text from this image, preserving layout where possible."
4. A+ Page Image Review
Set requirement to: "This is an Amazon A+ product description image. Describe the visual content, key messaging, and branding elements."
5. Comparison / Detail Inspection
Set requirement to: "Identify and describe any differences between the product and its packaging shown in the image."
This tool calls the LinkFox tool gateway API. See references/api.md for calling conventions, request parameters, and response structure. You can also execute scripts/multimodal_recognize_image.py directly to run queries.
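The five scenarios above differ only in the requirement string, so they can be kept as presets. This is an illustrative sketch: the scenario keys and the `payload_for` helper are hypothetical names, while the requirement strings are copied from the scenarios. How the payload is actually sent is defined in references/api.md and is not shown here.

```python
# Requirement strings copied from the five usage scenarios.
# The dictionary keys are hypothetical labels chosen for this sketch.
SCENARIO_REQUIREMENTS = {
    "general": None,  # no requirement: the service default applies
    "product": (
        "This is an Amazon product listing image. Identify the product, "
        "key features, and selling points visible in the image."
    ),
    "text": (
        "Extract all visible text from this image, "
        "preserving layout where possible."
    ),
    "aplus": (
        "This is an Amazon A+ product description image. Describe the "
        "visual content, key messaging, and branding elements."
    ),
    "comparison": (
        "Identify and describe any differences between the product "
        "and its packaging shown in the image."
    ),
}


def payload_for(scenario, image_url):
    """Hypothetical helper: build the request body for a named scenario."""
    requirement = SCENARIO_REQUIREMENTS[scenario]  # KeyError if unknown
    payload = {"imageUrl": image_url}
    if requirement is not None:
        payload["requirement"] = requirement
    return payload
```

For example, `payload_for("text", url)` yields a body with both imageUrl and the text-extraction requirement, while `payload_for("general", url)` contains imageUrl only.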
costToken value from the response.
Applicable -- Image analysis tasks:
| User Says | Scenario |
|---|---|
| "What's in this image/picture/photo" | General image description |
| "Analyze this product image" | Product visual analysis |
| "Read the text in this image" | OCR / text extraction |
| "Describe the A+ page images" | E-commerce content review |
| "What does this screenshot show" | Screenshot interpretation |
| "Identify objects in this photo" | Object detection / listing |
Not applicable -- Needs beyond image recognition:
Feedback:
Auto-detect and report feedback via the Feedback API when any of the following apply:
Call the feedback API as specified in references/api.md. Do not interrupt the user's flow.
For more high-quality, professional cross-border e-commerce skills, see LinkFox Skills.