Generate images & videos with Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.

MIT-0 · Free to use, modify, and redistribute. No attribution required.

by @AIsaDocs

Security Scan

VirusTotal: Benign

OpenClaw: Benign (high confidence)

✓

Purpose & Capability

The name and description claim image and video generation via AIsa; the required binaries (python3, curl) and the single env var AISA_API_KEY match that purpose. The included client code calls the documented AIsa endpoints (api.aisa.one) and implements the Gemini image and Wan video flows described in SKILL.md.

ℹ

Instruction Scope

SKILL.md and the Python client only instruct calls to AIsa endpoints, create/poll async video tasks, parse inline image base64, and save/download media to local disk. This is within scope. Note: the client will download arbitrary URLs returned by the service or provided as --img-url, so downloaded content may be large or come from external storage (signed URLs).

✓

Install Mechanism

No install spec (instruction-only with a bundled Python script). No remote archives or opaque installers are fetched by the skill itself, so install risk is low.

✓

Credentials

Only AISA_API_KEY is required and used (explicit or from env). No additional secrets, unrelated credentials, or config paths are requested. The API key is proportional to the claimed functionality.

✓

Persistence & Privilege

Skill is not always-enabled and does not request elevated/system-wide persistence or modify other skills. disable-model-invocation is false (agent may invoke autonomously) which is the platform default and not problematic here.

Assessment

This package appears to do what it says: it uses your AISA_API_KEY to call AIsa endpoints to generate images/videos and saves downloaded media locally. Before installing, consider: (1) use a dedicated AIsa API key and monitor its usage and billing (video generation is priced per second, so costs can add up), (2) review the included media_gen_client.py if you plan to run it locally, (3) be aware that the client will download URLs returned by the service or any --img-url you supply (the content could be large or point to third-party storage), and (4) confirm you trust the remote provider (api.aisa.one / aisa.mintlify.app) and the skill source (owner is unknown). If you have sensitive data, avoid sending it to untrusted third-party APIs and consider running in an isolated environment.

Like a lobster shell, security has layers — review code before you run it.

Current version: v1.0.0

latestvk97d7qyce37db1d4tqz9xxe39n80rx4p

License

MIT-0

Terms: https://spdx.org/licenses/MIT-0.html

Runtime requirements

🎬 Clawdis

Bins: python3, curl

Env: AISA_API_KEY

Primary env: AISA_API_KEY

SKILL.md

OpenClaw Media Gen 🎬

Generate images and videos with one AIsa API key:

Image: gemini-3-pro-image-preview (Gemini GenerateContent)
Video: wan2.6-t2v (Qwen Wan 2.6 / Tongyi Wanxiang, async task)

API Reference: AIsa API Reference (all pages available at https://aisa.mintlify.app/llms.txt)

🎯 Pricing Advantage

Video Generation (WAN) - Cost Comparison

Resolution	AIsa (Contract)	AIsa (Official)	Bailian (Official)	OpenRouter
720P	$0.06/sec	~$0.08	~$0.10	❌
1080P	$0.09/sec	~$0.12	~$0.15	❌
Pro/Animate	$0.108–0.156	~$0.18	~$0.25	❌

Key Benefits:

25-40% cheaper than Bailian official pricing
OpenRouter doesn't support video generation; of the providers compared above, AIsa is the only unified API that offers it
Contract pricing available for production workloads
Single API key for both image and video generation
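The comparison above can be checked with a little arithmetic. The sketch below copies the 720P and 1080P figures from the table and assumes that every column is a per-second USD price (only the contract column states "/sec" explicitly). The function names and dictionary keys are illustrative, not part of any AIsa tooling.

```python
# Approximate prices copied from the table above.
# Assumption: all columns are USD per second of generated video.
PRICES = {
    "720P": {"aisa_contract": 0.06, "aisa_official": 0.08, "bailian": 0.10},
    "1080P": {"aisa_contract": 0.09, "aisa_official": 0.12, "bailian": 0.15},
}


def savings_vs_bailian(resolution: str, tier: str) -> float:
    """Percentage saved relative to the Bailian column, rounded to 0.1%."""
    row = PRICES[resolution]
    return round((1 - row[tier] / row["bailian"]) * 100, 1)


def clip_cost(resolution: str, tier: str, seconds: int) -> float:
    """Estimated cost in USD of a clip of the given length."""
    return round(PRICES[resolution][tier] * seconds, 2)


if __name__ == "__main__":
    for resolution in PRICES:
        for tier in ("aisa_contract", "aisa_official"):
            pct = savings_vs_bailian(resolution, tier)
            print(f"{resolution} {tier}: {pct}% below Bailian")
    print("5 s at 720P (contract): $", clip_cost("720P", "aisa_contract", 5))
```

With the table's numbers, the contract tier works out to 40% below the Bailian column at both resolutions, and the official tier to 20%.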

🔥 What You Can Do

Image Generation (Gemini)

"Generate a cyberpunk cityscape at night, neon lights, rainy, cinematic"

Video Generation (Wan 2.6)

"Use a reference image to generate a 5-second shot: slow camera push-in, wind blowing hair, cinematic, shallow depth of field"

Quick Start

export AISA_API_KEY="your-key"

🖼️ Image Generation (Gemini)

Endpoint

Base URL: https://api.aisa.one/v1
POST /models/{model}:generateContent

Documentation: google-gemini-chat (GenerateContent) at https://aisa.mintlify.app/api-reference/chat/chat-api/google-gemini-chat.md

curl Example (returns inline_data for images)

curl -X POST "https://api.aisa.one/v1/models/gemini-3-pro-image-preview:generateContent" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents":[
      {"role":"user","parts":[{"text":"A cute red panda, ultra-detailed, cinematic lighting"}]}
    ]
  }'

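Note: The response may carry the image as inline_data inside the candidates' parts (under the Gemini GenerateContent schema this is candidates[].content.parts[], and the key may appear as inlineData), typically a base64 data field plus a MIME type. The bundled client script parses this and saves the file automatically.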
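For readers who want to handle the response themselves, here is a minimal sketch of the parsing step. It is not the bundled client's code; it assumes a Gemini-style response body and accepts both the snake_case and camelCase spellings of the inline-data keys, as well as parts placed directly on the candidate. The function names are hypothetical.

```python
import base64
from pathlib import Path


def extract_inline_images(response: dict) -> list:
    """Collect (mime_type, raw_bytes) pairs from a GenerateContent-style response."""
    images = []
    for candidate in response.get("candidates", []):
        # Gemini nests parts under "content"; fall back to the candidate itself.
        parts = candidate.get("content", candidate).get("parts", [])
        for part in parts:
            inline = part.get("inline_data") or part.get("inlineData")
            if not inline or "data" not in inline:
                continue  # text-only part
            mime = (
                inline.get("mime_type")
                or inline.get("mimeType")
                or "application/octet-stream"
            )
            images.append((mime, base64.b64decode(inline["data"])))
    return images


def save_first_image(response: dict, out_path: str) -> Path:
    """Write the first inline image to out_path and return the path."""
    images = extract_inline_images(response)
    if not images:
        raise ValueError("response contained no inline image data")
    path = Path(out_path)
    path.write_bytes(images[0][1])
    return path
```

A caller would pass the decoded JSON body of the curl request above, for example save_first_image(body, "out.png").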

🎞️ Video Generation (Qwen Wan 2.6 / Tongyi Wanxiang)

Create Task

Base URL: https://api.aisa.one/apis/v1
POST /services/aigc/video-generation/video-synthesis
Header: X-DashScope-Async: enable (required for async)

Documentation: video-generation at https://aisa.mintlify.app/api-reference/aliyun/video/video-generation.md

curl -X POST "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis" \
  -H "Authorization: Bearer $AISA_API_KEY" \
  -H "Content-Type: application/json" \
  -H "X-DashScope-Async: enable" \
  -d '{
    "model":"wan2.6-t2v",
    "input":{
      "prompt":"cinematic close-up, slow push-in, shallow depth of field",
      "img_url":"https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg"
    },
    "parameters":{
      "resolution":"720P",
      "duration":5,
      "shot_type":"single",
      "watermark":false
    }
  }'
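The same request can be built from Python with only the standard library. This is a sketch that mirrors the curl call above, not the bundled client's implementation. The function names are hypothetical, and reading the task ID from output.task_id is an assumption based on the DashScope-style async API that this endpoint follows.

```python
import json
import os
import urllib.request

BASE_URL = "https://api.aisa.one/apis/v1"
CREATE_PATH = "/services/aigc/video-generation/video-synthesis"


def build_video_request(prompt, img_url, api_key, resolution="720P", duration=5):
    """Build (but do not send) the POST request shown in the curl example."""
    payload = {
        "model": "wan2.6-t2v",
        "input": {"prompt": prompt, "img_url": img_url},
        "parameters": {
            "resolution": resolution,
            "duration": duration,
            "shot_type": "single",
            "watermark": False,
        },
    }
    return urllib.request.Request(
        BASE_URL + CREATE_PATH,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            "X-DashScope-Async": "enable",  # required for async task creation
        },
        method="POST",
    )


def create_video_task(prompt, img_url, api_key=None):
    """Send the request and return the new task ID."""
    api_key = api_key or os.environ["AISA_API_KEY"]
    request = build_video_request(prompt, img_url, api_key)
    with urllib.request.urlopen(request, timeout=60) as resp:
        body = json.load(resp)
    # Assumption: the task ID is returned under output.task_id.
    return body["output"]["task_id"]
```

Separating request construction from sending keeps the payload easy to inspect or log before any billable call is made.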

Poll Task Status

GET /services/aigc/tasks?task_id=...

Documentation: task at https://aisa.mintlify.app/api-reference/aliyun/video/task.md

curl "https://api.aisa.one/apis/v1/services/aigc/tasks?task_id=YOUR_TASK_ID" \
  -H "Authorization: Bearer $AISA_API_KEY"
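Polling is usually wrapped in a loop with an interval and a deadline, much like the client's video-wait command. The sketch below is one way to write that loop and is not taken from the bundled script. The status names (SUCCEEDED, FAILED, and so on) and the output.task_status field are assumptions drawn from the DashScope task API; the helper names are hypothetical.

```python
import json
import time
import urllib.parse
import urllib.request

TASKS_URL = "https://api.aisa.one/apis/v1/services/aigc/tasks"
# Assumption: DashScope-style terminal states.
TERMINAL_STATES = {"SUCCEEDED", "FAILED", "CANCELED", "UNKNOWN"}


def fetch_task(task_id, api_key):
    """GET the task record, mirroring the curl call above."""
    url = TASKS_URL + "?" + urllib.parse.urlencode({"task_id": task_id})
    request = urllib.request.Request(
        url, headers={"Authorization": f"Bearer {api_key}"}
    )
    with urllib.request.urlopen(request, timeout=60) as resp:
        return json.load(resp)


def wait_for_task(fetch_status, poll=10.0, timeout=600.0,
                  sleep=time.sleep, clock=time.monotonic):
    """Call fetch_status() every `poll` seconds until the task finishes.

    Returns the task's "output" object once a terminal state is seen and
    raises TimeoutError if the deadline passes first.
    """
    deadline = clock() + timeout
    while True:
        output = fetch_status().get("output", {})
        if output.get("task_status") in TERMINAL_STATES:
            return output
        if clock() >= deadline:
            raise TimeoutError("video task did not finish before the timeout")
        sleep(poll)
```

Typical use would be wait_for_task(lambda: fetch_task(task_id, api_key)), followed by a check that task_status is SUCCEEDED before reading the video URL from the returned object. Passing sleep and clock as parameters lets the loop be exercised without real waiting.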

Python Client

# Generate image (save to local file)
python3 {baseDir}/scripts/media_gen_client.py image \
  --prompt "A cute red panda, cinematic lighting" \
  --out "out.png"

# Create video task (requires img_url)
python3 {baseDir}/scripts/media_gen_client.py video-create \
  --prompt "cinematic close-up, slow push-in" \
  --img-url "https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg" \
  --duration 5

# Poll task status
python3 {baseDir}/scripts/media_gen_client.py video-status --task-id YOUR_TASK_ID

# Wait until success (optional: print video_url on success)
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --poll 10 --timeout 600

# Wait until success and auto-download mp4
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --download --out out.mp4

💡 Use Cases

AI Agents: Automate visual content generation for social media and marketing materials
Content Creators: Generate custom images and videos programmatically
Developers: Build apps with multimodal generation capabilities
Businesses: Lower-cost alternative to Bailian

🚀 Why AIsa for Media Generation?

Unified API: Single key for both images (Gemini) and videos (WAN)
Best Pricing: 25-40% cheaper than Bailian official pricing
Production Ready: Contract pricing and enterprise support available
No Competition: OpenRouter doesn't support video generation
Simple Integration: Python client with async task management built-in

Files

3 total
