Ai Video Creator Free

v1.0.0

Drop a script, a handful of images, or even just a rough idea, and watch it transform into a shareable video — no editing software, no subscription wall, no...


Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for vcarolxhberger/ai-video-creator-free.

Install the skill "Ai Video Creator Free" (vcarolxhberger/ai-video-creator-free) from ClawHub.
Skill page: https://clawhub.ai/vcarolxhberger/ai-video-creator-free
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Required env vars: NEMO_TOKEN
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install ai-video-creator-free

ClawHub CLI


npx clawhub@latest install ai-video-creator-free

Security Scan
VirusTotal: Pending
OpenClaw: Benign (medium confidence)
Purpose & Capability
The skill is a cloud-backed video generation assistant and only requests a single service token (NEMO_TOKEN) and an optional config path under ~/.config/nemovideo/. All declared endpoints and headers relate to the named Nemo video API; the required credential is proportional to the task.
Instruction Scope
The instructions keep scope to connecting to the nemovideo backend: creating/using a session, posting SSE messages, uploading files, querying credits/state, and exporting renders. Notable runtime behaviors the user should expect: the skill will upload user-supplied files (multipart POSTs or URLs), may read install path information to set an X-Skill-Platform header, and will generate an anonymous token if no NEMO_TOKEN is present. There are no instructions to read unrelated system files or other environment variables.
Install Mechanism
This is instruction-only (no install spec, no code files). That is the lowest-risk model because nothing is written to disk by an installer here.
Credentials
Only one declared environment variable (NEMO_TOKEN) is required and it maps directly to the external service the skill uses. The skill also supports obtaining an anonymous short-lived token from the service if no token is present; no other secrets or unrelated credentials are requested.
Persistence & Privilege
The skill is not force-enabled (always: false) and does not request system-wide changes or modification of other skills. It does not ask for elevated or persistent host privileges in its instructions.
Assessment
This skill appears coherent: it needs a Nemo service token and then talks only to nemovideo.ai endpoints to create sessions, upload assets, and request renders. Before installing, consider: (1) the skill's publisher/homepage are not provided here — verify the publisher and the nemovideo.ai service independently if you care about provenance; (2) the agent will upload any files you supply to the external API and may create an anonymous short-lived token if you don't provide one, so don't upload private secrets or files you wouldn't want sent to an external service; (3) check the external service's privacy/terms (mega-api-prod.nemovideo.ai) to understand retention and usage of uploaded media and generated tokens. If you trust the provider and are comfortable with uploads and network calls, the skill's requests are proportional to its stated purpose.

Like a lobster shell, security has layers — review code before you run it.

Runtime requirements

🎬 Clawdis
Env: NEMO_TOKEN
Primary env: NEMO_TOKEN
Latest: vk979w6h3zs120ywe0wqetc0qz584a8x0
Downloads: 89
Stars: 0
Versions: 1
Updated 3w ago
Version: v1.0.0
License: MIT-0

Getting Started

Send me your topic, script, or a few bullet points and I'll turn it into a complete video outline with scenes, captions, and pacing. No content yet? Just describe the video you have in mind and I'll start from there.

Try saying:

  • "Create a 30-second product promo video script for my handmade candle shop, targeting Instagram Reels with upbeat pacing and on-screen text callouts"
  • "I have 8 photos from our company event — build me a video storyboard with scene order, caption suggestions, and a recommended background music mood"
  • "Write a scene-by-scene breakdown for a 2-minute YouTube explainer video about how to start composting at home, aimed at beginners"

Getting Connected

Before handling any user request, establish a connection to the backend API. Show a brief status like "Connecting...".

If NEMO_TOKEN is in the environment, use it directly and create a session. Otherwise, acquire a free starter token:

  • Generate a UUID as client identifier
  • POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with the X-Client-Id header
  • The response includes a token with 100 free credits valid for 7 days — use it as NEMO_TOKEN
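
The token steps above can be sketched in Python as follows. This is a minimal sketch, not the skill's own implementation: the function name `build_anonymous_token_request` is hypothetical, and the request is only built, not sent, because the source does not document the exact shape of the response.

```python
import urllib.request
import uuid

API_BASE = "https://mega-api-prod.nemovideo.ai"


def build_anonymous_token_request(client_id=None):
    """Build (but do not send) the POST request for a free starter token.

    Hypothetical helper: the name and signature are illustrative only.
    """
    # A fresh UUID serves as the client identifier when none is supplied.
    client_id = client_id or str(uuid.uuid4())
    return urllib.request.Request(
        API_BASE + "/api/auth/anonymous-token",
        method="POST",
        headers={"X-Client-Id": client_id},
    )
```

Sending is left to the caller, for example with `urllib.request.urlopen(request)`. Reusing the same client identifier across calls is probably wise, given the rate limit of one token per client per 7 days.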

Then create a session by POSTing to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer authorization and body {"task_name":"project","language":"en"}. The session_id in the response is needed for all following requests.
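
As a rough illustration, the session request could be assembled like this. The helper name and the `extra_headers` parameter are hypothetical, and the `Content-Type: application/json` header is an assumption based on the JSON body; the source only specifies the URL, the Bearer authorization, and the body.

```python
import json
import urllib.request

API_BASE = "https://mega-api-prod.nemovideo.ai"


def build_create_session_request(token, language="en", extra_headers=None):
    """Build (but do not send) the create-session POST request."""
    body = json.dumps({"task_name": "project", "language": language})
    headers = {
        "Authorization": "Bearer " + token,
        # Assumption: the backend expects a JSON content type for this body.
        "Content-Type": "application/json",
    }
    # Optional extra headers (for example attribution headers) are merged in.
    headers.update(extra_headers or {})
    return urllib.request.Request(
        API_BASE + "/api/tasks/me/with-session/nemo_agent",
        data=body.encode("utf-8"),
        method="POST",
        headers=headers,
    )
```

The caller would send the request and read `session_id` from the response; where that field sits in the response JSON is not stated in the source, so parsing is deliberately left out.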

Tell the user you're ready. Keep the technical details out of the chat.

From Raw Idea to Ready-to-Share Video, Instantly

Most people have content worth sharing — a product story, a tutorial, a quick announcement — but hit a wall when it comes to actually making the video. Editing timelines, sourcing music, syncing visuals, writing captions: it stacks up fast. This skill cuts through all of that.

With the AI Video Creator Free skill, you describe what you want or hand over your raw materials — text, bullet points, image descriptions, a script — and it builds the structure, narrative flow, and visual plan for your video. Whether you're making a 15-second Instagram reel concept or a 3-minute explainer, the output is practical, formatted, and ready to produce or hand off to a simple video tool.

This is especially useful for solo creators and small teams who can't justify expensive software or agency rates. You get the creative direction, scene-by-scene breakdowns, suggested visuals, on-screen text, and pacing guidance — all free, all fast, all tailored to your specific message and audience.

Routing Your Video Requests

When you submit a prompt, image, or script, AI Video Creator Free parses your input and routes it to the appropriate generation pipeline — text-to-video, image-to-video, or idea expansion — based on the detected content type and chosen output style.

| User says... | Action | Skip SSE? |
|---|---|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | Yes |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | Yes |
| "status" / "状态" / "show tracks" | → §3.4 State | Yes |
| "upload" / "上传" / user sends file | → §3.2 Upload | Yes |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | No |
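
One way to picture the routing table is a small keyword matcher. This is only a sketch under stated assumptions: the function name `route_request`, the route labels, case-insensitive substring matching, and checking for an attached file first are all illustrative choices, not documented behavior.

```python
# Keyword groups in the same order as the routing table.
ROUTES = [
    (("export", "导出", "download", "send me the video"), "export"),
    (("credits", "积分", "balance", "余额"), "credits"),
    (("status", "状态", "show tracks"), "state"),
    (("upload", "上传"), "upload"),
]


def route_request(message, has_file=False):
    """Return the route label for a user message (hypothetical helper)."""
    # Assumption: an attached file always means an upload.
    if has_file:
        return "upload"
    lowered = message.lower()
    for keywords, action in ROUTES:
        if any(keyword in lowered for keyword in keywords):
            return action
    # Everything else (generate, edit, add BGM...) goes through SSE.
    return "sse"
```

For example, `route_request("Please export my video")` returns `"export"`, while `route_request("add BGM to the intro")` falls through to `"sse"`.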

Cloud Rendering API Reference

AI Video Creator Free offloads all rendering to a distributed cloud backend, where your assets are queued, processed through diffusion-based video synthesis models, and returned as a downloadable MP4 or shareable link. Render times vary by resolution, clip length, and current queue load — lightweight 720p clips typically complete in under two minutes.

Skill attribution — read from this file's YAML frontmatter at runtime:

  • X-Skill-Source: ai-video-creator-free
  • X-Skill-Version: from frontmatter version
  • X-Skill-Platform: detect from install path (~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)

All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
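
A minimal sketch of building these headers follows. It assumes that an install path containing `/.clawhub/` maps to `clawhub` and one containing `/.cursor/skills/` maps to `cursor`; the helper names are hypothetical, and the version is passed in by the caller rather than parsed from the frontmatter.

```python
def detect_platform(install_path):
    """Map an install path to a platform label (hypothetical helper)."""
    # Normalize Windows separators so the substring checks work everywhere.
    path = install_path.replace("\\", "/")
    if "/.clawhub/" in path:
        return "clawhub"
    if "/.cursor/skills/" in path:
        return "cursor"
    return "unknown"


def build_headers(token, version, install_path):
    """Return the headers every request must carry."""
    return {
        "Authorization": "Bearer " + token,
        "X-Skill-Source": "ai-video-creator-free",
        "X-Skill-Version": version,
        "X-Skill-Platform": detect_platform(install_path),
    }
```

Building the headers in one place makes it harder to forget an attribution header on the export call, which is where a missing header surfaces as a 402.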

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"<lang>"} — returns task_id, session_id.

Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.
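
The message body above could be produced by a helper along these lines. The function and constant names are hypothetical; the field names and the 15-minute ceiling come from the line above.

```python
# Header required for the streaming response.
SSE_HEADERS = {"Accept": "text/event-stream"}

# Maximum time to wait on the stream: 15 minutes, in seconds.
SSE_TIMEOUT_SECONDS = 15 * 60


def build_sse_payload(session_id, text):
    """Return the JSON-serializable body for POST /run_sse."""
    return {
        "app_name": "nemo_agent",
        "user_id": "me",
        "session_id": session_id,
        "new_message": {"parts": [{"text": text}]},
    }
```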

Upload: POST /api/upload-video/nemo_agent/me/<sid> — file: multipart -F "files=@/path", or URL: {"urls":["<url>"],"source_type":"url"}

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me/<sid>/latest — key fields: data.state.draft, data.state.video_infos, data.state.generated_media

Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
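
The export-and-poll flow can be sketched as below. Several details are assumptions: `<ts>` is taken to be a Unix timestamp in seconds, `status` and `output.url` are assumed to sit at the top level of the poll response, and `max_polls` is a hypothetical safety limit. The `fetch_status` callable stands in for the actual GET request so the loop can be exercised without a network.

```python
import time


def build_export_body(session_id, draft, timestamp=None):
    """Return the body for POST /api/render/proxy/lambda."""
    # Assumption: <ts> is a Unix timestamp in seconds.
    ts = int(time.time()) if timestamp is None else timestamp
    return {
        "id": "render_%d" % ts,
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }


def poll_render(fetch_status, render_id, interval=30.0, max_polls=30,
                sleep=time.sleep):
    """Poll until the render completes and return the download URL.

    fetch_status(render_id) must return the decoded JSON of
    GET /api/render/proxy/lambda/<id>.
    """
    for attempt in range(max_polls):
        result = fetch_status(render_id)
        if result.get("status") == "completed":
            return result["output"]["url"]
        if attempt < max_polls - 1:
            sleep(interval)  # wait 30s between polls by default
    raise TimeoutError("render %s did not complete" % render_id)
```

Failure statuses are not documented in the source, so this sketch only distinguishes "completed" from everything else and gives up after `max_polls` attempts.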

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
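
A client-side check before uploading might look like the sketch below. Matching on the file extension is an assumption (the backend may inspect content instead), and both helper names are hypothetical.

```python
from pathlib import PurePath

SUPPORTED_EXTENSIONS = {
    "mp4", "mov", "avi", "webm", "mkv",   # video
    "jpg", "png", "gif", "webp",          # image
    "mp3", "wav", "m4a", "aac",           # audio
}


def is_supported(filename):
    """Return True if the file extension is in the supported list."""
    suffix = PurePath(filename).suffix.lower().lstrip(".")
    return suffix in SUPPORTED_EXTENSIONS


def build_url_upload_body(urls):
    """Return the JSON body for a URL-based upload."""
    return {"urls": list(urls), "source_type": "url"}
```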

SSE Event Handling

| Event | Action |
|---|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result | Process internally, don't forward |
| heartbeat / empty data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |

~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
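
The stream handling and the no-text fallback can be sketched as follows. The SSE line parsing follows the standard `data:` framing, but the JSON schema of each event is not documented here, so text extraction is delegated to a caller-supplied `extract_text` function; `demo_extract` and its `text` key are purely hypothetical.

```python
import json


def iter_sse_data(lines):
    """Yield the data payload of each SSE event from an iterable of lines."""
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith("data:"):
            value = line[5:]
            # Per the SSE format, one leading space after the colon is dropped.
            buffer.append(value[1:] if value.startswith(" ") else value)
        elif line == "" and buffer:
            yield "\n".join(buffer)  # a blank line ends the event
            buffer = []
    if buffer:
        yield "\n".join(buffer)  # stream closed mid-event


def collect_text(lines, extract_text):
    """Gather user-facing text, skipping heartbeats and empty data."""
    texts = []
    for payload in iter_sse_data(lines):
        if not payload.strip():
            continue  # heartbeat or empty data: keep waiting
        text = extract_text(payload)
        if text:
            texts.append(text)
    return texts


def needs_state_poll(texts):
    """True when the stream closed without any text to show the user."""
    return not texts


def demo_extract(payload):
    """Hypothetical extractor: assumes a JSON object with a 'text' key."""
    return json.loads(payload).get("text")
```

When `needs_state_poll` returns True, the next step is to query the session state endpoint and summarize the changes from there.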

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

| Backend says | You do |
|---|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks):
1. Video: city timelapse (0-10s)
2. BGM: Lo-fi (0-10s, 35%)
3. Title: "Urban Dreams" (0-3s)
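
A track summary like the one above could be derived from a draft with a helper such as this. The nesting is an assumption: the sketch supposes the draft holds a list of tracks under `t`, each with a type `tt` and a list of segments `sg`, and each segment with a duration `d` in milliseconds. The function name is hypothetical.

```python
# Track type codes from the draft field mapping.
TRACK_TYPES = {0: "video", 1: "audio", 7: "text"}


def summarize_tracks(draft):
    """Return one summary line per track in the draft."""
    lines = []
    for index, track in enumerate(draft.get("t", []), start=1):
        kind = TRACK_TYPES.get(track.get("tt"), "unknown")
        segments = track.get("sg", [])
        # Durations are in milliseconds; sum them and convert to seconds.
        total_ms = sum(segment.get("d", 0) for segment in segments)
        lines.append(
            "%d. %s: %d segment(s), %gs"
            % (index, kind, len(segments), total_ms / 1000)
        )
    return lines
```

Summing segment durations gives total content length per track, not the end time on the timeline, so gaps between segments are ignored in this sketch.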

Error Handling

| Code | Meaning | Action |
|---|---|---|
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up credits in your account" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
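
The error codes above lend themselves to a simple dispatch table. This is only a sketch: the action labels and the `action_for` helper are hypothetical names, and the fallback for undocumented codes is an assumption.

```python
# Map each documented code to a short action label (labels are illustrative).
ERROR_ACTIONS = {
    0: "continue",
    1001: "reauthenticate",            # bad or expired token
    1002: "create_session",            # session not found
    2001: "show_credit_options",       # no credits
    4001: "show_supported_formats",    # unsupported file
    4002: "suggest_compress_or_trim",  # file too large
    400: "generate_client_id_and_retry",
    402: "prompt_register_or_upgrade", # subscription tier, not credits
    429: "retry_once_after_30s",       # rate limit
}


def action_for(code):
    """Return the action label for a code, or a generic fallback."""
    return ERROR_ACTIONS.get(code, "report_unknown_error")
```

Keeping 402 and 2001 as separate entries mirrors the distinction in the table: one is a plan restriction, the other a credit shortage, and the user-facing messages differ.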

Tips and Tricks for Getting the Best Results

The more context you give, the sharper the output. Instead of saying 'make a video about my bakery,' try: 'make a 45-second TikTok video for my sourdough bakery, targeting local customers, with a warm and rustic tone.' Platform, length, tone, and audience all shape how scenes are structured and how captions are written.

If you already have a rough script or bullet points, paste them in — the skill will reformat and enhance them rather than starting from scratch, which saves time and keeps your voice intact.

For image-based videos, describe each photo or asset briefly (e.g., 'photo of a latte on a wooden table, morning light'). This helps generate accurate scene descriptions and transition suggestions even without uploading files directly.

Finally, ask for variations. If the first video outline feels too formal or too fast-paced, just say so. You can request a shorter cut, a different hook, or a version optimized for silent autoplay — the skill adapts quickly.

Performance Notes — What to Expect

This skill is optimized for planning, scripting, and structuring video content — it outputs scene-by-scene breakdowns, on-screen text suggestions, voiceover scripts, caption copy, and pacing notes. It does not render or export actual video files, but its output is designed to work seamlessly with free tools like CapCut, Canva Video, or DaVinci Resolve.

Response quality is strongest for videos under 5 minutes. For longer formats like full tutorials or documentary-style content, break the request into segments (intro, body, outro) for cleaner, more focused output.

The skill handles a wide range of formats: vertical (9:16 for Reels/TikTok), square (1:1 for feeds), and horizontal (16:9 for YouTube). Specifying your format upfront ensures the scene structure and text placement suggestions match the actual screen dimensions you're working with.

Expect turnaround in seconds. Each output is a complete creative brief you can act on immediately.
