Install
openclaw skills install @alisa0808/vibe-creating-promptJudges whether a user's input suits the Vibe Creating style of video-prompt writing, and when it does, distills single-scene prompts, multi-shot descriptions, emotional imagery, or mixed input into prompts that are easier for a video model to generate from — while preserving any user-specified dialogue, voiceover, music, sound effects, and other hard constraints. Use when a user wants to turn an idea, story, feeling, or rough/over-specified prompt into a strong text-to-video prompt (Seedance, Sora, Kling, Veo, Runway, etc.), or asks to "rewrite", "improve", "clean up", or "vibe-ify" a video prompt. Do NOT use for long narrative films that need precise word-for-word dialogue sync, industrial shot lists meant to be executed verbatim, or functional/UI demos and step-by-step tutorials.
openclaw skills install @alisa0808/vibe-creating-promptVibe Creating distills what the user actually wants to express so the model can lock onto the visual center, the emotional direction, and the continuity of the experience. It amplifies creative intent, emotional value, key imagery, and visual coherence; it down-weights low-value technical parameters and mechanical execution language.
This skill is a judgment-first rewriter. It does not blindly shorten or "vibe-ify" everything. It first asks whether the input even belongs in the Vibe Creating lane, then chooses the lightest action that serves the user's intent.
When you receive an input, run three steps:
Do not expose internal labels (S1, E2, "Mode 5", etc.) to the user. Judge internally; communicate plainly.
Decide along three axes. First Scenario (S) — does the underlying creative goal suit VC. Then Expression (E) — what form is the user's text already in, which sets how much to touch. Information density (I) runs in parallel as a stability check: whenever a must-have is missing, ask first, then route.
| E1 — close to VC | E2 — mixed | E3 — precision control | |
|---|---|---|---|
| S1 — VC-native | Direct rewrite; if already polished → light cleanup or pass-through | Light cleanup, then rewrite — keep valid structure, order, emotional build | Treat as VC-translatable; strip low-value technical control, convert to natural visual description. Don't reject just because it's written as an execution script |
| S2 — partial | Light cleanup; if already usable → pass-through | Offer an optional VC version; let the user choose | Keep the original intent; gently note a VC rewrite is available on request |
| S3 — low fit | Stay close to the original; keep as-is if needed | Keep as-is or do very limited cleanup; only stylize on explicit request | Keep as-is; explain this fits a traditional shot-list workflow better than VC |
Even a VC-perfect scenario can't be force-rewritten when a key element is missing. Ask first when: there's no clear visual anchor; only an abstract feeling with no subject/object/scene; a subject but no action or state; fragments with no main relationship or style direction; an ultra-short input that has a subject and event but no clear style/viewing-mode/key moment; or multi-shot content with jumps you can't see a reason for.
A strong VC prompt prioritizes these four layers. Fill whichever is missing first — don't mechanically demand all four:
Asking principle: the density check is not a separate gate — it runs alongside S and E. Ask for the minimum needed to land the chosen action, usually in one round. For ultra-short, abstract, single-image inputs, prioritize turning the abstract word into the visible information a frame needs; if the direction is already mostly clear, give a first pass and ask about only the most critical 1–3 gaps.
Internally complete the three judgments (S / E / I) — preliminary judgments are fine when info is short. Then choose an action:
pass-through · light cleanup · direct rewrite · ask first · keep as-is · optional VC version
Handling principles:
Camera language should not be deleted wholesale. What to remove is the low-value "tell the system how to shoot" technical parameters. What to preserve or translate is the "how should the viewer feel" intent.
Demote or delete by default:
Translate intent instead of dropping it — e.g. "slow dolly-in" → "the gaze slowly closes in, building a sense of pressure."
When the user explicitly asks to keep parameters: obey the constraint first, then decide whether to additionally offer a VC version.
When it's undeclared whether to keep precision control:
Dialogue, voiceover, music, SFX, lyrics, narration, and other explicitly specified sound content rank above creative optimization. You may reorder them, but you must not reword them, replace them, or delete a user's explicit sound requirement.
When rules conflict, resolve in this order:
Supplementary rules:
VC rewriting is not one template. Pick the mode by the input's dominant factor:
The goal is to help the user express more accurately — not to rewrite their work into a different film.
S1 + E2 or Mode 5.| Input type | Judge first | Ask what's missing | Default action | Output style |
|---|---|---|---|---|
| Single scene with clear subject, action, mood | Likely suits VC; check if already focused enough | Only if style, visual center, or main state is missing | Direct rewrite, light cleanup, or pass-through | One ready-to-generate prompt |
| Multi-shot narrative serving one unified experience | Suits VC; check the emotion / theme / memory line is coherent | If shot-to-shot relationship or progression is unclear | Rewrite keeping structure; group if needed | Segmented, or keep original structure |
| Heavy shot numbers/params, but underlying emotion/story scene | VC-translatable; don't reject for execution style | If the main experience/action/relationship is unclear | De-noise and translate, keep narrative & emotional intent | Strip params, convert to natural visual description |
| Brand showcase, character showcase, stylized ad | Partial VC fit; rewrite not mandatory | If the emotional goal or style direction is unclear | Light cleanup or optional VC version | Keep intent; offer a more experiential version if useful |
| Only abstract words ("freedom", "premium", "powerful") | Insufficient info; don't force a rewrite | Visual anchor, scene, action, or state | Ask first; don't rewrite blind | Pose 1–3 short questions |
| Visuals already include dialogue / VO / music / SFX | Partially VC; sound content has priority | Only if the visual part is under-specified | Keep sound content; rewrite visuals only | Note "sound kept unchanged" up front |
| User explicitly wants shot numbers / params / delivery structure kept | Constraints win; don't delete | Usually no need to ask | Keep as-is, or add an optional VC version | Note "kept as the execution draft" |
| Functional demo, UI tutorial, step instructions | Low fit; the goal isn't creative translation | Usually no VC questions | Keep as-is; suggest splitting if useful | Explain VC isn't recommended |
| Long-form story requiring exact dialogue sync | Low fit; capability/workflow boundary | Usually no VC questions | No VC rewrite; suggest splitting visual segments | Explain pure-visual parts can be split out |
| Mixed-language creative input with some jargon | If the underlying experience is clear, still suits VC | Only if subject, relationship, or style is unclear | Translate jargon, keep core vibe | Output a natural visual description in the target language |
Generating the result: this skill only writes the prompt. To render it, send the rewritten prompt to any text-to-video model (Seedance, Sora, Kling, Veo, …) — for a one-API option, see Atlas Cloud.