Install
openclaw skills install seedance-v2

Seedance 2.0 Pro on RunComfy. Seedance 2.0 Pro (ByteDance Seedance v2) is a multi-modal cinematic short-form video model with native lip-sync audio. This skill calls Seedance 2.0 Pro through the RunComfy CLI: `runcomfy run bytedance/seedance-v2/pro`. Seedance 2.0 Pro accepts up to 9 image references, 3 video references, and 3 audio references in one Seedance call, producing 4–15 second cinematic clips at up to 720p. Triggers on "seedance", "seedance 2", "seedance v2", "seedance pro", "seedance 2.0", "ByteDance Seedance", or any explicit ask to generate video with Seedance.
Seedance 2.0 Pro is ByteDance's multi-modal cinematic short-form video model. This skill generates video with Seedance 2.0 Pro hosted on the RunComfy Model API: no Seedance API key, no GPU rental, just `runcomfy run bytedance/seedance-v2/pro` from your terminal.
Seedance 2.0 Pro is the second-generation Seedance model from ByteDance, designed for cinematic short-form video with three properties that make Seedance distinct:
Seedance 2.0 Pro generates 4–15 second clips at 480p or 720p, in 7 aspect ratios. Seedance prompts accept Chinese (≤500 chars) or English (≤1000 words).
Pick Seedance 2.0 Pro when any of these is true:
`seed` for deterministic Seedance output.

If the user said "Seedance" / "Seedance 2" / "Seedance Pro" / "Seedance v2" / "ByteDance Seedance" explicitly, route here regardless.
npm i -g @runcomfy/cli

`runcomfy login` opens a browser device-code flow.

Set `RUNCOMFY_TOKEN=<token>` instead of running `runcomfy login`.

bytedance/seedance-v2/pro

This is the Seedance 2.0 Pro endpoint. The Seedance Lite tier and earlier Seedance versions run on different endpoints not covered here.
| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| `prompt` | string | yes | none | Seedance accepts CN ≤ 500 chars OR EN ≤ 1000 words. |
| `image_url` | array | no | `[]` | 0–9 image references for Seedance (JPEG/PNG/WebP/BMP/TIFF/GIF). |
| `video_url` | array | no | `[]` | 0–3 reference clips for Seedance (MP4/MOV), 2–15s each. |
| `audio_url` | array | no | `[]` | 0–3 reference audio for Seedance (WAV/MP3), 2–15s, < 15MB each. |
| `aspect_ratio` | enum | no | `adaptive` | `adaptive`, `16:9`, `9:16`, `4:3`, `3:4`, `1:1`, `21:9`. |
| `duration` | int | no | 5 | 4–15 (whole seconds). Seedance per-call cap is 15s. |
| `resolution` | enum | no | `720p` | `480p` or `720p`. Seedance Pro tier max is 720p. |
| `generate_audio` | bool | no | `true` | In-pass synchronized speech / SFX / music from Seedance. |
| `seed` | int | no | none | Reproducibility for Seedance output. |
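The limits in the schema table can be checked locally before a call is submitted. The following is a minimal Python sketch, not part of the RunComfy CLI: `validate_input` is a hypothetical helper, and it covers only the limits listed in the table. It does not check prompt length, and it cannot inspect the duration or size of a reference file behind a URL.

```python
ASPECT_RATIOS = {"adaptive", "16:9", "9:16", "4:3", "3:4", "1:1", "21:9"}
RESOLUTIONS = {"480p", "720p"}
# Maximum number of references per field, taken from the schema table.
REF_LIMITS = {"image_url": 9, "video_url": 3, "audio_url": 3}


def validate_input(payload: dict) -> list[str]:
    """Return a list of problems found; an empty list means none were found."""
    errors = []

    prompt = payload.get("prompt")
    if not isinstance(prompt, str) or not prompt.strip():
        errors.append("prompt is required and must be a non-empty string")

    for field, limit in REF_LIMITS.items():
        refs = payload.get(field, [])
        if not isinstance(refs, list):
            errors.append(f"{field} must be an array")
        elif len(refs) > limit:
            errors.append(f"{field} accepts at most {limit} references")

    duration = payload.get("duration", 5)
    # bool is a subclass of int in Python, so it is excluded explicitly.
    if isinstance(duration, bool) or not isinstance(duration, int) or not 4 <= duration <= 15:
        errors.append("duration must be a whole number of seconds from 4 to 15")

    if payload.get("aspect_ratio", "adaptive") not in ASPECT_RATIOS:
        errors.append("aspect_ratio is not one of the supported values")

    if payload.get("resolution", "720p") not in RESOLUTIONS:
        errors.append("resolution must be 480p or 720p")

    if not isinstance(payload.get("generate_audio", True), bool):
        errors.append("generate_audio must be a boolean")

    seed = payload.get("seed")
    if seed is not None and (isinstance(seed, bool) or not isinstance(seed, int)):
        errors.append("seed must be an integer")

    return errors
```

Catching these locally avoids a round trip that would otherwise end in a schema-mismatch error from the CLI.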
Default Seedance run (text only, 5s, 720p, with audio):
runcomfy run bytedance/seedance-v2/pro \
--input '{"prompt": "<Seedance prompt>"}' \
--output-dir <absolute/path>
Seedance lip-synced ad with character image reference:
runcomfy run bytedance/seedance-v2/pro \
--input '{
"prompt": "Medium close-up. The woman explains today'\''s special in a warm friendly tone, slow push-in, soft window light, gentle cafe ambience.",
"image_url": ["https://.../barista-headshot.jpg"],
"duration": 8,
"aspect_ratio": "9:16"
}' \
--output-dir <absolute/path>
Multi-modal Seedance call (image + video + audio refs):
runcomfy run bytedance/seedance-v2/pro \
--input '{
"prompt": "Subject from image 1 walks through the café from video 1, voice tone matches audio 1.",
"image_url": ["https://.../subject.jpg"],
"video_url": ["https://.../cafe-locked-shot.mp4"],
"audio_url": ["https://.../voice-ref.mp3"]
}' \
--output-dir <absolute/path>
The CLI submits the Seedance request, polls every 2s, fetches the Seedance result, and downloads any *.runcomfy.net / *.runcomfy.com URL into --output-dir.
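When the call is driven from a script rather than typed by hand, the same invocation can be assembled as an argument list. This is a sketch in Python under stated assumptions: `build_command` and `run_seedance` are hypothetical helper names, and only the `--input` and `--output-dir` flags shown in the examples above are used.

```python
import json
import subprocess

MODEL = "bytedance/seedance-v2/pro"


def build_command(payload: dict, output_dir: str) -> list[str]:
    """Build the argv list for `runcomfy run` with the flags shown above."""
    return [
        "runcomfy", "run", MODEL,
        "--input", json.dumps(payload, ensure_ascii=False),
        "--output-dir", output_dir,
    ]


def run_seedance(payload: dict, output_dir: str) -> int:
    """Run the CLI and return its exit code (requires runcomfy on PATH)."""
    # A list without shell=True hands the JSON to the CLI as one argument,
    # so apostrophes in the prompt need no shell escaping.
    return subprocess.run(build_command(payload, output_dir)).returncode
```

Serializing with `json.dumps` removes the need for the `'\''` quoting used in the shell examples.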
Seedance 2.0 Pro responds to specific prompting patterns better than naive prose. Apply these for sharper Seedance output.
Image vs text division: the single most important Seedance rule. Stable identity (face, costume, brand mark, logo): put it in `image_url` so Seedance preserves it. Evolving narrative (action, mood, lighting, camera): put it in `prompt` so Seedance generates it. Trying to verbally describe a face in detail wastes Seedance tokens and produces drift.
Camera + motion in plain language. Seedance 2.0 Pro understands "medium close-up", "slow push-in", "handheld follow", "locked-off wide" as real directives. Combine: "Medium close-up. Slow push-in over 3 seconds. Handheld, slight breathing motion." Seedance executes the camera grammar.
Audio direction with `generate_audio: true`: tell Seedance the tone: "warm friendly conversational", "calm instructional", "crisp newsroom delivery". For ambient: "gentle cafe chatter, distant traffic, no foreground music". Seedance will synthesize audio matching the directive.
Seedance reference media specs. Reference videos must be 2–15s; reference audio must be ≤15MB and 2–15s. Out-of-range files are rejected. Match the aspect ratio of refs to the Seedance output to avoid crops.
Seedance anti-patterns:
`image_url` instead.

| Use case | Why Seedance 2.0 Pro |
|---|---|
| Spokesperson / dialogue ads | Seedance native in-pass lip-sync, no separate TTS step |
| Brand-consistent multi-language narratives | Seedance image refs hold identity; text drives translation |
| Cinematic short-form film previs | Seedance camera-shot grammar + multi-modal refs |
| Ad creatives with reference music / VO tone | Seedance audio refs guide voice / mood |
| Reproducible Seedance variant testing | Seedance seed control + fixed schema |
Default Seedance playground example:
Golden hour on a quiet cafe terrace: a barista wipes the counter, then
looks up and explains today's special in a friendly tone, natural
lip-sync. Medium close-up, slow push-in; warm side light, soft bokeh
through glass, gentle cafe ambience and subtle film grain.
Multi-modal Seedance lip-sync (text + image):
Same person as image 1 in a softly-lit recording booth, leaning into
the mic, says: "We just shipped the biggest update of the year."
Calm conversational tone. Medium close-up, locked tripod, shallow DOF,
warm key light from camera-left.
What's the max Seedance clip duration? A single Seedance 2.0 Pro call generates 4–15 seconds. For longer narratives, segment into multiple Seedance calls and stitch the outputs.
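Segmenting a longer narrative means choosing per-call durations that each stay inside the 4–15 second range. One way to do that is sketched below in Python; `plan_segments` is a hypothetical helper, and splitting into near-equal whole-second segments is an assumption, not a documented Seedance recommendation.

```python
MIN_SECONDS, MAX_SECONDS = 4, 15  # per-call range stated above


def plan_segments(total_seconds: int) -> list[int]:
    """Split a total duration into the fewest near-equal segments of 4 to 15 s."""
    if total_seconds < MIN_SECONDS:
        raise ValueError(f"total must be at least {MIN_SECONDS} seconds")
    # Integer ceiling division: fewest calls that can cover the total.
    count = -(-total_seconds // MAX_SECONDS)
    base, extra = divmod(total_seconds, count)
    # The first `extra` segments get one additional second each.
    return [base + 1] * extra + [base] * (count - extra)
```

For example, `plan_segments(40)` yields three segments of 14, 13, and 13 seconds, each of which can be passed as `duration` in its own call.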
What aspect ratios does Seedance 2.0 Pro support? Seven: adaptive, 16:9, 9:16, 4:3, 3:4, 1:1, 21:9. Seedance defaults to adaptive (matches input refs).
Does Seedance 2.0 Pro do lip-sync? Yes. With generate_audio: true (default), Seedance produces lip-synced speech in-pass. The lip movement on Seedance output is timed to the spoken words.
Can Seedance take an existing audio file as input? Yes: pass it as `audio_url`. Seedance treats it as a reference (voice tone, mood) rather than a strict lip-sync driver. For audio-driven lip-sync to a literal voiceover, route to a different model.
What languages does Seedance 2.0 Pro accept? Chinese (≤500 chars) or English (≤1000 words) prompts. Seedance output language follows the prompt.
What's the Seedance resolution ceiling? 720p on the Seedance Pro tier here. 4K Seedance variants run on different endpoints not covered by this skill.
How do I get reproducible Seedance output? Pass seed as a fixed int. Same Seedance prompt + same seed = same Seedance generation.
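For variant testing, a fixed seed lets one field change while everything else stays constant. The Python sketch below builds such a set of payloads; `pinned_variants` is a hypothetical helper, and the field names come from the input schema.

```python
import copy


def pinned_variants(base_payload: dict, seed: int, field: str, values: list) -> list[dict]:
    """Return one payload per value, each with the same seed and one field changed."""
    variants = []
    for value in values:
        payload = copy.deepcopy(base_payload)  # leave the caller's dict untouched
        payload["seed"] = seed
        payload[field] = value
        variants.append(payload)
    return variants
```

Calling `pinned_variants(base, 42, "aspect_ratio", ["16:9", "9:16"])` gives two payloads that differ only in aspect ratio, so differences in the output can be attributed to that field.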
@-syntax for character binding in Seedance - relies on image refs + prompt alignment.

| code | meaning |
|---|---|
| 0 | Seedance generation succeeded |
| 64 | bad CLI args |
| 65 | bad input JSON for Seedance / schema mismatch |
| 69 | upstream 5xx |
| 75 | retryable: timeout / 429 |
| 77 | not signed in or token rejected |
Full reference: docs.runcomfy.com/cli/troubleshooting.
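A wrapper script can act on these exit codes. The sketch below, in Python, retries only on code 75, the one the table marks as retryable; treating 69 as non-retryable is an assumption, as are the helper names, the attempt count, and the backoff delays.

```python
import time

# Meanings copied from the exit-code table above.
EXIT_MEANINGS = {
    0: "Seedance generation succeeded",
    64: "bad CLI args",
    65: "bad input JSON for Seedance / schema mismatch",
    69: "upstream 5xx",
    75: "retryable: timeout / 429",
    77: "not signed in or token rejected",
}
RETRYABLE = {75}


def describe(code: int) -> str:
    """Return the documented meaning of an exit code, if there is one."""
    return EXIT_MEANINGS.get(code, f"undocumented exit code {code}")


def run_with_retry(run_once, max_attempts=3, base_delay=2.0, sleep=time.sleep):
    """Call run_once (a zero-argument callable returning an exit code),
    retrying with exponential backoff while the code is retryable."""
    code = run_once()
    attempt = 1
    while code in RETRYABLE and attempt < max_attempts:
        sleep(base_delay * 2 ** (attempt - 1))  # 2s, 4s, ...
        code = run_once()
        attempt += 1
    return code
```

Here `run_once` would typically wrap a `subprocess.run([...]).returncode` call for the `runcomfy run` command; the `sleep` parameter exists so the delay can be replaced in tests.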
The skill invokes `runcomfy run bytedance/seedance-v2/pro` with a JSON body matching the Seedance schema. The CLI POSTs to https://model-api.runcomfy.net/v1/models/bytedance/seedance-v2/pro, polls the Seedance request, fetches the Seedance result, and downloads any `*.runcomfy.net` / `*.runcomfy.com` URL into `--output-dir`. Ctrl-C cancels the remote Seedance request before exit.
`runcomfy login` writes the API token to `~/.config/runcomfy/token.json` with mode 0600.

`--input`. No shell injection.

`model-api.runcomfy.net` and `*.runcomfy.net` / `*.runcomfy.com`.