Skill flagged — suspicious patterns detected

ClawHub Security flagged this skill as suspicious. Review the scan results before using.

senseaudio-game-npc-director

v1.0.1

Use when a game, interactive story, or virtual world needs reusable NPC voice behavior, including fixed voice identity, catchphrases, relationship-aware dial...

by Wu Ruixiao (@kikidouloveme79)

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for kikidouloveme79/senseaudio-game-npc-director.

Prompt preview: Install & Setup
Install the skill "senseaudio-game-npc-director" (kikidouloveme79/senseaudio-game-npc-director) from ClawHub.
Skill page: https://clawhub.ai/kikidouloveme79/senseaudio-game-npc-director
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install senseaudio-game-npc-director

ClawHub CLI

Package manager switcher

npx clawhub@latest install senseaudio-game-npc-director
Security Scan
VirusTotal: Benign

OpenClaw: Suspicious (high confidence)
Purpose & Capability
The scripts and SKILL.md implement NPC voice generation, ASR, TTS, and Feishu delivery, which matches the skill description. However, the registry metadata lists no required env vars or config paths, even though the implementation clearly expects API keys, platform tokens, and host helper modules.
Instruction Scope
Runtime instructions and scripts call external services (SenseAudio ASR/TTS endpoints, Feishu open API), transcode and upload audio, and load helper modules from parent _shared directories and another skill (audioclaw-skills-voice-reply). They also read workspace/config files via audioclaw_paths. The SKILL.md and registry do not disclose these file/config dependencies or the exact credentials used, giving the agent access to external networks and workspace configs beyond what's declared.
Install Mechanism
No install spec is provided (instruction + scripts only), so nothing arbitrary is downloaded at install time. This lowers install-time risk. The code will run at runtime and import helper modules from the host environment.
Credentials
The code uses SENSEAUDIO_API_KEY, SENSEAUDIO_PLATFORM_TOKEN, SENSEAUDIO_ASR_MODEL (and an api-key-env argument), and expects Feishu app credentials/config (app_id/app_secret) via a workspace config — none of these were declared in the registry metadata. Requiring platform API keys and tenant credentials is normal for ASR/TTS and Feishu features, but the omission in metadata is an incoherence that could lead to surprise credential usage or accidental credential leakage.
Persistence & Privilege
The skill is not marked always:true and does not modify other skills' configurations. It does import shared modules from parent directories, but there is no evidence it attempts to persist itself or alter system-wide agent settings.
What to consider before installing
Before installing or enabling this skill, verify the following:

  1. Confirm which environment variables and config files you must provide. The code expects SENSEAUDIO_API_KEY and/or SENSEAUDIO_PLATFORM_TOKEN, an ASR model env var, and a Feishu app_id/app_secret stored in a workspace config, but the registry metadata lists none.
  2. Ask the publisher where the _shared helper modules come from (senseaudio_env, audioclaw_paths, senseaudio_api_guard) and the audioclaw-skills-voice-reply Feishu helper; these are not included and will be imported from the host environment.
  3. Understand that the skill will send audio to external endpoints (api.senseaudio.cn, platform.senseaudio.cn, open.feishu.cn) and will upload and transcode audio. Do not supply sensitive audio or credentials unless you trust the endpoints and code.
  4. If you cannot confirm the provenance of the missing helpers and the required credentials, run the skill only in an isolated environment or decline installation.

If you decide to proceed, ask the maintainer to update the registry metadata to declare the required env vars and config paths, and to include or document any dependent helper packages.

Like a lobster shell, security has layers — review code before you run it.

latest: vk971c7zwcf63xxhq8y8qjh7zad83c87b
227 downloads · 0 stars · 2 versions
Updated 4h ago
v1.0.1 · MIT-0

AudioClaw Game NPC Director

What this skill is for

This skill is for building low-cost, high-immersion voice assets for games and interactive worlds.

It treats voice as part of the world model, not just a final rendering step.

You can use it to give each NPC:

  • a fixed voice
  • a role or class identity
  • catchphrases
  • relationship-aware tone shifts
  • event-based spoken lines
  • ASR-driven reactions to what the player actually says
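
A minimal sketch of such an NPC profile, assuming illustrative field names (the actual schema consumed by the bundled scripts is not documented here):

```python
# Hypothetical NPC profile; the field names are illustrative,
# not the schema the skill's scripts actually require.
npc_profile = {
    "name": "Harbor Archivist",
    "role": "quest_giver",
    "world": "fog-harbor",
    "speaking_style": "terse, formal",
    "catchphrase": "The ledgers never lie.",
    "voice_id": "vc-example-0001",  # fixed voice identity for this NPC
    "relationship": "neutral",      # stranger / neutral / trusted / close ally
}

# Every generated line reuses the same voice_id, so the NPC's
# identity stays stable across quests and events.
assert npc_profile["voice_id"].startswith("vc-")
```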

Strong use cases

1. Quest and task broadcasters

Generate:

  • new quest lines
  • reminder lines
  • completion lines
  • failure or delay lines

with one consistent NPC voice.

2. Relationship-aware NPC dialogue

Use the same NPC voice but adjust line style based on:

  • stranger
  • neutral
  • trusted
  • close ally

This makes the world feel reactive without needing fully hand-authored voice libraries.
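
One rough sketch of how those relationship states could steer wording (the tone modifiers below are invented for illustration; only the relationship labels come from this README):

```python
# Map relationship state to a wording hint; the modifier strings are
# illustrative, not values the skill's scripts consume.
TONE = {
    "stranger": "guarded, formal address",
    "neutral": "polite, businesslike",
    "trusted": "warm, uses the player's name",
    "close ally": "familiar, drops honorifics",
}

def style_line(base_line: str, relationship: str) -> str:
    """Prefix a generation hint; the NPC voice identity never changes."""
    return f"[{TONE[relationship]}] {base_line}"

print(style_line("The north gate is closed.", "trusted"))
# prints: [warm, uses the player's name] The north gate is closed.
```

The point of the sketch: relation state only varies the wording directive, while the voice_id stays fixed.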

3. Player voice intake

Use AudioClaw ASR to transcribe a player's spoken line, then generate a relation-aware NPC reply.

This is the bridge from static voiced assets to interactive voiced worlds.

4. Dynamic world event announcements

Generate voiced lines for:

  • invasion warnings
  • weather changes
  • market events
  • faction alerts
  • town broadcasts

5. Worldbuilding narration

Generate short lore or ambient narration using one narrator voice or one faction-specific voice.

Workflow

  1. Define the NPC profile:
    • name
    • role
    • world
    • speaking style
    • catchphrase
    • default voice_id
  2. Choose one of two paths:
    • scene-first: define an event and generate NPC lines directly
    • player-first: transcribe player audio with scripts/senseaudio_asr.py, then build NPC reply lines from the transcript
  3. Define the current scene:
    • event type
    • player relationship
    • player state
    • objective
  4. Run either scripts/build_npc_scene_manifest.py or scripts/build_npc_reply_from_player.py.
  5. Review the generated lines.
  6. Run scripts/batch_tts_scene.py with the fixed voice_id.
    • If you already created a clone on the AudioClaw platform, use that prepared clone voice_id.
    • A prepared cloned voice id commonly looks like vc-..., and can be passed directly with --clone-voice-id.
    • This skill already uses streaming TTS internally and now records stream chunk metadata.
    • If the chosen voice is a clone id like vc-..., scene synthesis now auto-routes to SenseAudio-TTS-1.5.
  7. If the user wants to hear the NPC lines directly in Feishu or AudioClaw, run scripts/send_npc_scene_to_feishu.py, or add --send-feishu-audio to scripts/run_player_voice_npc_pipeline.py.
    • This step reuses the same Feishu audio delivery path as the dedicated voice-reply skill.
    • It transcodes the generated .mp3 lines into .ogg/.opus and sends them one by one as real audio messages.
    • scripts/run_player_voice_npc_pipeline.py can now take either --input-audio or --input-text, so ongoing NPC dialogue does not need to drop back to text just because the player typed instead of speaking.
    • If the user enters an ongoing NPC dialogue mode, treat voice delivery as the default unless the user explicitly asks for text-only replies.
  8. Attach the resulting assets to your runtime, editor tooling, or content review flow.
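
The scene-first path above could be driven by a small wrapper that assembles the script invocations. Script names come from this README's Resources list; every flag shown is an assumption for illustration, not verified against the scripts:

```python
# Build (but do not run) the command lines for the scene-first workflow.
# Script names are from this skill's README; flag names are assumed.
def scene_first_commands(profile_path: str, scene_path: str, voice_id: str):
    build = ["python", "scripts/build_npc_scene_manifest.py",
             "--profile", profile_path, "--scene", scene_path]
    tts = ["python", "scripts/batch_tts_scene.py",
           "--clone-voice-id", voice_id]
    send = ["python", "scripts/send_npc_scene_to_feishu.py"]
    return [build, tts, send]

for cmd in scene_first_commands("npc.json", "scene.json", "vc-example-0001"):
    print(" ".join(cmd))
```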

AudioClaw Trigger Pattern

Use this skill as a mode-based session.

Recommended user trigger:

Enter NPC mode, using $senseaudio-game-npc-director.
NPC: 雾港档案官阿砚 (the Fog Harbor archivist, A-Yan)
Relationship: trusted
Location: North Dock
Objective: recover the missing ledger
clone voice_id: your_clone_voice_id
From now on I will send voice messages; always reply in character per this setup.

After mode entry, the agent should keep session state with:

  • npc identity
  • relationship
  • location
  • objective
  • chosen voice_id
  • reply mode, defaulting to voice

For each new player turn:

  1. If the input is audio, run scripts/run_player_voice_npc_pipeline.py --input-audio ....
  2. If the input is text, still run scripts/run_player_voice_npc_pipeline.py --input-text ... so the reply stays on the same voice pipeline.
  3. In ongoing NPC dialogue mode, default to --send-feishu-audio so the generated NPC lines are sent one by one as Feishu audio messages.
  4. Only fall back to text-first replies if the user explicitly asks for text-only output or the channel cannot play voice.
  5. If the user says "直接发语音" ("send voice directly") or "一条一条发 NPC 语音" ("send the NPC voice lines one by one"), keep the same voice mode and continue sending audio without asking again.
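
The per-turn dispatch above amounts to a small decision function. A sketch, with the flag names taken from this README and the mode parameters assumed:

```python
def build_turn_command(input_kind: str, payload: str,
                       text_only: bool = False, in_npc_mode: bool = True):
    """Return the argv for one player turn, per the rules above."""
    cmd = ["python", "scripts/run_player_voice_npc_pipeline.py"]
    if input_kind == "audio":
        cmd += ["--input-audio", payload]  # step 1: audio input
    else:
        cmd += ["--input-text", payload]   # step 2: text stays on the voice pipeline
    if in_npc_mode and not text_only:
        cmd.append("--send-feishu-audio")  # step 3: voice delivery by default
    return cmd

# Voice delivery is the default; text-only is the explicit opt-out (step 4).
assert "--send-feishu-audio" in build_turn_command("text", "hello")
```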

NPC mode should be sticky inside the same session:

  • Keep using the same NPC identity, relationship, location, objective, and voice settings for every following turn
  • Keep voice reply as the default until the user explicitly says to exit NPC mode or switch back to text replies

If the user asks to switch voice, only swap the configured voice_id; keep the same NPC profile and relationship state.

Design rules

  • Keep one NPC tied to one stable voice wherever possible.
  • Let emotion and relation change the wording, not the identity.
  • Use short lines for reactive NPC speech and system announcements.
  • For player voice loops, make ASR intake deterministic before adding deeper agent logic.
  • If you want faster perceived NPC response generation, use stream ASR for the player-input leg.
  • Treat cloned voices or exclusive voices as drop-in replacements for the same workflow.
  • Official clone support is a two-step chain:
    • create the clone on the AudioClaw platform first
    • then use the prepared clone voice_id here
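
The clone routing mentioned in the workflow (a vc-... voice_id auto-routes to SenseAudio-TTS-1.5) can be sketched as a prefix check; the non-clone fallback model name below is an assumed placeholder:

```python
def pick_tts_model(voice_id: str) -> str:
    """Clone ids (vc-...) auto-route to SenseAudio-TTS-1.5, per the
    workflow notes; the fallback model name is illustrative only."""
    if voice_id.startswith("vc-"):
        return "SenseAudio-TTS-1.5"
    return "senseaudio-tts-default"  # assumed placeholder, not a documented model

assert pick_tts_model("vc-abc123") == "SenseAudio-TTS-1.5"
```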

API key lookup

For the NPC generation side of this skill:

  • TTS-oriented scripts now default to SENSEAUDIO_API_KEY

Practical rule:

  • scripts/batch_tts_scene.py and scripts/run_player_voice_npc_pipeline.py now default to SENSEAUDIO_API_KEY
  • If the host app injects SENSEAUDIO_API_KEY as a login token such as v2.public..., the shared bootstrap replaces it with the real sk-... value from ~/.audioclaw/workspace/state/senseaudio_credentials.json before the TTS stage starts
  • The ASR scripts keep their own existing defaults and are intentionally not changed here
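
The bootstrap behavior described above could look roughly like this. Only the credential path and the token prefixes (v2.public..., sk-...) come from this README; the JSON field name `api_key` is an assumption:

```python
import json
import os
from pathlib import Path

# Path is stated in this README; the file's schema is not, so the
# "api_key" field below is an assumption.
CRED_PATH = Path.home() / ".audioclaw/workspace/state/senseaudio_credentials.json"

def resolve_api_key() -> str:
    """If the injected key is a login token (v2.public...), swap in the
    real sk-... key from the workspace credential file."""
    key = os.environ.get("SENSEAUDIO_API_KEY", "")
    if key.startswith("v2.public") and CRED_PATH.exists():
        data = json.loads(CRED_PATH.read_text())
        key = data.get("api_key", key)  # field name assumed
    return key
```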

Resources

  • scripts/build_npc_scene_manifest.py
    • Builds scene lines from an NPC profile and game state
  • scripts/senseaudio_asr.py
    • Calls AudioClaw ASR using the official open API host or the official platform endpoint
    • Defaults to the official sense-asr-deepthink model
  • scripts/build_npc_reply_from_player.py
    • Turns a player transcript into intent-aware NPC reply lines
  • scripts/run_player_voice_npc_pipeline.py
    • Runs the full player input pipeline end to end
    • Supports --input-audio, --input-text, --stream-asr, --clone-voice-id, and --send-feishu-audio
  • scripts/batch_tts_scene.py
    • Synthesizes all scene lines with one fixed voice
  • scripts/send_npc_scene_to_feishu.py
    • Reuses the Feishu voice delivery path to send generated NPC lines one by one as audio messages
  • references/npc_voice_design.md
    • Patterns for worldbuilding, relation states, and event announcements
  • references/asr_player_loop.md
    • Official ASR findings and the recommended player voice pipeline
