Music To Video Ai
Analysis
This instruction-only skill appears purpose-aligned, but it automatically connects to the NemoVideo cloud service, authenticates with a token, and sends user media to that service for remote rendering.
Findings (7)
Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.
Checks for instructions or behavior that redirect the agent, misuse tools, execute unexpected code, cascade across systems, exploit user trust, or continue outside the intended task.
Everything else (generate, edit, add BGM…) | → §3.1 SSE
Broad editing and generation requests are delegated to the remote SSE backend, whose responses are then used to guide the workflow. This is purpose-aligned for cloud video editing but means external service responses influence agent behavior.
/api/upload-video/nemo_agent/me/<sid> | POST | Upload a file (multipart) or URL.
The skill exposes upload, SSE editing, credit checking, state fetching, and render-export API operations. These actions are expected for cloud video generation, but they can send user files and start render jobs.
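To make the data flow concrete, here is a minimal sketch of how an upload URL for the endpoint quoted above might be assembled. Only the path template comes from SKILL.md. The base URL `https://nemo.example.invalid` and the helper name `build_upload_url` are hypothetical, and the sketch sends nothing over the network.

```python
from urllib.parse import quote

# Path template quoted from SKILL.md; <sid> is the session identifier.
UPLOAD_PATH = "/api/upload-video/nemo_agent/me/{sid}"


def build_upload_url(base_url: str, session_id: str) -> str:
    """Return the full upload URL for a session. Nothing is sent.

    base_url is an assumption: the reviewed artifacts do not name a host.
    """
    if not session_id:
        raise ValueError("session_id is required")
    # Percent-encode the session id so it cannot alter the path structure.
    safe_sid = quote(session_id, safe="")
    return base_url.rstrip("/") + UPLOAD_PATH.format(sid=safe_sid)


if __name__ == "__main__":
    # Hypothetical host, used only for illustration.
    print(build_upload_url("https://nemo.example.invalid", "abc123"))
```

Building the URL separately from sending the request gives an agent a natural point to show the user exactly where a file is about to go before any upload happens.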
Source: unknown; Homepage: none
The registry metadata does not provide a source repository or homepage, while the skill relies on a remote backend. This is a provenance transparency gap, though there is no install-time code or hidden dependency in the provided artifacts.
closing the tab before completion orphans the job.
Render jobs are queued on remote GPU nodes and can become orphaned if the session is closed before completion. This is disclosed, but it is a failure mode users should be aware of because it can leave backend work in progress.
Checks whether tool use, credentials, dependencies, identity, account access, or inter-agent boundaries are broader than the stated purpose.
Every API call needs Authorization: Bearer <NEMO_TOKEN>
The skill requires a NemoVideo bearer token and can also obtain an anonymous token for free credits. This is expected for the integrated service but gives the skill access to that service account or credit balance.
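The header format quoted above can be sketched as follows. The `Authorization: Bearer <NEMO_TOKEN>` shape comes from SKILL.md. The helper names `auth_headers` and `redact` are hypothetical, and the redaction step is a suggested precaution that the skill itself does not document.

```python
def auth_headers(token: str) -> dict[str, str]:
    """Build the Authorization header in the form SKILL.md describes."""
    # Reject empty tokens and tokens with whitespace, which would
    # produce a malformed header value.
    if not token or any(ch.isspace() for ch in token):
        raise ValueError("token must be non-empty and contain no whitespace")
    return {"Authorization": f"Bearer {token}"}


def redact(headers: dict[str, str]) -> dict[str, str]:
    """Return a copy of headers that is safe to log (token masked)."""
    return {
        name: ("Bearer ***" if name.lower() == "authorization" else value)
        for name, value in headers.items()
    }


if __name__ == "__main__":
    headers = auth_headers("example-token")  # placeholder, not a real token
    print(redact(headers))
```

Because the token grants access to the service account or credit balance, logging only the redacted copy keeps it out of transcripts and debug output.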
Checks for exposed credentials, poisoned memory or context, unclear communication boundaries, or sensitive data that could leave the user's control.
Store the returned session_id for all subsequent requests.
The skill reuses a server-side session identifier and fetches latest timeline state across requests. This is expected for an editing session but creates persistent task context tied to the token/session.
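One way to keep that persistent context bounded is to hold the session identifier in a small, explicitly clearable object. This is a sketch under assumptions: the class `EditingSession` and its methods are hypothetical and do not appear in the skill. Only the idea of storing `session_id` for later requests comes from SKILL.md.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EditingSession:
    """In-memory holder for the server-issued session_id (hypothetical)."""

    session_id: Optional[str] = None

    def remember(self, session_id: str) -> None:
        """Store the id returned by the service for subsequent requests."""
        if not session_id:
            raise ValueError("session_id must be non-empty")
        self.session_id = session_id

    def require(self) -> str:
        """Return the stored id, or fail loudly if no session is active."""
        if self.session_id is None:
            raise RuntimeError("no active session")
        return self.session_id

    def clear(self) -> None:
        """Drop the id once the task ends so it is not reused later."""
        self.session_id = None


if __name__ == "__main__":
    session = EditingSession()
    session.remember("abc123")
    print(session.require())
    session.clear()
```

Clearing the identifier at the end of a task limits how long the agent carries state tied to the token and session.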
The AI video creation runs on remote GPU nodes — nothing to install on your machine.
The skill sends media and prompts to an external cloud rendering service. This is clearly aligned with the stated purpose, but users should understand that files are processed outside the local environment.
