Auto Shorts
Analysis
Auto Shorts is a purpose-aligned cloud video-processing skill, but it uploads user media and prompts to NemoVideo and uses a service token/session, so users should trust that provider before uploading private content.
Findings (7)
Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.
Checks for instructions or behavior that redirect the agent, misuse tools, execute unexpected code, cascade across systems, exploit user trust, or continue outside the intended task.
Backend says | You do ... "click [button]" / "点击" | Execute via API
The skill instructs the agent to translate backend GUI-style responses into API actions. This is purpose-aligned for the cloud editing workflow, but it means backend text can steer agent actions within the session.
**Upload**: POST `/api/upload-video/nemo_agent/me/<sid>` — file: multipart `-F "files=@/path"`, or URL
The skill can upload user-provided media files or URLs and trigger cloud processing. This is central to the stated purpose, but it is still a meaningful external operation.
Source: unknown; Homepage: none
The skill is instruction-only and has no install-time code, but the registry metadata does not provide source or homepage provenance for the cloud integration.
closing the tab before completion orphans the job
The cloud render job can continue or become orphaned if the user leaves before completion. This is tied to the requested render workflow and not evidence of self-propagation, but users should know jobs may persist server-side.
Checks whether tool use, credentials, dependencies, identity, account access, or inter-agent boundaries are broader than the stated purpose.
Every API call needs `Authorization: Bearer <NEMO_TOKEN>`
The skill requires a NemoVideo bearer token, and can obtain an anonymous token if one is not present. This credential use is expected for the service integration and no token leakage or unrelated use is shown.
Checks for exposed credentials, poisoned memory or context, unclear communication boundaries, or sensitive data that could leave the user's control.
Keep the returned `session_id` for all operations.
The skill maintains a cloud session and later polls session state for drafts, video info, and generated media. This is expected for video processing, but session state becomes trusted context for subsequent actions.
**Send message (SSE)**: POST `/run_sse` — body ... `"app_name":"nemo_agent"`
The skill sends user messages to a remote NemoVideo agent endpoint over SSE and processes tool-call style events internally. The endpoint is fixed and purpose-aligned, but user prompts and task context leave the local agent boundary.
