Data Pipeline Toolkit

Security checks across malware telemetry and agentic risk

Overview

The skill describes a broad data-pipeline tool but ships no pipeline code while instructing agents to run a local script and use database/cloud credentials, so it needs review before use.

Treat this as incomplete until the actual pipeline.sh implementation and install source are available for review. If you still use it, run commands manually, verify the script path, use test data first, and provide only narrowly scoped credentials.

VirusTotal

VirusTotal findings are pending for this skill version.

View on VirusTotal

Risk analysis

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

#
ASI04: Agentic Supply Chain Vulnerabilities
Medium
What this means

A user or agent could end up running unreviewed or unintended local code while believing it is part of the installed skill.

Why it was flagged

The central workflow depends on a local pipeline.sh helper, but the provided manifest contains only SKILL.md and _meta.json with no install spec or code file for that helper. Running this command would execute whatever local script exists rather than reviewed package code.

Skill content
./pipeline.sh create my-pipeline
Recommendation

Do not run the quick-start commands unless you know where pipeline.sh comes from. The skill should include the helper script or a pinned, auditable install source and declare its runtime requirements.

#
ASI03: Identity and Privilege Abuse
Low
What this means

ETL use may require credentials that can read or modify important databases, APIs, or cloud storage locations.

Why it was flagged

The example expects a database connection string, and the skill lists databases, APIs, S3, Google Cloud Storage, Kafka, and SQS as sources or destinations, but the registry metadata declares no required credentials or environment variables.

Skill content
./pipeline.sh load my-pipeline postgres --connection $DB_URL
Recommendation

Use narrowly scoped credentials, confirm the exact source and destination before running a pipeline, and avoid giving production credentials to an unreviewed helper.

#
ASI10: Rogue Agents
Low
What this means

A pipeline could keep running on a schedule and continue accessing or moving data after the user’s immediate request is complete.

Why it was flagged

Scheduling is an expected ETL feature, but cron-based or event-driven jobs can continue running after the initial task if not explicitly reviewed and removed.

Skill content
- **Scheduling** - Cron-based or event-driven
Recommendation

Only enable schedules intentionally, document where they are created, and provide a clear way to list, pause, and delete scheduled jobs.