Install
openclaw skills install data-pipelinesDeep data pipeline workflow—ingestion, orchestration, idempotency, data quality, SLAs, observability, and lineage. Use when building batch/stream pipelines, debugging job failures, or hardening ETL/ELT.
openclaw skills install data-pipelinesPipelines fail on silent schema drift, partial writes, and unclear ownership. Design for at-least-once delivery, idempotent sinks, and observable stages.
Trigger conditions:
Initial offer:
Use six stages: (1) requirements & SLAs, (2) source contracts, (3) transforms & idempotency, (4) orchestration & dependencies, (5) quality & monitoring, (6) lineage & operations). Confirm batch vs stream and cloud stack.
Goal: Freshness (latency), completeness expectations, cost ceiling, failure tolerance (quarantine vs stop-the-line).
Exit condition: SLA table: pipeline → metric → threshold.
Goal: Schema versioning; CDC vs snapshot pulls; API rate limits.
Goal: Deterministic transforms; upsert keys; partition strategy for rewinds.
Goal: Clear DAG; retry policy; backfill without double counting; SLA miss alerts.
Goal: Data quality checks (null spikes, row bounds, referential checks); metrics on lag, duration, error rate.
Goal: Column-level lineage where valuable; on-call runbook; ownership per pipeline.