Install
openclaw skills install tracingDeep distributed tracing workflow—instrumentation boundaries, context propagation, sampling, tail-based analysis, service maps, and using traces for latency debugging. Use when adopting OpenTelemetry, debugging microservices, or tuning P99 latency.
openclaw skills install tracingTraces answer which hop consumed time and where errors surfaced across services. Success requires consistent propagation, meaningful spans, and sampling that preserves signal without bankrupting storage.
Trigger conditions:
Initial offer:
Use six stages: (1) define goals & SLOs, (2) instrumentation plan, (3) propagation & context, (4) sampling strategy, (5) analysis workflows, (6) governance & cost. Confirm languages and infra (K8s, service mesh).
Goal: Know why tracing exists—latency, errors, dependency discovery, or customer journey mapping.
Exit condition: Success metrics: e.g., “reduce unknown time in checkout to <5% of trace duration.”
Goal: Spanness where it helps—not every function.
GET /orders/{id} patterns) vs high-cardinality raw pathsExit condition: Inventory of frameworks auto-instrumented vs manual spans needed.
Goal: Trace ID crosses async boundaries—no broken traces.
asyncio, Promise)Exit condition: Broken trace rate measurable; top 5 causes documented (missing propagation, etc.).
Goal: Representative traces without storing everything.
Exit condition: Written policy: baseline rate + error always + latency outliers.
Goal: Engineers use traces in incidents and perf work.
Exit condition: Runbook snippet: “How to find slowest span in checkout.”
Goal: PII controlled; budget predictable.