Install
openclaw skills install supplier-scorecardTrack and score supplier reliability, quality consistency, lead time accuracy, and communication to make sourcing decisions based on evidence. Use when comparing suppliers for the same product, deciding whether to switch or dual-source, negotiating with data instead of feelings, or building a quarterly supplier review process for an ecommerce business.
openclaw skills install supplier-scorecardChoosing the wrong supplier costs far more than the price difference on a quote — it shows up as late shipments, inconsistent quality, customer returns, and listing suspensions. This skill builds a structured scoring framework for evaluating your suppliers across reliability, quality, lead time accuracy, communication responsiveness, and cost competitiveness so you can make sourcing decisions backed by evidence instead of gut feeling or whoever quoted cheapest last month.
| Decision | Strong | Acceptable | Weak |
|---|---|---|---|
| Scoring dimensions | 5 weighted: quality 30%, reliability 25%, lead time 20%, communication 15%, cost 10% | Equal-weighted 5 dimensions | Cost-only comparison |
| Evidence basis | Order-level logs (defect counts, promised vs. actual dates) | Last 90 days from memory, flagged as estimate | "They seem fine" |
| Review cadence | Quarterly scorecard + ad-hoc after any critical incident | Semi-annual | Only when something breaks |
| Switching threshold | Score gap ≥15 points sustained 2 quarters + transition plan | Gap ≥20 points one quarter | Switching over one bad order |
| Dual sourcing | Hero SKUs dual-sourced at 70/30 volume split | Backup supplier qualified but inactive | Single-sourced bestsellers |
| Sample evaluation | Blind side-by-side against spec sheet, 3 units per supplier | Single sample vs. spec | Photos from the sales rep |
| Negotiation use | Scorecard shared with supplier; improvement targets attached to volume commitments | Scores referenced verbally | Scores kept secret, used to ambush |
Set the five dimensions with default weights: Quality consistency 30% (defect rate, spec adherence, batch-to-batch variance), Reliability 25% (orders fulfilled complete and correct), Lead time accuracy 20% (promised vs. actual days — accuracy, not raw speed), Communication 15% (response time, proactive problem disclosure, English/your-language clarity), Cost competitiveness 10% (landed cost vs. market, payment terms, MOQ flexibility). Adjust weights only with a written reason (e.g., fashion seasonality → lead time 30%).
Per order, record: order date, promised ship date, actual ship date, units ordered/received, defects found (incoming inspection + customer-reported within 60 days), spec deviations, and communication incidents (slow replies, surprise changes, proactive warnings). Start from the last 90 days of order history, email, and chat logs — estimates are acceptable for the first scorecard if flagged.
Use the anchors in references/scoring-rubrics.md so scores mean the same thing across suppliers and quarters — e.g., lead time accuracy: 10 = within ±2 days on 95%+ of orders; 5 = within ±7 days on 80%; 1 = misses by >2 weeks or unannounced delays. Never score from impression; every score cites log entries.
Weighted score = Σ(dimension score × weight) × 10, giving 0–100. Band: 85+ strategic partner; 70–84 solid, monitor; 55–69 improvement plan required; <55 begin transition. Tie-breaks go to quality, never cost.
Before switching for a higher-scoring or cheaper supplier: order blind samples (3 units, evaluated side-by-side against the spec sheet), run one small paid pilot order through full inspection, and model the switching cost (tooling, MOQ, re-inspection, listing risk from quality change). A 15-point score gap sustained two quarters justifies a switch; one bad month does not.
Quarterly re-score from the log (30 minutes per supplier once the log exists), ad-hoc re-score after any critical incident (failed inspection, missed launch date), and an annual deep review including factory audit or video walkthrough for top-volume suppliers. Verify each scorecard with assets/scorecard-quality-checklist.md.
Inputs: Supplier A (current): $4.20/unit, 18 orders logged. Supplier B (candidate): $3.65/unit, 4 pilot orders.
Output (excerpt):
Dimension (weight) A B Evidence Quality 30% 8 5 A: 1.1% defect rate, stable across batches. B: 3.8% on pilots, one batch with off-spec USB ports Reliability 25% 9 7 A: 18/18 complete. B: 3/4 complete, one 8% short-ship "resolved" with credit Lead time 20% 6 8 A: avg +5 days vs. promise (holiday slippage). B: ±2 days all pilots Communication 15% 9 6 A: <4h responses, proactively flagged a material change. B: 1–2 days, vague on defect root cause Cost 10% 6 9 B is 13% cheaper landed Weighted total 78.5 65.3 Recommendation: Stay with A; the $0.55 saving costs ~2.7 points of defect rate — at your 4.2% return-rate sensitivity that's negative ROI. Action: share lead-time scores with A and attach a ±3-day target to Q3 POs; keep B warm as a 20% dual-source for the holiday capacity crunch.
Inputs: Supplier C, kitchenware, scored 86 in Q1. Q2 log shows defects 0.9% → 2.6%, two unannounced material substitutions, response time stable.
Output (excerpt):
Q2 score: 73 (−13). Quality 9→6 (defect trend + golden-sample deviation on handle material), communication 8→6 (substitutions disclosed only after inspection caught them). Band drops to "solid, monitor" with a flag: two unannounced substitutions is a leading indicator of margin-squeeze behavior. Action: (1) written spec re-confirmation with material certificates per batch, (2) incoming inspection raised from 10% to 25% sampling for 2 orders, (3) improvement target in next PO: zero unapproved substitutions, defects <1.5%, (4) re-score in 6 weeks instead of next quarter. Do NOT consolidate the new SKU launch with C until back above 80.
references/output-template.md — scorecard table, banding, and recommendation formatreferences/scoring-rubrics.md — 1–10 anchored rubrics for all five dimensionsreferences/sourcing-playbook.md — evidence logging, sample testing, dual-sourcing, and switching protocolsassets/scorecard-quality-checklist.md — scorecard completeness checklist (40+ items)