Install
openclaw skills install model-routing-orchestratorRoute each user request to the most cost-effective model or multi-model workflow based on task type, complexity, risk, latency, budget, tool needs, and verification requirements.
openclaw skills install model-routing-orchestratorUse this skill to decide which model tier, workflow shape, and verification strategy should handle a user's request.
The goal is to maximize cost-effectiveness without sacrificing task fit, correctness, or operational reliability.
This skill does not blindly choose the strongest model. It chooses the cheapest safe path that still meets the quality bar for the task.
It may recommend:
For every request, choose the minimum-cost execution path that can still satisfy:
Use this skill when you need to decide:
Do not use this skill to:
Collect or infer the following from the request and system context:
Classify the request into one or more of these categories:
Simple generation
General reasoning
Deep reasoning
Exact calculation or formal logic
Coding and technical execution
Long-context synthesis
Multi-modal tasks
High-risk tasks
Always prefer the cheapest path that can safely succeed.
Apply this order of preference:
Do not escalate unless the task characteristics justify it.
Use abstract capability tiers unless the deployment specifies exact providers.
Use for:
Strengths:
Weaknesses:
Use for:
Strengths:
Weaknesses:
Use for:
Strengths:
Weaknesses:
Use when exactness matters more than fluent wording.
Use this path for:
Rule: When a task requires exact numeric correctness, prefer tools plus model orchestration over pure model reasoning.
Score the request across these dimensions:
Apply these rules before any soft optimization.
If the task involves exact arithmetic, formulas, tables, accounting-like operations, unit-sensitive conversions, or step-sensitive logic:
If the task is high-risk:
If the task is materially ambiguous and the answer quality depends on interpretation:
If the input is large or multi-document:
If the task includes images, diagrams, PDFs with layout dependence, or visual interpretation:
For code tasks:
Choose one of these workflow shapes.
Use when:
Examples:
Use when:
Examples:
Use when:
Examples:
Use when:
Pattern:
Use when:
Pattern:
Examples:
Use when:
Pattern:
Best for:
Use when:
Pattern:
Best for:
Use when:
Pattern:
Best for:
Use these strategies to keep cost high-value.
Escalate to a stronger model or multi-step workflow when any of these appear:
Use a cheaper path when:
When the request includes complex calculations or formal reasoning:
Never use a fluent but non-verified freeform model answer as the final authority for exact numeric work when a deterministic path exists.
When the request includes large context:
Return exactly this structure:
Routing Decision: <single-economy | single-balanced | single-premium | tool-assisted | staged-pipeline | draft-and-review | parallel-comparison | consensus-check>
Primary Reason: <one-sentence explanation of why this route is the best cost-effective choice>
Task Profile:
Recommended Execution Plan:
Model Role Assignment:
Why This Is Cost-Effective:
Escalation Rule: <when to move to a stronger model or more expensive workflow>
Fallback Rule: <what to do if the chosen path fails or produces uncertainty>
Request: "Rewrite this email to sound more professional."
Output: Routing Decision: single-economy
Primary Reason: This is a low-risk rewriting task where a low-cost model should meet the quality bar.
Task Profile:
Recommended Execution Plan:
Model Role Assignment:
Why This Is Cost-Effective:
Escalation Rule: Escalate only if the user requests highly nuanced persuasion, legal sensitivity, or multiple tone variants.
Fallback Rule: If the first rewrite is too generic, retry once with stricter tone instructions using a balanced model.
Request: "Design an AI agent architecture for long-form writing with memory, retrieval, conflict resolution, and chapter planning."
Output: Routing Decision: single-premium
Primary Reason: This task requires deep reasoning, abstraction, and multi-part architecture tradeoff analysis.
Task Profile:
Recommended Execution Plan:
Model Role Assignment:
Why This Is Cost-Effective:
Escalation Rule: Escalate to parallel-comparison only if multiple competing architecture strategies are explicitly required.
Fallback Rule: If the first answer is too generic, rerun with a staged outline-first then full-design workflow.
Request: "Calculate the projected monthly burn and runway under three hiring scenarios and explain the assumptions."
Output: Routing Decision: tool-assisted
Primary Reason: This request depends on exact calculations, so deterministic computation is safer and more cost-effective than pure model reasoning.
Task Profile:
Recommended Execution Plan:
Model Role Assignment:
Why This Is Cost-Effective:
Escalation Rule: Escalate to premium review only if the scenario assumptions are ambiguous or strategically sensitive.
Fallback Rule: If tool-based computation is unavailable, return assumptions explicitly and mark the result as unverified instead of presenting certainty.
Request: "Compare these five candidate proposals and recommend the best one based on cost, timeline, technical fit, and risk."
Output: Routing Decision: staged-pipeline
Primary Reason: The request is large and decomposable, so staged extraction and synthesis is more cost-effective than sending everything directly to a premium model.
Task Profile:
Recommended Execution Plan:
Model Role Assignment:
Why This Is Cost-Effective:
Escalation Rule: Escalate to consensus-check if the recommendation will drive a major decision or if proposal differences are subtle.
Fallback Rule: If extraction quality is poor, rerun the extraction stage with a stronger model before recomputing the final recommendation.