Local Inference Context

Context management for self-hosted LLM backends (llama.cpp, Ollama). Prevents mid-task 503 errors and context overflows caused by VRAM-limited KV caches. Use instead of generic context skills when running a local inference server. Complements context-recovery for post-compaction scenarios.

Install

openclaw skills install @joekravelli/local-inference-context