Install
openclaw skills install @fhaozzy/llm-inference-radarBuild and run a Chinese research intelligence radar for LLM inference acceleration, AI infra, model serving, kernels, reproducibility, open-source project signals, company infra updates, horizon transfer opportunities, feedback learning, and a persistent knowledge base. Use when the user asks for daily or weekly briefs, paper radar, open-source watch, infra watch, transfer matrix updates, hidden opportunities, or maintaining a research radar for LLM inference acceleration.
openclaw skills install @fhaozzy/llm-inference-radarUse this skill to produce a faithful research-intelligence radar for LLM inference acceleration. Default to Chinese output while preserving standard English technical terms such as KV cache, speculative decoding, continuous batching, TTFT, TPOT, and throughput.
For latest briefs or claims about recent work, gather current sources first. Prefer primary sources: papers, OpenReview pages, arXiv pages, official repositories, release notes, PRs, design docs, benchmarks, official docs, and engineering blogs. Treat X/Twitter, newsletters, media posts, Zhihu, and WeChat posts as leads only; trace them back to primary sources when possible. If no primary source is found, label the item as an early signal and do not present strong conclusions.
Always include open-source status and reproducibility for papers, new methods, and new technical claims. Do not amplify performance claims without benchmark context. If benchmark details are incomplete, explicitly say that generalizability cannot be judged yet.
Use multi-label classification. Do not force an item into one category when it naturally spans serving, KV cache, agent runtime, long-context memory, VLM serving, or RL/test-time scaling.
Deduplicate the same event across arXiv, HF Daily Papers, GitHub, author posts, and media coverage. Aggregate evidence into one item instead of repeating it.
Preserve exploration budget while making the daily brief paper-first. A daily brief must include at least 10 items, at least 5 paper or method items, and at least 3 items from Horizon Radar, Hidden Opportunity, Upstream Signal, or Downstream Signal. Prefer algorithmic/method papers over routine engineering releases when both have similar evidence quality.
Read only the references needed for the task:
references/output-templates.md, references/ranking-and-evidence.md, references/taxonomy.yaml, and references/watchlist.yaml.references/ranking-and-evidence.md, references/output-templates.md, and references/taxonomy.yaml.references/watchlist.yaml, references/ranking-and-evidence.md, and references/output-templates.md.references/taxonomy.yaml, references/transfer-matrix.md, and references/output-templates.md.references/feedback-learning.md and the affected files in references/ or knowledge_base/.references/knowledge-base-schema.md and the relevant files in knowledge_base/.py scripts/check_brief.py <brief.md> from the skill directory and fix any reported omissions.knowledge_base/ files.For each paper or method, answer these points before ranking it:
Use the paper template in references/output-templates.md.
Do not rank open-source updates by stars or titles alone. Inspect the actual artifact: release note, merged major PR, RFC/design doc, benchmark, issue discussion, maintainer comment, docs changelog, or new feature integration.
Classify the update as one or more of: major engineering progress, ordinary maintenance, benchmark update, RFC/design doc, issue-exposed bottleneck, docs/API change, or integration signal.
For company infra posts, extract the real system or benchmark information: architecture change, hardware, serving stack, workload, latency/throughput/cost/memory metrics, relationship to open-source systems, and missing details. Account for promotional bias.
When new domains or workloads appear, ask:
Update knowledge_base/transfer_matrix.md when there is durable evidence or a useful open question.
At the end of each interaction, check whether the user gave feedback about topics, sources, output format, depth, quality, research stage, watchlist, taxonomy, or transfer matrix. Classify the feedback as hard constraint, strong preference, weak preference, or one-off request.
Apply durable feedback to references/config.yaml, references/taxonomy.yaml, references/watchlist.yaml, or knowledge_base/transfer_matrix.md as appropriate. Record the raw feedback and interpretation in knowledge_base/feedback_log.md, and record durable rule changes in knowledge_base/changelog.md.
Do not overfit to short-term preferences. Keep the configured horizon exploration budget unless the user explicitly changes it.