Install
openclaw skills install rag-pipelinesDeep RAG workflow—document ingestion, chunking, metadata, retrieval and reranking, grounding and citations, evaluation, and failure modes (hallucination, staleness). Use when building or debugging retrieval-augmented generation systems.
openclaw skills install rag-pipelinesRAG quality is dominated by chunking, retrieval, and evaluation—not the LLM alone. Treat the system as data engineering plus generation with explicit failure modes.
Trigger conditions:
Initial offer:
Use six stages: (1) task & success criteria, (2) ingestion & cleaning, (3) chunking & metadata, (4) retrieval & rerank, (5) generation & grounding, (6) evaluation & monitoring). Confirm embedding model and retrieval stack (vector DB, search engine, hybrid).
Goal: Define what a “good” answer contains: required citations, length, tone, and when to refuse.
Exit condition: Written rubric with examples of acceptable vs unacceptable answers.
Goal: Deterministic text extraction (strip boilerplate, handle PDF/OCR if needed); deduplicate documents; track source URL and updated_at for staleness.
Goal: Tune chunk size and overlap to query patterns—not one global token count for all content.
Goal: Hybrid lexical + dense retrieval often beats vector-only for keyword-heavy queries.
Goal: System prompts that require using only provided context; explicit “not found” behavior; optional citation format (snippet, doc id, link).
Goal: Offline golden questions with expected supporting docs; online thumbs-down reasons; monitor retrieval hit rate, nDCG@k, and age of sources used.