Install
openclaw skills install paper-deep-readingDeep-read research papers into source-aware reports, traceable claim evidence, and research-direction seeds. Use for paper PDFs, LaTeX sources, appendices, code notes, peer reviews, literature-review tasks, novelty audits, and finding new research questions with minimum viable experiments.
openclaw skills install paper-deep-readingUse this skill when the user wants a deep, paper-grounded, auditable, idea-generative reading report for one computer-science paper or a small paper batch.
The input may be:
.tex filesThe default output is text-first, audit-first, formula-preserving, and research-direction-oriented. This version does not require a dedicated webpage reader; when search/browsing tools are available, use them to assemble the best source package before writing.
Human-readable report
report.mdMachine-readable trace artifacts
traceability_manifest.jsonlatex_paragraphs.jsonartifact_index.jsonMachine-readable research artifacts
research_lens.jsondirection_board.jsonThe report is the primary user-facing deliverable. It must read like a serious research mentor's deep-reading memo, not like a thin checklist dump. The direction board is the primary idea-mining surface: it converts paper weaknesses, hidden assumptions, evidence gaps, proxy mismatches, successor-paper gaps, and reviewer objections into candidate research directions.
This skill package is intended to stay compatible with ClawHub / OpenClaw skill packaging.
Keep the package lean:
SKILL.md as the main instruction fileREADME.md or CHANGELOG.mdKeep the package license-safe:
MIT-0 publication modelLICENSE.txtMIT-0 redistribution expectationsRuntime discipline:
Do not treat the OpenClaw / ClawHub version as a lightweight summary mode. The single-file constraint changes presentation, not analysis quality.
Never remove, weaken, shorten, or bypass any existing deep-reading requirement, including:
The report depth bar should stay close to a strong top-conference paper memo:
If a tension appears between a shorter explanation and a more idea-generative one, choose the more useful research-direction analysis while keeping claims grounded.
If a tension appears between speculation about author intent and factual safety, label the reconstruction explicitly as plausible inference or speculation and anchor it to textual evidence.
This version keeps the original traceability and formula-preservation bar, but adds a research-direction mining layer.
The report must help the user answer not only:
but also:
C brokeY had to be replacedZ the paper constructedUse references/research-generative-methodology.md and references/research-direction-mining-best-practices.md whenever the user wants:
Read each paper in three direction-mining passes. These passes are adapted for discovering new research points, not merely for comprehension.
Quickly inspect title, abstract, introduction, section headings, conclusion, references, and visible figures/tables. Answer the five triage questions:
Category: What type of paper is this: method, benchmark, theory, measurement, system, dataset, analysis, survey, or position?Context: What field conversation, assumptions, and ancestor methods does it sit inside?Correctness: Do the core assumptions, data, metrics, and comparisons look initially plausible?Contributions: What are the claimed contributions and how strong do they look before deep verification?Clarity: Is the argument organized well enough that the method and claims can be audited?Then add a direction-promise note:
Read the paper carefully but keep the goal causal:
problem -> assumption break -> design principle -> module -> formula -> figure/table -> experiment -> claim
During this pass:
available proxy that replaces an unavailable ideal mechanismRecreate the work as if you had to implement, prove, or reproduce it. Ask:
This pass must produce future-work triggers. A trigger is not a generic suggestion; it is a statement of the form:
current method works if H -> under not-H it breaks -> new mechanism needed -> minimum experiment to test the opportunity
When the user asks for new research directions, do not stop at the paper's own related work. If tools are available and time permits, inspect a small set of successor papers, citation trails, follow-up discussions, code repositories, or public review threads.
Use successor reading to answer:
If successor-paper search was not possible, say so and keep direction confidence lower. Do not fabricate citation trends.
Every report must combine critical and creative reading.
Critical reading asks:
Creative reading asks:
The final directions must be creative and falsifiable.
Use reviewer thinking not just to judge acceptance, but to discover research seeds.
Audit at least these dimensions when evidence allows:
Convert reviewer objections into direction candidates:
reviewer objection -> why it matters -> what evidence would resolve it -> minimum viable experiment -> possible new paper
The skill does not replace the researcher or claim to have completed experiments. It turns a paper into candidate directions that a researcher can test.
Every strong candidate direction must include:
seed_type: one of assumption violation, unavailable mechanism, proxy mismatch, evidence gap, tiny example, successor-paper gap, reviewer objection, negative result, or cross-domain transferpaper_anchor: claim IDs and source evidence that triggered itresearch_question: a question that can be answeredhypothesis: what might be trueminimum_viable_experiment: the smallest decisive testnegative_result_interpretation: what it would mean if the hypothesis failskiller_objection: the strongest reason the idea might be uninteresting or invalidkiller_result: the result that would make the direction worth pursuingfirst_week_plan: practical steps for a researcher's first weekrisk_level and expected_valueGeneric future-work lists are not enough. A direction without a test plan is an inspiration note, not a research seed.
The report itself remains the primary verification surface, but the detailed evidence placement is:
Main body
### Anchored Points blocks near the relevant discussion- [C5.2][evidence-backed interpretation] ...Final appendix
Do not clutter the main narrative by inserting long locator bullets immediately after every claim.
Keep the main body readable, and move detailed original-paragraph explanation to the final # Appendix: Claim -> Evidence Index.
Use scripts/render_inline_trace_report.py after drafting the report and manifest to materialize or refresh that appendix.
When the paper contains key formulas, the report must not compress them into prose-only summaries.
For each central equation, objective, theorem statement, update rule, estimator, metric, loss, or constraint, explicitly include:
Do not weaken equation detail for the sake of shorter presentation.
Always assemble the best available evidence package before writing.
Preferred reading order:
Treat LaTeX as the primary structural source.
Use PDF only as a visual and pagination aid for:
Do not stop at PDF summarization immediately.
First check whether the same paper has a matching arXiv LaTeX/source package. If it exists and matches the same paper, switch to LaTeX-primary + PDF-assisted reading.
If not, continue with the PDF and say explicitly that the reading is PDF-primary.
Search for the paper and collect:
Never silently analyze the wrong paper. Disambiguate by title, authors, abstract, year, venue, and method keywords.
If the paper is an ICLR or OpenReview-hosted paper, look for:
Use them to enrich:
If some sources cannot be found, do not abort. State clearly what was attempted, what was found, what was missing, and how that affects confidence. Then continue with the best grounded report possible.
If LaTeX cannot be found after an explicit search, say so clearly and use PDF-oriented evidence rows in traceability_manifest.json instead of pretending paragraph anchors exist.
Write the skill instructions, internal prompts, and template skeletons in English. Choose the report language from the user's current request language by default.
When writing the report in Chinese:
report.mdThe report must cover, whenever the evidence supports it:
Use templates/report_template.md as the default skeleton.
For each numbered section:
### Anchored Points- [C<section>.<index>][label] claim texttraceability_manifest.jsonThis is the claim-to-evidence map.
Rules:
interpretation_typeresearch_rolelatex_paragraphs.jsonThis is the stable LaTeX anchor index.
Each entry must keep:
paragraph_idsource_pathline_startline_endsection_pathkindtextartifact_index.jsonA compact index for the generated text-first bundle.
It should list the locations of:
report.mdtraceability_manifest.jsonlatex_paragraphs.jsonresearch_lens.jsondirection_board.jsonresearch_lens.jsonThis is the compact idea-mining artifact. Use templates/research_lens.template.json and references/artifact_contract.md.
It should capture:
direction_board.jsonThis is the structured research-direction board. Use templates/direction_board.template.json.
It should capture:
Use stable section-local ids such as:
C3.1C5.2C14.4Do not hide multiple judgments in one claim bullet.
List all materially relevant evidence for a claim, not just one convenient paragraph.
Each claim must declare exactly one of:
evidence-backed interpretationplausible inferencespeculationIf the report reconstructs likely author reasoning, it must still point to the exact paragraphs, equations, figures, tables, experiments, reviews, or successor-paper signals that motivate that reconstruction. Idea generation is required, but fabrication is forbidden.
Each direction seed should also label the trigger as one of:
evidence-backed interpretationplausible inferencespeculationDo not rank speculative seeds as high-confidence unless the uncertainty is explicit.
Prefer a report that is pleasant to read and easy to audit.
For every claim, the user should be able to answer:
For the strongest research-direction sections, the report should also answer:
Use phrasing such as:
The report should sound like a research mentor reconstructing how the work may have been invented and how it could become the next project, not like a generic summarizer.
scripts/extract_latex_paragraphs.py.report.md using anchored claim IDs in the main body.traceability_manifest.json so each claim points to one or more paragraph IDs or fallback anchors.research_lens.json so the paper's research equation, story structure, module logic, citation functions, reviewer audit, and future directions are captured in structured form.direction_board.json so the best candidate research seeds are ranked, testable, and linked to evidence.artifact_index.json so the bundle stays portable.scripts/validate_traceability.py.scripts/validate_direction_board.py when direction_board.json is present.scripts/render_inline_trace_report.py to append or refresh the final Claim -> Evidence Index appendix in report.md.For a small paper batch:
report.md-style bundle per paper when the user expects detailed readingIf some sources cannot be found, do not abort. State clearly:
Then continue with the best grounded report possible.
If the evidence does not support strong idea generation, say so and produce a conservative direction board. Do not invent novelty, successor trends, reviewer objections, or experimental feasibility.