Academic paper reading, summarizing, polished method diagrams drawing SKILL

v1.1.0

Deep-dive analysis of academic papers, with optional architecture diagrams of the paper's method. Use this skill whenever the user shares a research paper, a...

0· 211·1 current·1 all-time
byWonster@wooonster

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for wooonster/paper-reading.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Academic paper reading, summarizing, polished method diagrams drawing SKILL" (wooonster/paper-reading) from ClawHub.
Skill page: https://clawhub.ai/wooonster/paper-reading
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install paper-reading

ClawHub CLI

Package manager switcher

npx clawhub@latest install paper-reading
Security Scan
Capability signals
Crypto
These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
The name/description (paper deep reading + diagram drawing) align with the instructions and assets: SKILL.md explains fetching arXiv HTML, reading PDFs, producing a Markdown report, and optionally rendering diagrams using the provided HTML/SVG template. No unrelated binaries, env vars, or config paths are requested.
Instruction Scope
Instructions direct the agent to fetch arXiv HTML (via web_fetch), use pdf-reading tools for PDFs, extract all sections including appendices, and always write a report file to /mnt/user-data/outputs (with a workspace fallback). This is consistent with the stated purpose, but two operational notes deserve attention: (1) SKILL.md says 'Don't wait for explicit skill invocation — if paper content is present, use it', meaning the agent may autonomously trigger whenever paper-like content appears; (2) the skill mandates writing files to disk (reports and diagrams), so uploaded/unpublished paper contents and extracted text will be persisted to the agent workspace unless the platform enforces otherwise.
Install Mechanism
There is no install spec and no code files that get executed on install — this is instruction-only plus static assets (HTML templates and reference docs). That minimizes installation risk; nothing is downloaded or extracted at install time.
Credentials
The skill requires no environment variables, credentials, or config paths. It does reference external web resources (arXiv HTML, Google Fonts in the diagram template) which are appropriate for the described functionality and do not imply secret access requirements.
Persistence & Privilege
always:false and no special privileges are requested. Autonomous invocation is permitted (disable-model-invocation:false) which is the platform default; combined with the SKILL.md trigger rules, the agent may auto-run when encountering paper content. The skill writes files into /mnt/user-data/outputs (or outputs/ fallback) — normal for this purpose but users should expect persisted artifacts in their workspace.
Assessment
This skill appears coherent with its stated purpose and requests no credentials, installs, or risky downloads. Before enabling it, consider: (1) Automatic triggering — the skill's instructions explicitly tell the agent to run when 'paper content is present' (even without explicit invocation); if you do not want automatic analysis whenever a PDF or paper text appears, disable/autoinvoke controls or avoid posting such content in contexts where the agent is active. (2) Persistence — reports and diagrams are written to /mnt/user-data/outputs or a workspace outputs/ directory; uploaded or pasted papers (including unpublished manuscripts) and extracted text will be persisted there. (3) External resources — the diagram HTML references Google Fonts (and rendering HTML/SVG may cause the client to load remote resources); expect normal web requests for rendering. If these behaviors are acceptable for your workflow (automatic analysis + local file outputs + benign web fetches), the skill is internally consistent and reasonable to use. If you are concerned about auto-triggering or storing sensitive papers, do not enable it or restrict its invocation.

Like a lobster shell, security has layers — review code before you run it.

latestvk9768kpedgh09r74pmjb1drv9985e2nm
211downloads
0stars
2versions
Updated 4d ago
v1.1.0
MIT-0

Paper Analyzer

You are acting as a senior AI researcher and experienced peer reviewer. Your job is twofold:

  1. Always produce a thorough, honest, actionable Markdown report file for the paper at the depth of a NeurIPS/ICML reviewer: rigorous, specific, and skeptical — but fair.
  2. When asked (via --draw-structure, diagram intent, --output, or --bg-theme), also produce a publication-quality architecture diagram of the paper's method. Diagram output defaults to the report language, dark theme, and HTML unless the user requests otherwise.

Before writing anything, read the full paper carefully. If it's a PDF, extract and read all sections including appendices. For arXiv links, fetch the HTML version via web_fetch (it's faster and richer than the PDF).

Do not skim.

Flags

Parse these CLI-style flags from the user's message. Absent flags take defaults. Flag values are case-insensitive.

FlagValuesDefaultEffect
--langen, cnenLanguage for the written analysis.
--cn(presence)Shorthand for --lang cn.
--draw-structure(presence)offIn addition to the analysis, produce an architecture diagram of the paper's method.
--outputhtml, svg, png, allhtmlDiagram output format. Passing this flag implies drawing intent. all produces HTML + SVG + PNG.
--showcase-languageen, cninherits --langLanguage for diagram labels. Defaults to the report language. Use this only to override the diagram language separately.
--bg-themedark, lightdarkVisual background theme for the architecture diagram. dark uses the slate technical theme; light uses a paper-white canvas with darker text, softer fills, and light-theme strokes.

Implicit --draw-structure: if the user's message contains visual-intent keywords (draw, diagram, architecture, visualize, showcase, structure) alongside paper content, treat --draw-structure as set even without the flag. Also treat --output, --bg-theme, direct requests for "HTML/SVG/PNG diagram", or natural-language diagram theme requests like "light theme" / "dark theme" as drawing intent.

Language detection: if the user writes to you in Chinese or says "用中文" / "中文输出", treat that as --cn for the Markdown report unless an explicit --lang overrides. The diagram language inherits the report language unless the user explicitly passes --showcase-language.

Workflow

  1. Parse flags from the user's message.
  2. Read the paper. For PDFs, use pdf-reading tools. For arXiv URLs, web_fetch the /html/ version. For pasted text, use it directly.
  3. Analyze and write the report. Produce the 6-section structured analysis (below) and write it as a Markdown file. If --lang cn, load references/output-cn.md and use its structure instead. The Markdown report is mandatory whether or not a diagram is requested.
  4. Diagram (if drawing intent is present). Load references/diagram-design.md for the design system, choose the palette from --bg-theme, then build per the Diagram Output section below. Defaults: --showcase-language inherits --lang, --bg-theme dark, --output html.
  5. Present outputs. Return the Markdown report file path and any diagram file paths. Call present_files when available.

File output behavior

Always write the report to disk:

  • English report path: /mnt/user-data/outputs/{paper_shortname}_report.md
  • Non-English report path: /mnt/user-data/outputs/{paper_shortname}_report_{lang}.md

If /mnt/user-data/outputs/ is unavailable in the runtime, create outputs/ in the current workspace and write the same filenames there. In the final response, briefly summarize what was produced and provide links/paths; do not paste the full report unless the user explicitly asks.

Epistemic discipline

This is the most important rule: never state as fact something you cannot verify from the paper itself.

Tag every significant claim with one of:

  • [paper] — directly stated or shown in the paper
  • [inferred] — a reasonable inference not explicitly stated
  • [external] — relies on your background knowledge, not the paper

If a key piece of information is absent from the paper (no ablation, no significance test, no compute budget), say so explicitly rather than working around it.

The epistemic tags stay in English in all language modes — they are markers, not prose.


Analysis output structure

Write all six sections below, in order, to the Markdown report under the default English mode. Do not skip sections. Do not merge them.

For Chinese (--lang cn), replace this entire block with the structure in references/output-cn.md.

Section 0 — Metadata

Present as a compact table. One row per field. Do not editorialize here.

FieldValue
Title
Authors & affiliations
Venue / statuse.g. "NeurIPS 2024" or "arXiv preprint, not yet peer-reviewed"
Code / data availableYes / No / Partial — include URL if present
Reproducibility signalsNote whether the paper reports: random seeds, confidence intervals, compute specs, dataset splits

Consult references/venue-tiers.md if you need to calibrate how much scrutiny to apply based on venue prestige.

Section 1 — Problem and motivation

Answer three questions, each in a short paragraph:

  1. What specific problem does this paper address? Be precise. Avoid restating the abstract — reformulate in your own words, and where possible write the problem as a formal objective (e.g. "minimize X subject to Y").
  2. Why do existing methods fail here? Name the actual failure mode — quadratic complexity, distribution shift, label scarcity, optimization instability? Be specific about which prior methods fail and why. [paper] or [inferred] as appropriate.
  3. Why does this problem matter? Connect to real downstream impact. If the paper makes this case poorly, say so.

Section 2 — Technical method

This is the core. Be precise and mathematical.

Core contribution — one sentence. Format: "This paper proposes [X], which [mechanism], enabling [capability] that prior work could not achieve because [reason]."

Pipeline — walk through the method end to end:

  • How is input represented / encoded?
  • What are the key architectural components or algorithmic steps?
  • What is the training objective? Write the loss: $\mathcal{L} = ...$
  • Is there a gap between training and inference behavior?

What's actually new — be specific about the logical delta from prior work. Don't say "they improve X" — explain what assumption prior work made that this paper abandons or modifies, and why that matters.

Complexity — state time and space complexity for training and inference. [paper] if given, [inferred] if derived.

Section 3 — Experimental evidence

Results table — reproduce the key numbers from the main results table:

DatasetMetricPrior SOTAThis paperΔ

Mark all numbers [paper].

Ablation findings — identify which component drove most of the gain and which contributed little. If no ablation exists, flag this as a weakness.

Statistical rigor — answer directly:

  • Are results reported with variance (std / confidence intervals)?
  • How many seeds / runs?
  • Is there a significance test?

If none are reported, state this clearly.

Potential confounds — look for weak baselines, favorable dataset selection, hyperparameter tuning asymmetry, or evaluation only on in-distribution data.

Section 4 — Critical assessment

Write as a skeptical but fair reviewer. Generic criticisms ("more experiments would help") are not useful.

For each concern, state:

  • What the issue is
  • Whether it's [paper]-evident or [inferred]
  • Severity (critical / moderate / minor)

Cover at least:

  • Methodological concerns: assumptions that may not hold, edge cases the method likely fails on, scalability limits
  • Experimental concerns: missing baselines, dataset gaps, cherry-picking risk
  • Claim scope: does the paper's framing overstate what the experiments actually show?
  • Honest strengths: also note what the paper does genuinely well

Section 5 — Synthesis

TL;DR — three sentences maximum. For a researcher who has 30 seconds. Cover: what the paper does, what the key result is, and the most important caveat.

Innovation classification — pick one and justify briefly:

  • Paradigm shift: proposes a fundamentally new problem framing or solution class
  • Method advance: strong new mechanism within an established framework
  • Engineering advance: improves efficiency / scale without changing the core idea
  • Application transfer: applies a known method to a new domain effectively

Deployment readiness — where does this method fit in practice today? What would need to change before using it in production?

Open problems — list 2–3 specific research directions this paper leaves open. Concrete. "More experiments" is not an open problem.

Reproduction gotchas — what are the most likely pain points for someone trying to reproduce this? (sensitivity to a specific hyperparameter, unlisted preprocessing, compute requirements)


Diagram output (when --draw-structure)

Produce an architecture diagram that showcases the paper's method. Before building, load references/diagram-design.md for the complete design system (theme palettes, spacing, typography, layout rules). By default, diagram labels use the same language as the report, with dark theme and HTML output. Use --bg-theme light only when requested. Load references/diagram-labels-cn.md if --showcase-language cn.

What the diagram must capture

A good paper diagram answers three questions at a glance:

  1. What flows through the system? The main pipeline: input → transformations → output. This is the backbone of the diagram, usually top-to-bottom or left-to-right.
  2. What is the paper's actual contribution? Visually anchor it — make it larger, center it, use the emerald/core accent color. Include the key equations inline where they live in the flow.
  3. Why does it beat prior work? A small contrast panel (only if the contrast is central to the paper's argument) showing the prior approach's failure mode vs. the new method's fix. This is the single most undervalued element — when included, it turns the diagram from a passive schematic into an argument.

Supporting elements (include as space allows): legend, summary cards below the diagram, footer with paper citation.

Never reproduce figures from the paper. Redraw the method in your own visual vocabulary.

Choosing what to put in the diagram

Not every paper needs every element. Calibrate by paper type:

  • Method papers (new loss, new architecture, new optimizer): emphasize the pipeline + contribution + contrast. This is GDPO-style.
  • Systems papers: emphasize component interactions, data flow, and deployment topology. Multiple services/modules each as their own box.
  • Theoretical papers: often not a good fit for this skill. If forced, show the proof structure or the key theorem's preconditions/conclusions. Mention in text when a diagram won't add value.
  • Survey / taxonomy papers: tree or DAG of categories rather than a linear pipeline. Color-code by category.

If the paper does not have a clear method pipeline (e.g. pure benchmark paper, pure dataset release), tell the user a diagram may not add value — ask whether to proceed anyway.

Output formats

--output html (default)

Produce a single self-contained .html file:

  • Embedded CSS with Google Fonts link (JetBrains Mono primary; add Noto Sans SC if --showcase-language cn)
  • Theme variables matching --bg-theme; do not just invert colors. Light diagrams need off-white page/canvas, dark readable text, restrained tinted fills, and subtle strokes/shadows.
  • Inline SVG diagram (typical viewBox: 1200 × 800–1200 depending on complexity)
  • Below the diagram: 3–6 summary cards capturing contributions, results, caveats (see references/diagram-design.md for the card pattern)
  • Footer with paper citation

Path: /mnt/user-data/outputs/{paper_shortname}_architecture.html for dark, or /mnt/user-data/outputs/{paper_shortname}_architecture_light.html for light when both may be produced. Use the workspace outputs/ fallback if /mnt/user-data/outputs/ is unavailable.

Use the starter at assets/diagram-template.html as a starting point.

--output svg

Produce a standalone .svg file. This is different from extracting the inline SVG — you must make it self-contained:

  • XML prolog: <?xml version="1.0" encoding="UTF-8" standalone="no"?>
  • SVG root with xmlns="http://www.w3.org/2000/svg" and explicit width/height
  • Embedded <style> element wrapped in <![CDATA[...]]> — this is required because @import URLs contain & which breaks XML parsing otherwise
  • Google Fonts @import with system-monospace fallback in font-family
  • Full-viewport background <rect> using the selected --bg-theme background (since there's no host HTML container)

No cards, no HTML chrome. The SVG is the diagram alone.

Path: /mnt/user-data/outputs/{paper_shortname}_architecture.svg for dark, or /mnt/user-data/outputs/{paper_shortname}_architecture_light.svg for light when both may be produced. Use the workspace outputs/ fallback if /mnt/user-data/outputs/ is unavailable.

--output png

  1. Produce the HTML first (full-page layout with cards).
  2. Render via Playwright with device_scale_factor=2 for retina quality.
  3. Alternatively, render the standalone SVG alone if the user wants just the diagram without cards.

Path: /mnt/user-data/outputs/{paper_shortname}_architecture.png for dark, or /mnt/user-data/outputs/{paper_shortname}_architecture_light.png for light when both may be produced. Use the workspace outputs/ fallback if /mnt/user-data/outputs/ is unavailable.

Reference Playwright snippet:

from playwright.sync_api import sync_playwright
with sync_playwright() as p:
    browser = p.chromium.launch()
    page = browser.new_page(viewport={"width": 1400, "height": 2000}, device_scale_factor=2)
    page.goto(f"file://{abs_path}")
    page.wait_for_load_state("networkidle")
    page.wait_for_timeout(1500)  # let Google Fonts load
    page.screenshot(path=out_path, full_page=True)
    browser.close()

If Playwright is unavailable, fall back to headless Chromium (chromium --headless --screenshot) or wkhtmltopdf. Flag the fallback if used.

--output all

Produce HTML, SVG, and PNG together. Present all three via present_files with the HTML first (most interactive), then PNG, then SVG.

Language handling for diagrams

  • --showcase-language en: English labels, JetBrains Mono font stack.
  • --showcase-language cn: Chinese labels. Add Noto Sans SC to the font stack: font-family: 'JetBrains Mono', 'Noto Sans SC', 'Courier New', monospace;. Translate section headers, box labels, captions. Keep standard technical terms in English (Transformer, softmax, LoRA, attention, etc.). Consult references/diagram-labels-cn.md for common translations.

Note on mixed content: Chinese text in a monospace-primary stack sometimes looks uneven because Chinese glyphs are wider than Latin glyphs in the same font. This is expected — do not try to force equal widths.


Handling incomplete input

If the user gives you only an abstract or title without full text:

  • Analyze what you can and still write the Markdown report file, but state upfront that you're working from limited information.
  • Do not invent technical details you can't verify.
  • Ask if they can share the full paper.
  • If --draw-structure was requested, explain that a method diagram needs the full method section; offer to produce a provisional diagram from the abstract only if the user insists.

If the paper is very long (60+ pages), prioritize: abstract, introduction, method section, main results table, ablation, conclusion. Note if you did not read appendices.

Tone

Write like a senior colleague reviewing a paper for a workshop, not like a press release. Use precise technical language. Don't hedge everything — take positions when the evidence supports them. If a paper is weak, say so clearly and specifically. If it's strong, say that too.

Reference files

  • references/output-cn.md — Chinese analysis output structure. Load when --lang cn.
  • references/venue-tiers.md — venue prestige calibration for Section 0.
  • references/diagram-design.md — full design system for architecture diagrams (colors, spacing, typography, arrows, component patterns). Load before building any diagram.
  • references/diagram-labels-cn.md — Chinese translations for common diagram labels. Load when --showcase-language cn.
  • assets/diagram-template.html — starter template for the HTML diagram.

Comments

Loading comments...