Notcrawl

v0.1.0

Mirror a Notion workspace into local SQLite + normalized Markdown for search, diff, and agent queries without depending on the Notion UI.

0· 42· 1 versions· 0 current· 0 all-time· Updated 8h ago· MIT-0
byChase Webb@chasewebb

Notcrawl

Local-first Notion crawler. Pulls pages, databases, and blocks into a SQLite store and emits normalized Markdown alongside, so PaperBrain can absorb Notion content into the vault graph.

Requirements

  • Notion internal integration token. Create at https://www.notion.so/profile/integrations → New integration → copy the secret (secret_… or ntn_…).
  • Share each Notion page or database with the integration (Notion's menu → Connections → add your integration). Notcrawl can only see what the integration is invited to.
  • notcrawl binary on PATH (installed at ~/.local/bin/notcrawl).

Setup

export NOTION_API_KEY="ntn_…"
notcrawl init                              # create ~/.notcrawl/config.toml + db
notcrawl sync --full                       # initial pull of all shared content
notcrawl export-md --out ~/.notcrawl/md    # dump normalized Markdown

State

  • Config: ~/.notcrawl/config.toml
  • Database: ~/.notcrawl/notcrawl.db
  • Markdown export: ~/.notcrawl/md/ (configurable)

Common Commands

notcrawl status --json
notcrawl sync --incremental
notcrawl pages list --json
notcrawl search "OKR" --json
notcrawl export-md --out <dir>             # regenerate Markdown
notcrawl sql 'SELECT count(*) FROM pages'

Integration Notes

  • Markdown export is the bridge to PaperVault — point --out at a vault folder (e.g. KNOWLEDGE/notion/) to fold Notion content into the graph.
  • Schedule notcrawl sync --incremental + notcrawl export-md via PaperFang for hands-free mirroring.
  • Diff-friendly: Markdown output is deterministic, so changes show up cleanly in git.

Version tags

crawlervk978z5e8bxt9qeq9adyb1hrk6x85p879latestvk978z5e8bxt9qeq9adyb1hrk6x85p879local-firstvk978z5e8bxt9qeq9adyb1hrk6x85p879paperchasevk978z5e8bxt9qeq9adyb1hrk6x85p879

Runtime requirements

📝 Clawdis
OSmacOS · Linux
Binsnotcrawl
EnvNOTION_API_KEY