Install
openclaw skills install redbookSearch, read, analyze, and automate Xiaohongshu (小红书) content via CLI
openclaw skills install redbookUse the redbook CLI to search notes, read content, analyze creators, automate engagement, and research topics on Xiaohongshu (小红书/RED).
OpenClaw users: Install via clawhub install redbook or npm install -g @lucasygu/redbook.
⚠️ Research discipline (read this first). XHS 风控 throttles reading, not just writing. Any research that reads more than a handful of notes MUST run through the Research Loop — human-paced by default (~20 s/note, one at a time; a typical job finishes in tens of minutes, large ones spread across the day). Hammering
read/comments/analyze-viralin a tight loop trips captcha / IP block (300012) within a few dozen hits and degrades the account for hours. ⚡ Fast mode is emergency-only and requires a printed warning + explicit user opt-in. Never fire reads in parallel or with zero delay.
/redbook search "AI编程" # Search notes
/redbook read <url> # Read a note
/redbook user <userId> # Creator profile
/redbook analyze <userId> # Full creator analysis (profile + posts)
| Intent | Command |
|---|---|
| Search notes | redbook search "keyword" --json |
| Read a note | redbook read <url> --json |
| Get comments | redbook comments <url> --json --all |
| Creator profile | redbook user <userId> --json |
| Creator's posts | redbook user-posts <userId> --json |
| Browse feed | redbook feed --json |
| Search hashtags | redbook topics "keyword" --json |
| Analyze viral note | redbook analyze-viral <url> --json |
| Extract content template | redbook viral-template <url1> <url2> --json |
| Post a comment | redbook comment <url> --content "text" |
| Reply to comment | redbook reply <url> --comment-id <id> --content "text" |
| Batch reply (preview) | redbook batch-reply <url> --strategy questions --dry-run |
| Like a note | redbook like <url> |
| Unlike a note | redbook like <url> --undo |
| List favorites | redbook favorites --json or redbook favorites <userId> --json |
| Collect a note | redbook collect <url> |
| Remove from collection | redbook uncollect <url> |
| List followers | redbook followers <userId> --json |
| List following | redbook following <userId> --json |
| Delete own note | redbook delete <url> |
| Check note health | redbook health --json or redbook health --all --json |
| List user boards | redbook boards or redbook boards <userId> --json |
| List album notes | redbook board <board-url> or redbook board <boardId> --json |
| Render markdown to cards | redbook render content.md --style xiaohongshu |
| Publish image note | redbook post --title "..." --body "..." --images img.jpg |
| Check connection | redbook whoami |
Always add --json when parsing output programmatically. Without it, output is human-formatted text.
XHS is not Twitter or Instagram. These platform-specific engagement ratios reveal content type and audience behavior.
collected_count / liked_count)XHS's "collect" (收藏) is a save-for-later mechanic — users build personal reference libraries. This ratio is the strongest signal of content utility.
| Ratio | Classification | Meaning |
|---|---|---|
| >40% | 工具型 (Reference) | Tutorial, checklist, template — users bookmark for reuse |
| 20–40% | 认知型 (Insight) | Thought-provoking but not saved for later |
| <20% | 娱乐型 (Entertainment) | Consumed and forgotten — engagement is passive |
comment_count / liked_count)Measures how much a note triggers conversation.
| Ratio | Classification | Meaning |
|---|---|---|
| >15% | 讨论型 (Discussion) | Debate, sharing experiences, asking questions |
| 5–15% | 正常互动 (Normal) | Typical engagement pattern |
| <5% | 围观型 (Passive) | Users like but don't engage further |
share_count / liked_count)Measures social currency — whether users share to signal identity or help others.
| Ratio | Meaning |
|---|---|
| >10% | 社交货币 — people share to signal taste, identity, or help friends |
| <10% | Content consumed individually, not forwarded |
| Sort | What It Reveals |
|---|---|
--sort popular | Proven ceiling — the best a keyword can do |
--sort latest | Content velocity — how much is being posted now |
--sort general | Algorithm-weighted blend (default) |
| Form | Tendency |
|---|---|
图文 (image-text, type: "normal") | Higher collect rate — users save reference content |
视频 (video, type: "video") | Higher like rate — easier to consume passively |
Each module is a composable building block. Combine them for different analysis depths.
Answers: Which keywords have the highest engagement ceiling? Which are saturated vs. underserved?
Commands:
redbook search "keyword1" --sort popular --json
redbook search "keyword2" --sort popular --json
# Repeat for each keyword in your list
Fields to extract from each result's items[]:
items[].note_card.interact_info.liked_count — likes (may use Chinese numbers: "1.5万" = 15,000)items[].note_card.interact_info.collected_count — collectsitems[].note_card.interact_info.comment_count — commentsitems[].note_card.user.nickname — authorHow to interpret:
items[0] likes — the best-performing note for this keyword. This is the proven demand signal.items[0..9] — how well an average top note does.Output: Keyword × engagement table ranked by Top1 ceiling.
| Keyword | Top1 Likes | Top10 Avg | Top1 Collects | Collect/Like |
|---|---|---|---|---|
| keyword1 | 12,000 | 3,200 | 5,400 | 45% |
| keyword2 | 8,500 | 4,100 | 1,200 | 14% |
Answers: Which topic × scene intersections have demand? Where are the content gaps?
Commands:
# Combine base topic with scene/angle keywords
redbook search "base topic + scene1" --sort popular --json
redbook search "base topic + scene2" --sort popular --json
redbook search "base topic + scene3" --sort popular --json
Fields to extract: Same as Module A — Top1 liked_count for each combination.
How to interpret:
Output: Base × Scene heatmap.
scene1 scene2 scene3 scene4
base topic ████ 8K ██ 2K ████ 12K ░░ 200
Answers: What type of content is each keyword? Reference, insight, or entertainment?
Commands: Use search results from Module A, or for a single note:
redbook analyze-viral "<noteUrl>" --json
Fields to extract:
interact_info fieldsanalyze-viral: use pre-computed engagement.collectToLikeRatio, engagement.commentToLikeRatio, engagement.shareToLikeRatioHow to interpret: Apply the ratio benchmarks from XHS Platform Signals above.
Output: Per-keyword or per-note classification.
| Keyword | Collect/Like | Comment/Like | Type |
|---|---|---|---|
| keyword1 | 45% | 8% | 工具型 + 正常互动 |
| keyword2 | 12% | 22% | 娱乐型 + 讨论型 |
Answers: Who are the key creators in this niche? What are their strategies?
Commands:
# 1. Collect unique user_ids from search results across keywords
# Extract from items[].note_card.user.user_id
# 2. For each creator:
redbook user "<userId>" --json
redbook user-posts "<userId>" --json
Fields to extract:
user: interactions[] where type === "fans" → follower countuser-posts: notes[].interact_info.liked_count for all posts → compute avg, median, maxuser-posts: notes[].display_title → content patterns, posting frequencyHow to interpret:
Output: Creator comparison table.
| Creator | Followers | Avg Likes | Median | Max | Posts | Style |
|---|---|---|---|---|---|---|
| @creator1 | 12万 | 3,200 | 1,800 | 45,000 | 89 | Tutorial |
| @creator2 | 5.4万 | 8,100 | 6,500 | 22,000 | 34 | Story |
Answers: Do image-text or video notes perform better for this topic?
Commands:
redbook search "keyword" --type image --sort popular --json
redbook search "keyword" --type video --sort popular --json
Fields to extract:
liked_count and collected_count between the two result setstype field: "normal" = image-text, "video" = videoOutput: Form × engagement table.
| Form | Top1 Likes | Top10 Avg | Collect/Like |
|---|---|---|---|
| 图文 | 8,000 | 2,400 | 42% |
| 视频 | 15,000 | 5,100 | 18% |
Answers: Which keywords should I target? Where is the best effort-to-reward ratio?
Input: Keyword matrix from Module A.
Scoring logic:
Tier thresholds (based on Top1 likes):
| Tier | Top1 Likes | Meaning |
|---|---|---|
| S | >100,000 (10万+) | Massive demand — hard to compete but huge upside |
| A | 20,000–100,000 | Strong demand — competitive but winnable |
| B | 5,000–20,000 | Moderate demand — good for growing accounts |
| C | <5,000 | Niche — low competition, low ceiling |
Output: Tiered keyword list.
| Tier | Keyword | Top1 | Competition | Opportunity |
|---|---|---|---|---|
| A | keyword1 | 45K | Medium (6/10 >1K) | High |
| B | keyword3 | 12K | Low (2/10 >1K) | Very High |
| S | keyword2 | 120K | High (10/10 >1K) | Medium |
Answers: Who is the audience for this niche? What do they want?
Input: Engagement ratios from Module C + comment themes from analyze-viral + content patterns.
Fields to extract from analyze-viral JSON:
comments.themes[] — recurring phrases and keywords from comment sectioncomments.questionRate — % of comments that are questions (learning intent)engagement.collectToLikeRatio — save behavior signals intenthook.hookPatterns[] — what title patterns attract this audienceInference rules:
Output: Audience persona summary — demographics, intent, content preferences.
Answers: What specific content should I create, backed by data?
Input: Opportunity scores (Module F) + audience persona (Module G) + heatmap gaps (Module B).
For each content idea, specify:
hookPatterns that work for this nicheOutput: Ranked content ideas with data backing.
| # | Keyword | Hook Angle | Type | Target Likes | Reference |
|---|---|---|---|---|---|
| 1 | keyword3 | "N个方法..." (List) | 工具型 图文 | 5K+ | [top note URL] |
| 2 | keyword1 | "为什么..." (Question) | 认知型 视频 | 10K+ | [top note URL] |
Answers: Which comments deserve a reply? What is the comment quality distribution?
Commands:
# 1. Fetch all comments
redbook comments "<noteUrl>" --all --json
# 2. Preview reply candidates (dry run)
redbook batch-reply "<noteUrl>" --strategy questions --dry-run --json
# 3. Execute replies with template (5 min delay with ±30% jitter)
redbook batch-reply "<noteUrl>" --strategy questions \
--template "感谢提问!关于{content},..." \
--max 10
Fields to extract from --dry-run JSON:
candidates[].commentId — target commentscandidates[].isQuestion — boolean, detected questioncandidates[].likes — engagement signalcandidates[].hasSubReplies — whether already answeredskipped — how many comments were filtered outtotalComments — total fetchedStrategies:
questions — replies to comments ending with ? or ? (learning-oriented audience)top-engaged — replies to highest-liked comments (maximum visibility)all-unanswered — replies to comments with no existing sub-replies (fill gaps)How to interpret:
Safety: Hard cap 30 replies per batch, minimum 3-minute delay with ±30% jitter (default 5 min), --dry-run by default (no template = preview only), immediate stop on captcha. See Rate Limits & Safety for details.
Output: Reply plan table with candidate comments, strategy match reason, and status.
Answers: What structural template can I extract from successful notes to guide new content creation?
Commands:
# 1. Find top notes for a keyword
redbook search "keyword" --sort popular --json
# 2. Extract structural template from 2-3 top performers
redbook viral-template "<url1>" "<url2>" "<url3>" --json
Fields to extract from viral-template JSON:
dominantHookPatterns[] — hook types appearing in majority of notestitleStructure.commonPatterns[] — specific title formulatitleStructure.avgLength — target title lengthbodyStructure.lengthRange — target word count [min, max]bodyStructure.paragraphRange — target paragraph countengagementProfile.type — reference/insight/entertainmentaudienceSignals.commonThemes[] — what the audience talks aboutHow to interpret:
Composition with other modules:
Output: Content template spec — the structural skeleton for content creation. An LLM (via the composed workflow) uses this template to generate actual title, body, hashtags, and cover image prompt.
Answers: How should I manage ongoing engagement with my audience?
This module is a workflow that composes Modules I and J with human oversight.
Workflow:
redbook comments "<myNoteUrl>" --all --json to fetch recent commentsredbook batch-reply --strategy questions --dry-run to identify reply candidatesredbook batch-reply --strategy questions --template "..." --max 10Safety rules:
--dry-run first, human approval before executionhasSubReplies)Anti-spam guidelines:
Answers: How do I turn markdown content into Xiaohongshu-ready image cards?
Commands:
# Render markdown to styled PNG cards
redbook render content.md --style xiaohongshu
# Custom style and output directory
redbook render content.md --style dark --output-dir ./cards
# JSON output (for programmatic use)
redbook render content.md --json
Input: Markdown file with YAML frontmatter:
---
emoji: "🚀"
title: "5个AI效率技巧"
subtitle: "Claude Code 实战"
---
## 技巧一:...
Content here...
---
## 技巧二:...
More content...
Output: cover.png + card_1.png, card_2.png, ... in the same directory.
Card specs:
Pagination modes:
auto (default) — smart split on heading/paragraph boundaries using character-count heuristicseparator — manual split on --- in markdownHow to interpret:
puppeteer-core) — no browser download neededredbook post --images cover.png card_1.png ...Dependencies: Requires puppeteer-core and marked (optional, install with npm install -g puppeteer-core marked).
Composition with other modules:
redbook post --images for publishingAnswers: Are any of my notes being secretly rate-limited by XHS?
XHS assigns a hidden level field to each note in the creator backend API. This level controls recommendation distribution but is never shown in the UI. Your note may look "normal" while secretly receiving zero recommendations.
Commands:
# Check all notes (first page)
redbook health
# Check all pages
redbook health --all
# JSON output for programmatic use
redbook health --all --json
Level definitions:
| Level | Status | Meaning |
|---|---|---|
| 4 | 🟢 Normal | Full recommendation distribution |
| 2-3 | 🟡 Baseline | Basically normal, minor constraints |
| 1 | ⚪ New | Under review (new post) |
| -1 | 🔴 Soft limit | Mild throttling, decreased recommendations |
| -5 to -101 | 🔴 Moderate | Moderate throttling, minimal promotion |
| -102 | ⛔ Severe | Irreversible — must delete and repost |
Additional checks:
How to interpret:
Output: Terminal dashboard with color-coded distribution summary, limited notes list, and risk factor warnings.
Discovery credit: @xxx111god — xhs-note-health-checker
Combine modules for different analysis depths.
Modules: A → C → F
Search 3–5 keywords, classify engagement type, rank opportunities. Good for quickly validating whether a niche is worth deeper research.
Modules: A → B → E → F → H
Build keyword matrix, map topic × scene intersections, check content form performance, score opportunities, brainstorm specific content ideas.
Modules: A → D
Find who dominates a niche and study their content strategy, posting frequency, and engagement patterns.
Modules: A → B → C → D → E → F → G → H
The comprehensive playbook — keyword landscape, cross-topic heatmap, engagement signals, creator profiles, content form analysis, opportunity scoring, audience personas, and data-backed content ideas.
Command: redbook analyze-viral "<url>" --json
No module composition needed — analyze-viral returns hook analysis, engagement ratios, comment themes, author baseline comparison, and a 0-100 viral score in one call.
# 1. Find top notes
redbook search "keyword" --sort popular --json
# 2. Extract template from top 3 notes (replaces manual synthesis)
redbook viral-template "<url1>" "<url2>" "<url3>" --json
viral-template automates what previously required manual synthesis across analyze-viral results. It outputs a ContentTemplate JSON that captures dominant hooks, body structure ranges, engagement profile, and audience signals.
Modules: I
Single-module workflow for managing comment engagement on your notes. Use batch-reply --dry-run to audit, then execute with a template.
Modules: A → J → H → L
Keyword research → viral template extraction → data-backed content brainstorm → render to image cards. The template provides structural constraints that guide Module H's content ideas. Module L renders the final markdown to XHS-ready PNGs.
Modules: A → J → H → L → post
The full pipeline: research keywords → extract viral template → brainstorm content → write markdown → render to styled image cards → publish via redbook post --images cover.png card_1.png ...
Modules: M
Run redbook health --all periodically to catch throttled notes early. If level drops below 1, investigate the note's content for policy violations. Combine with Module I to check if throttled notes still have unanswered comments worth replying to.
Modules: A → C → I → J → K → M
Comprehensive automation playbook — keyword analysis, engagement classification, comment operations, viral replication templates, and engagement automation workflow.
XHS enforces aggressive anti-spam (风控) that detects automated behavior through device fingerprinting, activity ratio monitoring, and timing pattern analysis. The CLI applies safe defaults based on platform research.
| Action | Safe Interval | CLI Default | Hard Cap |
|---|---|---|---|
| Post a note | 3-4 hours (2-3 notes/day max) | N/A (manual) | — |
| Comment | ≥3 minutes | N/A (manual) | — |
| Reply | ≥3 minutes | N/A (manual) | — |
| Batch reply delay | ≥3 minutes | 5 min ±30% jitter | — |
| Batch reply count | — | 10 | 30 |
post, comment, and reply commands display safe interval reminders after each action.--dry-run first, review candidates, then execute--delay below 180000 (3 min)post commands 3-4 hours apart (2-3 notes/day maximum)The Rate Limits & Safety section above paces writes. This section paces reads — the part research actually hammers. read, comments, and analyze-viral each hit the note-detail API (with xsec_token); fire 30–50 in a tight loop and XHS returns NeedVerify / captcha or IP-blocks you (300012), and a tripped account stays degraded for hours. So research here is paced, not parallel: one note in flight, spaced to look like a human reading rather than a script scraping.
The mechanism is a file-based loop: a queue of work items on disk, one item processed per "tick", every result checkpointed immediately. Because all state lives in files, the loop is resumable and survives the session ending — a fresh session (or a Push heartbeat) picks up exactly where the last left off. Pairs with the /checkpoint-research skill for the synthesis step.
These are calibrated to XHS's documented read-side safe band, not guesswork. Community crawlers (MediaCrawler) default to ~2 s/request, single-threaded (CRAWLER_MAX_SLEEP_SEC = 2, MAX_CONCURRENCY_NUM = 1); reverse-engineering write-ups put the "looks-human" frequency at 3–8 s/request and the per-IP ceiling at ≤5 requests/min (~12 s average). This skill runs on your real logged-in cookies + residential IP + real Chrome fingerprint — the lowest-detection profile (the thing scrapers spend most of their effort faking). So the binding risk isn't request signature, it's behavioral: a human doesn't open 100 note-detail pages in two minutes. We therefore pace to a human reading cadence — comfortably inside the safe band — rather than to the absolute minimum. (5-minute gaps, an earlier guess, are ~25× slower than the ceiling and buy nothing.)
Detection here is distributional, not a rate threshold. XHS doesn't ask "more than N requests?" — it models what a human session looks like and flags the unlikely ones. Pure speed isn't the tell; the shape of the behavior is. Three things give a script away:
So the rule is not "go slow." It's make the session indistinguishable from a person reading: human-shaped variance (jitter + the occasional long pause, not merely a bigger constant), one note at a time with dwell, depth over breadth, a ceiling because people don't open hundreds of things, and stop at the first sign of friction (a human who hits a captcha backs off; a bot retries — so the circuit breaker is itself human-like behavior).
Safety-first is the more agentic posture, not the timid one. An agent's real advantage isn't speed — it's patience at scale: it can run a human-shaped loop for eight hours, which no person will. Aggression imitates the one human trait that gets you caught (impatience) and wastes the one the agent uniquely has (tireless patience). Lean into the patience.
This is a pattern, not a blacklist. The numbers here were calibrated from a real incident — a beverage-trend sweep that fired 3 distinct searches in ~5 seconds across ~17 unrelated keywords and drew an account-level warning. What's encoded is the behavioral invariant (human-shaped timing, one-in-flight, ceiling, dwell, breaker), not "don't run those queries." It generalizes to any topic and any platform. The incident is the calibration sample that tells us where the human/bot line sits — not a rule about matcha.
| Mode | Pace per note | ~Reads/min | Use when | Gate |
|---|---|---|---|---|
| 🚶 Steady (default) | 20 s ± 50% jitter (≈10–35 s) | ~2–3 | All routine / multi-note research | None — this is the default. |
| 🐢 Deep / overnight | 60–120 s | ~0.5–1 | Very large corpora, or minimizing footprint | Opt in for big jobs |
| ⚡ Fast (emergency) | 5 s ± jitter floor, max 30-note burst then cooldown | ~10+ (over the safe ceiling) | A genuine deadline that cannot wait | MUST print the fast-mode warning verbatim and get explicit user opt-in |
Invariants (all modes): never zero-delay, never parallel, exactly one note in flight, and — except for warned Fast mode — never exceed ~5 reads/min sustained. At the Steady pace a 100-note queue finishes in ~35–50 min; only very large queues or Deep mode stretch toward end-of-day.
for url in …; do redbook read …; done without the inter-item delay. No background & fan-out. (MAX_CONCURRENCY = 1, like the reference crawlers.)--comment-pages ≤ 3, count each comment page against the pace + budget like any other read, and never deep-crawl a comment thread.consumedToday >= dailyBudget, stop until the next calendar day — do not "just finish a few more".All state lives under one job directory — default research/xhs/<job-slug>/ in the private content repo (ask if unclear which repo):
| File | Role |
|---|---|
queue.jsonl | The work queue, one item per line: { "id": "001", "kind": "read|comments|analyze-viral|search|user|user-posts", "arg": "<url-or-keyword-or-userId>", "status": "pending|done|error", "note": "" } |
results/<id>.json | Raw --json output for each processed item (the corpus) |
loop-state.json | { "mode": "steady|deep|fast", "intervalSec": 20, "jitterPct": 50, "dailyBudget": 200, "consumedToday": 0, "budgetDate": "YYYY-MM-DD", "lastTickAt": "<iso>", "breaker": "ok|tripped", "breakerReason": null } (Deep: intervalSec 60–120; Fast: intervalSec 5 + mode:"fast") |
manifest.md | Human-readable plan + progress (checkmark per done item) — the recovery checkpoint, updated every tick |
SUMMARY.md | Final synthesis, written once the queue drains |
Seeding the queue: the cheap list endpoints (search, feed, user-posts — no xsec_token cost) are how you build the queue, not part of the paced loop. Run a search/feed up front, extract each webUrl, and write one read/analyze-viral item per note into queue.jsonl. Then the loop drains the expensive detail-reads at the Steady pace. Always carry the fresh webUrl (with token) as the item arg — tokens expire, so don't queue bare noteIds (see xsec_token).
A tick is the unit both drivers call. It processes exactly one item and stops:
loop-state.json. If breaker != "ok" → exit immediately (surface the reason; do not process).budgetDate != today, reset consumedToday = 0, budgetDate = today. If consumedToday >= dailyBudget → exit until tomorrow.pending item from queue.jsonl. If none remain → finalize: synthesize SUMMARY.md from results/, mark the loop complete, and tell the driver to stop. Done.redbook <kind> "<arg>" --json.{}, NeedVerify/captcha, Session expired, IP block 300012). If found → trip the breaker: set breaker:"tripped" + breakerReason, leave the item pending (not consumed), persist, surface to the user, STOP. (See Circuit breaker.)results/<id>.json, set the item status:"done", check it off in manifest.md, consumedToday += 1, set lastTickAt.The same tick, driven two ways. Pick by how long the job must outlive the current session:
A) In-session pacing (default — Steady). The simplest driver and the right one for most jobs: run ticks back-to-back inside one session, sleeping intervalSec ± jitter between each read. At Steady pace a 100-note queue finishes in ~35–50 min — comfortably within one session — and every item is checkpointed, so a crash just resumes. No scheduler needed. (Pace per read; don't fake it with one long blocking sleep.)
B) /loop self-paced (Deep / overnight / survives idle gaps). For Deep mode or jobs spanning hours, drive ticks with /loop without an interval (self-paced): do one tick, then ScheduleWakeup for intervalSec + jitter, and re-fire. ScheduleWakeup clamps to ≥60 s, so it fits Deep (60–120 s) — not sub-minute Steady (run that in-session, per A). The loop ends itself when the queue drains (Step 3) or the breaker trips (Step 5).
C) Hook — Push recurring issue (multi-day / true background, survives app restarts). Best for "run this every day" research or corpora too big for one day. Create a recurring issue (recurringIntervalSec — a few minutes per tick, or a daily sweep) assigned to a redbook-capable agent. Each occurrence is a fresh session = one tick against the same job directory — so the issue body must point at the job dir and say "run exactly one tick, then exit." It keeps firing until a human retires it; the agent finalizes + comments when the queue drains, and comments + leaves it for human attention if the breaker trips. (Push "scout" pattern — the schedule lives on the issue, the agent's Auto switch gates it.)
Why file-state, not a long-lived process: a single blocking
sleep-loop dies with the terminal and can't be resumed. The queue +loop-state.json+ per-item result files mean any future session —/loopwake, Push heartbeat, or a human re-running the tick — continues exactly where the last stopped, and the partial corpus is never lost.
Before any fast-mode run, print this verbatim and require an explicit "yes, fast mode" from the user. Do not proceed on silence or a vague affirmative:
⚠️ XHS FAST RESEARCH MODE — elevated 风控 risk
Reading at ~12 notes/min crosses the safe ~5/min ceiling and
materially raises the chance of:
• captcha / NeedVerify mid-run (stops the loop)
• IP block (error 300012) for hours
• account-level recommendation throttling
Steady mode (the default, ~20s/note) stays inside the safe band
and still finishes a typical job in well under an hour.
Proceed with fast mode for this run only? (yes / no)
Fast mode still obeys the floor (≥5 s ± jitter between notes — never zero), caps a burst at 30 notes then forces a cooldown back to Steady pace, and the circuit breaker still hard-stops on the first 风控 signal. Fast mode raises the pace past the safe ceiling; it never removes the floor or the breaker.
Trip the breaker and STOP the whole loop on the first of any of these from a read:
| Signal | Where it shows |
|---|---|
NeedVerify / captcha | error message or response flag |
Empty {} response | usually expired/invalid xsec_token or anti-scrape — re-seed the queue with fresh webUrls, don't retry blindly |
Session expired / "No 'a1' cookie" | cookie invalid — re-login in Chrome before resuming |
IP block 300012 | hard rate limit — wait it out / switch network |
On trip: set loop-state.breaker:"tripped" + breakerReason, leave the current item pending, persist, and surface to the user (or comment on the Push issue). Never auto-retry through a block — that's what escalates a soft throttle into a multi-hour block. Resume only after a human clears the cause and explicitly says to continue.
The following operations work reliably via API:
The following operations are unreliable via API (frequently trigger captcha):
--private for higher success rate)The following operations require browser automation (not supported by this CLI):
redbook search <keyword>Search for notes by keyword. Returns note titles, URLs, likes, author info.
redbook search "Claude Code教程" --json
redbook search "AI编程" --sort popular --json # Sort: general, popular, latest
redbook search "Cursor" --type image --json # Type: all, video, image
redbook search "MCP Server" --page 2 --json # Pagination
Options:
--sort <type>: general (default), popular, latest--type <type>: all (default), video, image--page <n>: Page number (default: 1)redbook read <url>Read a note's full content — title, body text, images, likes, comments count.
redbook read "https://www.xiaohongshu.com/explore/abc123" --json
Accepts full URLs or short note IDs. Falls back to HTML scraping if API returns captcha.
redbook comments <url>Get comments on a note. Use --all to fetch all pages.
redbook comments "https://www.xiaohongshu.com/explore/abc123" --json
redbook comments "https://www.xiaohongshu.com/explore/abc123" --all --json
redbook user <userId>Get a creator's profile — nickname, bio, follower count, note count, likes received.
redbook user "5a1234567890abcdef012345" --json
The userId is the hex string from the creator's profile URL.
redbook user-posts <userId>List all notes posted by a creator. Returns titles, URLs, likes, timestamps.
redbook user-posts "5a1234567890abcdef012345" --json
redbook feedBrowse the recommendation feed.
redbook feed --json
redbook topics <keyword>Search for topic hashtags. Useful for finding trending topics to attach to posts.
redbook topics "Claude Code" --json
redbook favorites [userId]List a user's collected (bookmarked) notes. Defaults to the current logged-in user when no userId is provided.
redbook favorites --json # Your own favorites
redbook favorites "5a1234567890abcdef" --json # Another user's favorites
redbook favorites --all --json # Fetch all pages
Options:
--all: Fetch all pages of favorites (default: first page only)Note: Other users' favorites are only visible if they haven't set their collection to private.
redbook collect <url>Collect (bookmark) a note to your favorites.
redbook collect "https://www.xiaohongshu.com/explore/abc123"
redbook uncollect <url>Remove a note from your collection.
redbook uncollect "https://www.xiaohongshu.com/explore/abc123"
redbook analyze-viral <url>Analyze why a viral note works. Returns a deterministic viral score (0–100).
redbook analyze-viral "https://www.xiaohongshu.com/explore/abc123" --json
redbook analyze-viral "https://www.xiaohongshu.com/explore/abc123" --comment-pages 5
Options:
--comment-pages <n>: Comment pages to fetch (default: 3, max: 10)JSON output structure:
Returns { note, score, hook, content, visual, engagement, comments, relative, fetchedAt }.
score.overall (0–100) — composite of hook (20) + engagement (20) + relative (20) + content (20) + comments (20)hook.hookPatterns[] — detected title patterns (Identity Hook, Emotion Word, Number Hook, Question, etc.)engagement — likes, comments, collects, shares + ratios (collectToLikeRatio, commentToLikeRatio, shareToLikeRatio)relative.viralMultiplier — this note's likes / author's median likesrelative.isOutlier — true if viralMultiplier > 3comments.themes[] — top recurring keyword phrases from commentsredbook viral-template <url> [url2] [url3]Extract a reusable content template from 1-3 viral notes. Analyzes each note (same pipeline as analyze-viral) and synthesizes common structural patterns.
redbook viral-template "<url1>" "<url2>" "<url3>" --json
redbook viral-template "<url1>" --comment-pages 5 --json
Options:
--comment-pages <n>: Comment pages to fetch per note (default: 3, max: 10)JSON output structure:
Returns { dominantHookPatterns, titleStructure, bodyStructure, engagementProfile, audienceSignals, sourceNotes, generatedAt }.
dominantHookPatterns[] — hook types appearing in majority of input notestitleStructure.avgLength — average title length across notesbodyStructure.lengthRange — [min, max] body lengthengagementProfile.type — "reference" / "insight" / "entertainment"audienceSignals.commonThemes[] — merged comment themes across notesredbook comment <url>Post a top-level comment on a note.
redbook comment "<noteUrl>" --content "Great post!" --json
Options:
--content <text> (required): Comment textredbook reply <url>Reply to a specific comment on a note.
redbook reply "<noteUrl>" --comment-id "<commentId>" --content "Thanks for asking!" --json
Options:
--comment-id <id> (required): Comment ID to reply to (from comments --json output)--content <text> (required): Reply textredbook batch-reply <url>Reply to multiple comments using a filtering strategy. Always preview with --dry-run first.
# Preview which comments match the strategy
redbook batch-reply "<noteUrl>" --strategy questions --dry-run --json
# Execute replies with a template (default 5 min delay with jitter)
redbook batch-reply "<noteUrl>" --strategy questions \
--template "感谢提问!{content}" --max 10
Options:
--strategy <name>: questions (default), top-engaged, all-unanswered--template <text>: Reply template with {author}, {content} placeholders--max <n>: Max replies (default: 10, hard cap: 30)--delay <ms>: Delay between replies in ms (default: 300000 / 5 min, min: 180000 / 3 min). ±30% random jitter applied automatically.--dry-run: Preview candidates without posting (default when no template)Safety: Stops immediately on captcha. No template = dry-run only. Delays include random jitter to avoid uniform timing patterns that trigger XHS bot detection.
redbook render <file>Render a markdown file with YAML frontmatter into styled PNG image cards. Uses the user's existing Chrome installation — no browser download needed.
redbook render content.md --style xiaohongshu
redbook render content.md --style dark --output-dir ./cards
redbook render content.md --pagination separator --json
Options:
--style <name>: purple, xiaohongshu (default), mint, sunset, ocean, elegant, dark--pagination <mode>: auto (default), separator (split on ---)--output-dir <dir>: Output directory (default: same as input file)--width <n>: Card width in px (default: 1080)--height <n>: Card height in px (default: 1440)--dpr <n>: Device pixel ratio (default: 2)Requires: puppeteer-core and marked (npm install -g puppeteer-core marked). Does NOT require XHS cookies — purely offline rendering.
Override Chrome path: Set CHROME_PATH environment variable if Chrome is not in the standard location.
redbook whoamiCheck connection status. Verifies cookies are valid and shows the logged-in user.
redbook whoami
redbook post (Limited)Publish an image note. Frequently triggers captcha (type=124) on the creator API. Image upload works, but the publish step is unreliable. For posting, consider using browser automation instead.
redbook post --title "标题" --body "正文" --images cover.png --json
redbook post --title "测试" --body "..." --images img.png --private --json
Options:
--title <title>: Note title (required)--body <body>: Note body text (required)--images <paths...>: Image file paths (required, at least one)--topic <keyword>: Search and attach a topic hashtag--private: Publish as private noteAll commands accept:
--cookie-source <browser>: chrome (default), safari, firefox--chrome-profile <name>: Chrome profile directory name (e.g., "Profile 1"). Auto-discovered if omitted.--json: Output as JSONThe XHS API requires a valid xsec_token to fetch note content. Without it, read, comments, and analyze-viral return {}.
The same token is also required for shareable URLs. Any https://www.xiaohongshu.com/explore/<id> URL without ?xsec_token=...&xsec_source=... is 302-redirected by XHS's anti-scrape layer to https://www.xiaohongshu.com/404/sec_*?source=xhs_sec_server&originalUrl=.... This affects anyone who clicks the URL — Safari, iOS link previews, agent action buttons, etc.
webUrl — use this (since v0.7.0):
Every note-returning command (feed, search, user-posts, favorites, board, read, post) now includes a webUrl field with the token baked in and the correct xsec_source. Consumers should use webUrl directly — do not construct URLs by hand.
redbook feed --json | jq '.items[0].webUrl'
# => "https://www.xiaohongshu.com/explore/<id>?xsec_token=<t>&xsec_source=pc_feed"
xsec_source is set per-command: pc_feed, pc_search, pc_user, pc_board, pc_share.
Key rules:
?xsec_token=... from a previous session will return {}. Never cache or reuse old URLs.search and feed always return fresh tokens. Every item includes a valid xsec_token + a pre-built webUrl.{}. Running redbook read <noteId> without a token almost always fails.The correct workflow — always search first:
# WRONG — stale URL or bare noteId, will likely return {}
redbook read "689da7b0000000001b0372c6" --json
redbook read "https://www.xiaohongshu.com/explore/689da7b0?xsec_token=OLD_TOKEN" --json
# RIGHT — search first, then use the fresh URL with token
redbook search "AI编程" --sort popular --json
# Extract the webUrl from search results, then:
redbook read "<webUrl from search result>" --json
For agents: Prefer webUrl from the response. When only a bare noteId is available, search first to obtain a fresh token, then use the returned webUrl.
Commands that need xsec_token: read, comments, analyze-viral
Commands that do NOT need xsec_token: search, user, user-posts, feed, whoami, topics
The XHS API returns abbreviated numbers with Chinese unit suffixes:
| API value | Actual number |
|---|---|
"1.5万" | 15,000 |
"2.4万" | 24,000 |
"1.2亿" | 120,000,000 |
"115" | 115 |
万 = ×10,000. 亿 = ×100,000,000. Numbers under 10,000 are plain integers as strings.
The analyze-viral command handles this automatically. When parsing --json output manually, watch for these suffixes in interact_info fields (liked_count, collected_count, etc.).
| Error | Meaning | Fix |
|---|---|---|
{} empty response | Missing or expired xsec_token | Search first to get a fresh token |
| "No 'a1' cookie" | Not logged into XHS in browser | Log into xiaohongshu.com in Chrome |
| "Session expired" | Cookie too old | Re-login in Chrome |
| "NeedVerify" / captcha | Anti-bot triggered | Wait and retry, or reduce request frequency |
| "IP blocked" (300012) | Rate limited | Wait or switch network |
When producing analysis reports, use these formats:
Data tables: Markdown tables with exact field mappings. Always include the metric unit.
Heatmaps: ASCII bar charts for cross-topic comparison:
职场 生活 教育 创业
AI编程 ████ 8K ██ 2K ████ 12K ░░ 200
Claude Code ██ 3K ░░ 100 ██ 4K █ 1K
Creator comparison: Structured table with both quantitative metrics and qualitative style assessment.
Final reports: Use this section order:
import { XhsClient } from "@lucasygu/redbook";
import { loadCookies } from "@lucasygu/redbook/cookies";
const cookies = await loadCookies("chrome");
const client = new XhsClient(cookies);
const results = await client.searchNotes("AI编程", 1, 20, "popular");
const topics = await client.searchTopics("Claude Code");
--cookie-source)puppeteer-core and marked (npm install -g puppeteer-core marked). Uses your existing Chrome — no additional browser download.