# Daily Content Curator

Curates personalized content based on your interests. Supports two modes:

- **Standalone** — works locally, no external account needed
- **Eir** — full curation + delivery via heyeir.com
## Getting Started

Before setup, ask the user which mode to use:
| Mode | What it does | Requirements |
|------|--------------|--------------|
| Standalone | Search → curate → generate summaries locally | Search API key (Brave, Tavily, etc.) |
| Eir | Full pipeline with delivery to Eir app | Eir account + pairing code |
**Important:** The two modes use different content formats and topic slug conventions. Choose the correct mode at setup time — switching later requires reconfiguration. If the user has an Eir account, use Eir mode.
Then follow the corresponding setup section below.
## Standalone Mode

### Flow
1. Configure → Set up search API + interests (one-time)
2. Search → Search API queries for each interest topic
3. Select + Crawl → Agent picks best candidates, fetches full content
4. Generate → Agent writes structured summaries from task files
5. Daily Brief → Agent compiles brief from generated items
Steps 1-3 are Python scripts you run directly. Steps 4-5 are agent-driven — you tell your OpenClaw agent to read the task files and generate content. The agent uses whatever LLM model is configured in your OpenClaw session (e.g. Claude, GPT-4, Gemini).
### Quick Start

**1. Initialize workspace** — creates the `config/` directory and default settings:
```bash
# Option A: inline JSON
python3 scripts/setup.py --init --settings '{
  "mode": "standalone",
  "language": "en",
  "search": {
    "search_base_url": "https://api.search.brave.com/res/v1",
    "search_api_key": "YOUR_BRAVE_API_KEY"
  }
}'

# Option B: settings file (recommended for PowerShell/Windows)
python3 scripts/setup.py --init --settings-file path/to/settings.json
```
Search provider examples:
| Provider | `search_base_url` | Get API key |
|----------|-------------------|-------------|
| Brave Search | `https://api.search.brave.com/res/v1` | [brave.com/search/api](https://brave.com/search/api/) |
| Tavily | `https://api.tavily.com` | [tavily.com](https://tavily.com/) |
> **Want richer results?** Install [SearXNG](https://docs.searxng.org/) and/or [Crawl4AI](https://github.com/unclecode/crawl4ai) locally. Add `searxng_url` and `crawl4ai_url` to your search config — they work as fallback or primary search/crawl providers.
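For example, here is a minimal sketch of patching the generated settings to register both fallbacks (the `searxng_url` and `crawl4ai_url` keys come from the note above; the localhost URLs are assumptions for a typical local install):

```python
import json
from pathlib import Path

settings_path = Path("config/settings.json")
settings = json.loads(settings_path.read_text())

# Optional local providers. The URLs are assumed defaults for a local
# install; point them at wherever SearXNG / Crawl4AI actually run.
settings.setdefault("search", {})
settings["search"]["searxng_url"] = "http://localhost:8080"
settings["search"]["crawl4ai_url"] = "http://localhost:11235"

settings_path.write_text(json.dumps(settings, indent=2))
```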
**2. Set up interests** — edit the generated `config/interests.json`:
```json
{
"topics": [
{"label": "AI Agents", "keywords": ["autonomous agents", "tool use"], "freshness": "7d"},
{"label": "Prompt Engineering", "keywords": ["prompting", "chain-of-thought"]}
],
"language": "en",
"max_items_per_day": 8
}
```
Interests can also be auto-extracted — see `references/interest-extraction-prompt.md`.
**Freshness and tier:** Each topic supports optional `freshness` (e.g. `"3d"`, `"7d"`, `"14d"`) and `tier` (`"focus"`, `"tracked"`, `"explore"`, `"seed"`) fields in `interests.json`. In Eir mode these come from the API directives; in standalone mode the defaults are `"7d"` and `"tracked"`. The search pipeline uses `tier` to decide search depth (focus/tracked get entity refinement) and `freshness` to filter stale results.
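As an illustration, a freshness filter can be as small as this (a sketch of the idea, not `search.py`'s actual internals; the result shape with a `published_at` ISO timestamp is an assumption):

```python
from datetime import datetime, timedelta, timezone

def parse_freshness(freshness: str) -> timedelta:
    """Convert a freshness string like "7d" into a timedelta."""
    return timedelta(days=int(freshness.rstrip("d")))

def is_fresh(published_at: str, freshness: str = "7d") -> bool:
    """True if a timezone-aware ISO-8601 publish date falls inside the window."""
    cutoff = datetime.now(timezone.utc) - parse_freshness(freshness)
    return datetime.fromisoformat(published_at) >= cutoff

# Drop anything older than a topic's window, e.g.:
# results = [r for r in results if is_fresh(r["published_at"], topic.get("freshness", "7d"))]
```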
**3. Run the search + crawl pipeline** (from the `scripts/` directory):

```bash
cd scripts
python3 -m pipeline.search              # Search for each topic
python3 -m pipeline.candidate_selector  # Group results for agent selection
# ↓ Agent step: review topic files, write candidates.json (see below)
python3 -m pipeline.crawl               # Fetch full content
python3 -m pipeline.task_builder        # Bundle into task files
```
All `python3 -m pipeline.*` commands must be run from the `scripts/` directory.
**Agent selection step** (between `candidate_selector` and `crawl`):

`candidate_selector` outputs per-topic JSON files to `data/v9/topics/`. Your agent should:

- Read each topic file
- Pick 0-3 candidates per topic based on relevance and freshness
- Write `data/v9/candidates.json` with the selected candidates

See `references/candidates-spec.md` for the exact JSON format.
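A minimal sketch of that selection loop (the topic-file and candidate field names below are illustrative assumptions; `references/candidates-spec.md` remains the authoritative format):

```python
import json
from pathlib import Path

selected = []
for topic_file in sorted(Path("data/v9/topics").glob("*.json")):
    topic = json.loads(topic_file.read_text())
    # Assumed shape: each topic file carries a slug and a "results" list.
    # Rank however the agent judges relevance/freshness, then keep at most 3.
    for result in topic.get("results", [])[:3]:
        selected.append({"topic_slug": topic.get("topic_slug"), "url": result.get("url")})

Path("data/v9/candidates.json").write_text(json.dumps({"candidates": selected}, indent=2))
```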
**Note on crawl fallback:** If a candidate URL isn't in the search cache, `crawl.py` automatically tries: Browse API → Crawl4AI → `web_fetch` → HTML head extraction. No manual intervention needed.
**4. Generate content** (agent-driven):

After `task_builder`, task files are in `data/v9/tasks/`. Tell your OpenClaw agent:

```
Read the task files in data/v9/tasks/ and generate content for each one.
Use the writer prompt in references/writer-prompt-standalone.md.
Save output to data/output/{YYYY-MM-DD}/.
```
**Language:** The output language is resolved in order of precedence: the task's `output_lang` field → the curation API's `user.primaryLanguage` → the `language` field in `settings.json`. If none are set, generate content in the user's chat language.
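In code, that precedence chain could look like this (a sketch; the dictionary shapes are assumptions):

```python
def resolve_output_language(task: dict, api_user: dict, settings: dict) -> str | None:
    """Walk the precedence chain: task field -> API profile -> settings file.
    Returns None when nothing is set, meaning: use the user's chat language."""
    return (
        task.get("output_lang")
        or api_user.get("primaryLanguage")
        or settings.get("language")
    )
```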
**Scheduling tip:** If you want automated daily runs, you can set up a cron job:

```bash
openclaw cron add --name "daily-curate" \
  --cron "0 8 * * *" --tz "Asia/Shanghai" \
  --session isolated \
  --message "Read SKILL.md for eir-daily-content-curator, then run the full standalone pipeline: search → select → crawl → task_builder → generate content from task files → compile daily brief."
```
### Output

Content is saved to `data/output/{YYYY-MM-DD}/`. The daily brief compiles the top items:
```markdown
# Daily Brief — 2026-04-20

🔥 **Meta cuts 8,000 jobs for AI pivot** — ...
📡 **China bans AI companions for minors** — ...
🌱 **New prompt engineering benchmark** — ...
```
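A sketch of how the agent (or a helper script) might compile that brief; the per-item `emoji`/`title`/`summary` fields are assumptions, so match them to whatever your writer prompt actually emits:

```python
import json
from datetime import date
from pathlib import Path

day = date.today().isoformat()
out_dir = Path(f"data/output/{day}")

lines = [f"# Daily Brief — {day}", ""]
for item_file in sorted(out_dir.glob("*.json")):
    item = json.loads(item_file.read_text())
    # Assumed item fields: "emoji", "title", "summary".
    lines.append(f"{item.get('emoji', '📰')} **{item['title']}** — {item['summary']}")

(out_dir / "brief.md").write_text("\n".join(lines))
```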
### Dependencies

**Required:** Python 3.10+ (standard library only — no `pip install` needed).
**Optional:** SearXNG (fallback search), Crawl4AI (fallback crawl).
## Eir Mode

Full curation with delivery to the Eir app via a 3-job pipeline:

- **Job A: material-prep** — Search → Select → Crawl → Pack tasks
- **Job B: content-gen** — Spawn subagents → Generate → POST to Eir
- **Job C: daily-brief** — Check status → Fill gaps → Compile brief → Deliver to user
### Setup

- Get a pairing code from heyeir.com → Settings → Connect OpenClaw
- Run `python3 scripts/connect.py <PAIRING_CODE>`
- Set `"mode": "eir"` in `config/settings.json`
### Running the Pipeline (Eir Mode)

All `python3 -m pipeline.*` commands must be run from the `scripts/` directory.

**Step 1: Sync directives** — fetch topic slugs and curation rules from the Eir API:

```bash
cd scripts && python3 -m pipeline.eir_sync fetch
```
This creates/updates `config/directives.json` with the canonical topic slugs (e.g. `ai-agents`, `autonomous-vehicles`). All downstream scripts use these slugs — do NOT use `interests.json` labels as `topic_slug` in Eir mode.
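A cheap guard against the wrong-slug mistake (a sketch; the internal structure of `directives.json`, a `topics` list of objects with a `slug` field, is an assumption):

```python
import json
from pathlib import Path

directives = json.loads(Path("config/directives.json").read_text())
# Assumed shape: {"topics": [{"slug": "ai-agents", ...}, ...]}
valid_slugs = {d["slug"] for d in directives.get("topics", [])}

def assert_valid_slug(topic_slug: str) -> None:
    if topic_slug not in valid_slugs:
        raise ValueError(f"{topic_slug!r} is not a canonical Eir slug; re-run eir_sync fetch")
```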
**Step 2: Search + Select + Crawl + Pack:**

```bash
python3 -m pipeline.search              # Search for each directive topic
python3 -m pipeline.candidate_selector  # Group results for agent selection
# ↓ Agent step: review topic files, write candidates.json (see references/candidates-spec.md)
python3 -m pipeline.crawl               # Fetch full content from candidate URLs
python3 -m pipeline.task_builder        # Bundle into task files (auto-selects eir writer prompt)
```
**Step 3: Generate and publish** (agent-driven):

Task files are in `data/v9/tasks/`. For each task file:

- Read the task JSON (contains `source_text`, `source_meta`, `suggested_angle`, `topic_slug`, `content_slug`)
- Build a generation prompt using `generate.build_generation_prompt(task_data)` — this loads the writer prompt from `references/writer-prompt-eir.md` and injects sources + context
- Call the LLM with the prompt to generate Eir-format JSON (see `references/content-spec.md`)
- Parse the JSON response and POST via `python3 -m pipeline.eir_post --file <generated.json>`, or call `eir_post.post_content(data)` programmatically
- Repeat for remaining tasks; already-posted slugs are skipped automatically

This step requires an LLM — that's why `generate.py` has no `main()`. The agent orchestrates the loop, as sketched below.
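Put together, the orchestration loop looks roughly like this. It is a sketch, not the skill's shipped code: `call_llm` is a placeholder for however your session invokes the model, while `build_generation_prompt` and `post_content` are the functions named above:

```python
import json
from pathlib import Path

from pipeline import generate, eir_post  # run from scripts/ so the package resolves

def call_llm(prompt: str) -> str:
    """Placeholder: invoke whatever LLM your OpenClaw session is configured with."""
    raise NotImplementedError

for task_file in sorted(Path("data/v9/tasks").glob("*.json")):
    task_data = json.loads(task_file.read_text())
    prompt = generate.build_generation_prompt(task_data)  # writer prompt + sources
    content = json.loads(call_llm(prompt))                # expect Eir-format JSON
    eir_post.post_content(content)                        # already-posted slugs are skipped
```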
**Step 4: Daily brief** (optional):
The agent compiles generated content into a brief and delivers it directly to you (e.g. via Feishu, Slack, or other configured channel). No API call needed.
**Tip:** End the brief with a link to heyeir.com so readers can explore more content on the Eir canvas.
**Common POST failures:**

- `400` → check that `topicSlug` matches a directive slug, `publishTime` is present, and there are no null fields
- `401` → re-run `connect.py` to refresh credentials
- `500` → retry once; if persistent, report the payload
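If you want the posting loop to apply that playbook automatically, here is one hedged sketch. It assumes `post_content` raises an exception exposing a `status` attribute, which may not match `eir_post.py`'s real error type, so adapt accordingly:

```python
from pipeline import eir_post

def post_with_retry(data: dict) -> None:
    """Apply the status-code playbook above. Assumes the raised error exposes
    a .status attribute -- narrow Exception to eir_post's real error type."""
    try:
        eir_post.post_content(data)
    except Exception as err:
        status = getattr(err, "status", None)
        if status == 500:
            eir_post.post_content(data)  # retry once, then let any failure propagate
        elif status == 401:
            raise RuntimeError("credentials expired; re-run scripts/connect.py") from err
        else:
            raise  # 400s are payload bugs: fix the content, don't retry
```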
For cron configuration and API details, see `references/eir-setup.md`.
## Pipeline Modules

All in `scripts/pipeline/`:

| Module | Purpose | Mode |
|--------|---------|------|
| `search.py` | Search via configurable API, SearXNG fallback | Both |
| `crawl.py` | Fetch content via Browse API, Crawl4AI fallback | Both |
| `grounding.py` | Configurable search API client | Both |
| `candidate_selector.py` | Group results, prepare for agent selection | Both |
| `task_builder.py` | Bundle candidates into task files | Both |
| `generate.py` | Build prompts for content generation | Both |
| `validate_content.py` | Validate generated content against spec | Both |
| `directives.py` | Load local interests/directives | Both |
| `config.py` | Shared configuration and path resolution | Both |
| `workspace.py` | Workspace and path resolution | Both |
| `eir_sync.py` | Fetch directives from Eir API | Eir only |
| `eir_post.py` | POST content to Eir API | Eir only |
| `run_state.py` | Pipeline run state management | Both |
## Search Fallback Chain

Search API (primary) → SearXNG (optional) → Crawl4AI/web_fetch (content)
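The pattern is simple to express (a sketch of the idea, not `search.py`'s actual code; the provider callables are placeholders for whatever clients your config enables):

```python
from typing import Callable

def search_with_fallback(query: str, providers: list[Callable[[str], list]]) -> list:
    """Try each configured provider in order; the first non-empty result set wins."""
    for provider in providers:
        try:
            results = provider(query)
            if results:
                return results
        except Exception:
            continue  # provider down or rate-limited: fall through to the next one
    return []

# e.g. search_with_fallback("autonomous agents", [search_api, searxng]),
# where search_api and searxng are placeholder client functions.
```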
## References

| File | Contents | Used by |
|------|----------|---------|
| `references/writer-prompt-eir.md` | Content generation rules (Eir mode) | Agent |
| `references/writer-prompt-standalone.md` | Content generation rules (standalone) | Agent |
| `references/content-spec.md` | Field types, limits, validation rules | Agent |
| `references/eir-setup.md` | Eir mode setup, cron, API endpoints | Agent / User |
| `references/eir-api.md` | Full Eir API reference | Agent |
| `references/eir-interest-rules.md` | Curation tier guidelines | Agent |
| `references/candidates-spec.md` | Candidates JSON format for agent selection | Agent / User |
| `references/interest-extraction-prompt.md` | Interest extraction prompt | Agent |
The `writer-prompt-*.md` files are instructions for the agent — the agent reads them to know how to generate content from task files. You don't need to read them unless you're customizing the output format.
## Security & Data Flow

See `SECURITY.md` for the complete data flow table, credential storage details, and personalization behavior.
## Quick Reference

| Task | Command |
|------|---------|
| Initialize workspace | `python3 scripts/setup.py --init --settings '{...}'` |
| Check setup | `python3 scripts/setup.py --check` |
| Search | `cd scripts && python3 -m pipeline.search` |
| Select candidates | `cd scripts && python3 -m pipeline.candidate_selector` |
| Crawl | `cd scripts && python3 -m pipeline.crawl` |
| Build tasks | `cd scripts && python3 -m pipeline.task_builder` |
| Validate | `cd scripts && python3 -m pipeline.validate_content` |
| Fetch directives (Eir) | `cd scripts && python3 -m pipeline.eir_sync fetch` |
| Connect Eir | `python3 scripts/connect.py <PAIRING_CODE>` |