Smart News

Data & APIs

Use when calling the Crypto News Analyzer HTTP API for async analysis jobs, semantic search, datasource management, intelligence operations, or health checks from OpenClaw.

Install

openclaw skills install smart-news

Crypto News HTTP API Skill

Use this skill to call the Crypto News Analyzer HTTP API from OpenClaw.

When to Use

Use this skill when you need to call https://news.tradao.xyz or a compatible private deployment.

Typical triggers:

  • Run asynchronous crypto news analysis over a time window
  • Run asynchronous unified semantic search (News + Intelligence) for a freeform topic query
  • Poll an API job until it finishes and then fetch the final result
  • Create, list, or delete datasources through the HTTP API
  • Query and manage intelligence topics through the topic-first API (create, revise, confirm, merge findings, detail, list, archive)
  • View and manage topic-datasource associations (get, set, add, remove) to scope topic research
  • List intelligence topic research run logs per-topic or globally
  • Check service health before or after an API workflow

Quick Reference

Authentication is Bearer token style: send Authorization: Bearer <API_KEY> with every request.

POST /analyze creates a job and returns immediately. It does not return the final report. Poll status, then fetch the result.

Workflow: POST /analyze -> GET /analyze/{job_id} -> GET /analyze/{job_id}/result

Jobs move through these states: queued, running, completed, failed.

POST /semantic-search creates a job, returns 202 Accepted, and includes status_url, result_url, plus a Retry-After header. When hours exceeds the server max (720h default), a warning field describes the truncation. Semantic search jobs that do not complete within 5 minutes are automatically failed with a timeout error.

Semantic workflow: POST /semantic-search -> GET /semantic-search/{job_id} -> GET /semantic-search/{job_id}/result

Unified semantic search retrieves from both content_items and raw_intelligence_items via PostgreSQL with pgvector HNSW indexes (embedding vector(1536)). SQLite runtime is unsupported.

For detailed guides, see:

OpenClaw Runtime

This skill declares metadata.openclaw.primaryEnv: API_KEY. In OpenClaw, inject the bearer token through ~/.openclaw/openclaw.json:

{
  skills: {
    entries: {
      "smart-news": {
        enabled: true,
        apiKey: "YOUR_API_KEY"
      }
    }
  }
}

If apiKey is unavailable, do not send unauthenticated requests. Ask the operator to configure the token first.

If you are using a non-production deployment, replace https://news.tradao.xyz with the correct base URL before issuing requests.

Analyze Workflow

Create an analysis job by posting to /analyze with hours and user_id. The server responds with 202 Accepted, a job_id, status_url, and result_url.

Poll the status endpoint until the job reaches completed or failed. Do not expect the analysis report in the initial POST response. Once completed, fetch the result URL.

Semantic Search

Unified semantic search retrieves from both News (content_items) and Intelligence (raw_intelligence_items) domains via UNION ALL over pgvector HNSW indexes. The response includes a source_breakdown with per-domain matched_count and retained_count. Each hit carries a source_domain discriminator ("news" or "intelligence").

Create a semantic search job by posting to /semantic-search with hours, query, and user_id. The server responds with 202 Accepted, a job_id, status_url, and result_url. Semantic search job IDs start with semantic_search_job_.

Poll the status endpoint until the job reaches completed or failed, then fetch the report from the result URL. Use the status field as the source of truth for lifecycle state; success becomes true only when the job is completed successfully.

Request rules:

  • hours must be a positive integer
  • query is required, trimmed, and capped at 300 characters
  • query cannot be blank or whitespace-only
  • user_id must match ^[A-Za-z0-9_-]{1,128}$

Operational constraints:

  • Semantic search is PostgreSQL-only and returns 503 when the backend does not support pgvector
  • Both content_items and raw_intelligence_items tables have embedding vector(1536) columns with HNSW indexes (idx_content_embedding_hnsw and idx_intelligence_embedding_hnsw)
  • The API uses vector similarity over stored content embeddings and combines that with deterministic local keyword fallback (no LLM-driven keyword expansion)
  • LLM query decomposition is disabled by default (query_planning_enabled: false); when disabled the raw user query is embedded directly as the only subquery. The max_subqueries cap (4) only applies when query planning is explicitly re-enabled
  • Final retained results are capped at 200 unique items per domain before merging
  • Embedding generation requires OPENAI_API_KEY; report synthesis requires KIMI_API_KEY or GROK_API_KEY (query planning also requires an LLM key but is disabled by default)

The result body returns a Markdown report with query, normalized_intent, matched_count, retained_count, time_window_hours, source_breakdown, and report.

Datasource Management

Configure news and intelligence sources through the datasource API. Create sources with POST /datasources, list them with GET /datasources, and remove them with DELETE /datasources/{id}. All datasource routes require Bearer auth.

Each datasource has a purpose field: news (RSS/X/REST feeds for analysis) or intelligence (Telegram groups, V2EX for topic research). The GET /datasources endpoint supports optional purpose and source_type query parameters for filtering. Results are sorted by purpose, source type, then name.

Tags help organize sources. Each datasource accepts up to 16 unique tags. Each tag is capped at 32 characters. Tags are normalized to lowercase and deduplicated automatically.

List and create responses include only safe summaries. For rest_api type datasources, secrets are redacted and counts replace raw credential fields. This prevents accidental credential exposure when reviewing configurations.

Intelligence Query (Topic-First)

All intelligence routes require Bearer auth. The deprecated entry-based routes (/intelligence/entries*, /intelligence/discovery, /intelligence/labels, /intelligence/search) have been removed in the topic-only refactor. Topics are the sole first-class intelligence objects, driving scheduled LLM research from raw ingested messages and storing findings with merge support.

Synchronous topic workflow endpoints:

  • POST /intelligence/topics — Create a topic draft from a user theme (returns AI-generated prompt draft)
  • POST /intelligence/topics/{topic_id}/revise — Revise the draft prompt with feedback
  • PUT /intelligence/topics/{topic_id}/prompt — Manually set/replace the prompt text (context-aware: edits active prompt if one exists, otherwise creates draft revision)
  • POST /intelligence/topics/{topic_id}/confirm — Confirm and activate the topic for research (requires prompt_version_id)
  • GET /intelligence/topics — List topics with pagination and active_only filter (default: true)
  • GET /intelligence/topics/{topic_id} — Get topic metadata and merge availability
  • GET /intelligence/topics/{topic_id}/findings — Get paginated active findings with citations and source URLs
  • GET /intelligence/topics/{topic_id}/prompts — Get prompt versions and current active prompt
  • POST /intelligence/topics/{topic_id}/archive — Archive a topic
  • GET /intelligence/topics/{topic_id}/runs — List topic research run logs
  • GET /intelligence/topic-runs — List all topic research runs globally

These endpoints are synchronous; there is no async job/poll flow. Results return immediately.

Async topic merge endpoint:

  • POST /intelligence/topics/{topic_id}/merge — Start an async merge job (returns 202 Accepted with job_id, status_url, result_url)
  • GET /intelligence/topics/{topic_id}/merge/{job_id} — Check merge job status
  • GET /intelligence/topics/{topic_id}/merge/{job_id}/result — Retrieve completed merge results

Merge workflow: POST /intelligence/topics/{id}/merge → poll GET .../merge/{job_id}GET .../merge/{job_id}/result. Jobs move through states: queued, running, completed, failed. The merge LLM call may take several minutes, so polling is required — do not block on the POST response.

Topics have lifecycle states: draft, active, archived. Only active topics are researched by the ingestion scheduler. Finding merge is available through both the async HTTP endpoint and the Telegram /topic_merge command.

Telegram Webhook

The webhook endpoint exists for maintainer-level Telegram integration. It is not the primary path for day-to-day operators. Regular users should interact through the API routes or Telegram slash commands instead.

When processing webhook updates, validate the X-Telegram-Bot-Api-Secret-Token header to confirm the request originates from Telegram.

Endpoint Index

Supported HTTP routes:

  • GET /health - Service health check
  • POST /analyze - Create an analysis job (async, returns 202)
  • GET /analyze/{job_id} - Check job status
  • GET /analyze/{job_id}/result - Retrieve completed job results
  • POST /semantic-search - Create a semantic search job (async, returns 202)
  • GET /semantic-search/{job_id} - Check semantic search job status
  • GET /semantic-search/{job_id}/result - Retrieve completed semantic search results
  • POST /datasources - Create a datasource
  • GET /datasources - List all datasources
  • DELETE /datasources/{id} - Delete a datasource
  • POST /telegram/webhook - Telegram webhook receiver
  • POST /intelligence/topics - Create topic draft (synchronous, Bearer-protected)
  • POST /intelligence/topics/{id}/revise - Revise topic prompt
  • PUT /intelligence/topics/{id}/prompt - Manually set topic prompt
  • POST /intelligence/topics/{id}/confirm - Confirm and activate topic
  • GET /intelligence/topics - List topics with status filters
  • GET /intelligence/topics/{id} - Get topic metadata and merge availability
  • GET /intelligence/topics/{id}/findings - Get paginated findings with citations
  • GET /intelligence/topics/{id}/prompts - Get prompt versions and active prompt
  • POST /intelligence/topics/{id}/archive - Archive topic
  • POST /intelligence/topics/{id}/merge - Start async merge job (returns 202)
  • GET /intelligence/topics/{id}/merge/{job_id} - Check merge job status
  • GET /intelligence/topics/{id}/merge/{job_id}/result - Retrieve completed merge results
  • GET /intelligence/topics/{id}/datasources - List datasource associations for a topic
  • PUT /intelligence/topics/{id}/datasources - Replace all datasource associations atomically
  • POST /intelligence/topics/{id}/datasources/{datasource_id} - Add a datasource association (idempotent)
  • DELETE /intelligence/topics/{id}/datasources/{datasource_id} - Remove a datasource association (idempotent)
  • GET /intelligence/topics/{id}/runs - List topic research run logs
  • GET /intelligence/topic-runs - List all topic research runs globally

Non-Goals

This skill does not cover:

  • Telegram slash commands (use the Telegram bot directly)
  • Autogenerated documentation routes (/docs, /redoc, /openapi.json)
  • Deprecated compatibility aliases are not part of the active runtime surface
  • Direct embedding backfill operations beyond pointing you to the documented command

These surfaces exist but are intentionally excluded from this API-focused skill.

Updating

Keep this skill aligned with the live HTTP routes in api_server.py, the AI Analyze API Guide at docs/AI_ANALYZE_API_GUIDE.md, the semantic search guide at docs/SEMANTIC_SEARCH_API_GUIDE.md, and the domain repository contracts in domain/repositories.py.

When documentation disagrees with implementation, trust the code and tests over prose docs. Source precedence: code first, then reference files, then guides.