Tavily arXiv Paper Fetech

v1.0.0

Resolve one or more paper titles to reliable arXiv papers using Tavily search, then fetch compact arXiv metadata and abstracts with a local helper script. Us...

0· 107·0 current·0 all-time
byDehao Huang@red0orange

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for red0orange/tavily-arxiv-paper-fetech.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Tavily arXiv Paper Fetech" (red0orange/tavily-arxiv-paper-fetech) from ClawHub.
Skill page: https://clawhub.ai/red0orange/tavily-arxiv-paper-fetech
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install tavily-arxiv-paper-fetech

ClawHub CLI

Package manager switcher

npx clawhub@latest install tavily-arxiv-paper-fetech
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description match what the skill does. Included Python helper scripts and the SKILL.md are consistent with resolving titles via Tavily and fetching arXiv metadata; no unrelated credentials, binaries, or config paths are requested.
Instruction Scope
Runtime instructions stay within the described workflow: normalize titles, call Tavily for resolution only, then run local Python scripts to fetch arXiv metadata and render JSONL→markdown. The skill does not instruct reading unrelated files or exfiltrating data to unexpected endpoints.
Install Mechanism
No install spec (instruction-only). The two small Python scripts are included in the bundle and use only the standard library (urllib, re, json, pathlib). No downloads from external, untrusted URLs or package installs are present.
Credentials
No environment variables or credentials are required. Network access to arxiv.org and access to a Tavily search tool are expected and proportional to the stated purpose. The scripts use a harmless custom User-Agent header and validate the host is arxiv.org.
Persistence & Privilege
The skill does not request persistent/always-enabled presence and doesn't modify other skills or global agent settings. It writes outputs only under the specified WORKDIR.
Assessment
This skill is coherent and implements what it claims: it will call Tavily (the search tool), make HTTP requests to arxiv.org, run the included Python scripts, and write files under WORKDIR. Before installing, confirm your agent/environment: (1) has access to Tavily if title resolution is needed, (2) allows outbound HTTP to arxiv.org, and (3) has python3 on PATH so the helper scripts can run. No secrets are requested. If you want extra caution, review the two included scripts (they are small and only use the standard library) and set WORKDIR to a location you control so outputs are isolated.

Like a lobster shell, security has layers — review code before you run it.

latestvk97eqxeknmzf3tgjy3vsh27nhs83x2sy
107downloads
0stars
1versions
Updated 4w ago
v1.0.0
MIT-0

Tavily arXiv Paper Fetech Pipeline

Orchestrate a title-to-arXiv metadata workflow for: $ARGUMENTS

Overview

This skill turns a list of paper titles into a reproducible arXiv lookup result:

paper titles
-> Tavily search only
-> canonical arXiv abs URL
-> local arXiv metadata fetch
-> JSONL process log
-> markdown report

The goal is not just to answer once. The goal is to leave behind a reusable work folder that can be resumed or consumed by another workflow.

Inputs

Extract or ask for:

  • TITLE_LIST: one or more paper titles
  • WORKDIR: optional working folder

If WORKDIR is omitted, default to:

  • 0_docs/tavily_arxiv_paper_fetech

Accepted title formats:

  • a single title
  • multiple titles separated by newlines
  • markdown bullet lists

Constants

  • TAVILY_ONLY = true — Tavily is the only allowed search mechanism
  • SEQUENTIAL_MODE = true — process titles one by one to reduce rate limits
  • NO_FALLBACK_SEARCH = true — never switch to arXiv API search, guessed arXiv URLs, or another provider
  • LOW_CONTEXT_MODE = true — append one JSON line per title, then render markdown from JSONL

Work Folder Layout

Write all outputs under WORKDIR:

ArtifactPurpose
input_titles.mdnormalized input title list
paper_fetches.jsonlone JSON line per processed title
paper_fetch_report.mdrendered report from the JSONL

Execution Rule

Process titles in order. Do not parallelize Tavily calls.

If Tavily rate limits:

  • wait briefly
  • retry Tavily only
  • if it still fails, record the error and continue

Phase 0: Initialize

  1. Normalize the input title list.
  2. Create WORKDIR.
  3. Write the normalized title list to WORKDIR/input_titles.md.

Phase 1: Tavily resolution

For each title:

  1. Search Tavily with:
"<paper title>" arXiv
  1. Prefer results in this order:

    • https://arxiv.org/abs/...
    • https://arxiv.org/html/...
    • https://arxiv.org/pdf/...
  2. Accept a result only when the match is reliable:

    • exact title match after minor normalization
    • same title with only punctuation, Unicode, or math-formatting differences
    • arXiv page clearly shows the same paper title
  3. If uncertain, record no_match instead of guessing.

Phase 2: Local arXiv fetch

Once a reliable arXiv URL is known, run:

python3 ".cursor/skills/tavily-arxiv-paper-fetech/scripts/fetch_arxiv_abs.py" "<arxiv-url>"

This returns compact JSON with:

  • canonical abs URL
  • arXiv id
  • title
  • authors
  • abstract

Phase 3: Low-context JSONL logging

For each title, immediately append one JSON line to:

  • WORKDIR/paper_fetches.jsonl

Each line should include at least:

  • index
  • input_title
  • tavily_status
  • tavily_error
  • arxiv_url
  • fetch

Status values:

  • ok
  • no_match
  • error

Phase 4: Render markdown report

After all titles are processed, run:

python3 ".cursor/skills/tavily-arxiv-paper-fetech/scripts/jsonl_to_paper_fetch_md.py" \
  "WORKDIR/paper_fetches.jsonl" \
  "WORKDIR/paper_fetch_report.md"

Output Format

The rendered report should look like:

# Tavily arXiv Paper Fetech Report

## Results

### 1. Original Title
- Status: ok
- arXiv URL: https://arxiv.org/abs/xxxx.xxxxx
- arXiv ID: xxxx.xxxxx
- Resolved Title: ...
- Authors: ...
- Abstract: ...

Key Rules

  • Never fabricate a paper match.
  • Never use non-Tavily search for title resolution.
  • Keep all outputs inside WORKDIR.
  • Prefer the local helper script over bringing full arXiv page content into context.

Utility Scripts

  • scripts/fetch_arxiv_abs.py — fetch compact metadata from a known arXiv URL
  • scripts/jsonl_to_paper_fetch_md.py — render JSONL to markdown

Additional Resources

  • For copy-paste invocations and expected outputs, see examples.md

Example Invocation

/tavily-arxiv-paper-fetech "RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control\nOpenVLA: An Open-Source Vision-Language-Action Model --workdir 0_docs/tavily_arxiv_lookup_run_01"

Comments

Loading comments...