Deep Researcher Skill

v1.0.0

Aggregates insights from web, papers, video, and forums with paywall bypass to deliver comprehensive, unbiased, multi-format research and analysis.

⭐ 0· 126·0 current·0 all-time

by@geoffguides

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for geoffguides/deep-researcher-skill.

Previewing Install & Setup.

Prompt PreviewInstall & Setup

Install the skill "Deep Researcher Skill" (geoffguides/deep-researcher-skill) from ClawHub.
Skill page: https://clawhub.ai/geoffguides/deep-researcher-skill
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install deep-researcher-skill

ClawHub CLI

Package manager switcher

npx clawhub@latest install deep-researcher-skill

Security Scan

VirusTotal

Benign

View report →

OpenClaw

Benign

medium confidence

✓

Purpose & Capability

Name/description promise multi‑source research and paywall bypass; SKILL.md explicitly lists built‑in search/YouTube/arXiv/Reddit/browser tools and paywall bypass steps, which are proportional to the stated goals. No unrelated credentials, binaries, or installs are requested.

ℹ

Instruction Scope

Instructions stay within the research domain (search, transcripts, papers, forums) but explicitly direct the agent to use third‑party proxy services (archive.org/google cache, 12ft.io, r.jina.ai) and a browser tool to retrieve paywalled content. That is coherent with the skill's stated paywall‑bypass capability, but it means target URLs and page contents will be sent to those external services (privacy/data‑exfiltration risk). The skill also offers a 'Show my research history' feature without documenting storage location or retention—this is ambiguous and worth clarifying.

✓

Install Mechanism

No install spec or code; instruction‑only skill — lowest on‑disk risk. There are no downloads, archives, or external install URLs to review.

✓

Credentials

No required environment variables or credentials. Optional API keys (NewsAPI, OpenWeather, ExchangeRate) are reasonable for the claimed optional integrations and are documented as optional. No unrelated secrets requested.

ℹ

Persistence & Privilege

always is false and autonomous invocation is allowed (platform default). The skill references a research history feature but does not declare where history is stored or how long it is retained — this is a scope/privilege detail that should be clarified before use. There is no instruction to modify other skills or system settings.

Assessment

This skill is internally consistent with its description, but before installing you should: (1) decide whether you are comfortable that requested URLs and article contents may be sent to third‑party fetchers (archive.org/google cache, 12ft.io, r.jina.ai) — that can leak sensitive URLs or content; (2) consider legal/ethical implications of automated paywall bypass for copyrighted material and check terms of service for sources you care about; (3) confirm where and how 'research history' will be stored and who can access it (agent memory, local disk, remote store); (4) if you supply optional API keys, limit their scope and monitor usage; and (5) restrict or review the agent's browser tool permissions (cookies/authenticated sessions) to avoid unintended access to private accounts. If you need stronger assurances, ask the skill author to document storage/retention and to offer an option that disables proxy fetchers and only uses public/open sources.

Like a lobster shell, security has layers — review code before you run it.

analysisvk972scv3aq6jqaa6mwxxc7gdj183mrxcdeep-divevk972scv3aq6jqaa6mwxxc7gdj183mrxcinvestigationvk972scv3aq6jqaa6mwxxc7gdj183mrxclatestvk972scv3aq6jqaa6mwxxc7gdj183mrxcresearchvk972scv3aq6jqaa6mwxxc7gdj183mrxcsourcesvk972scv3aq6jqaa6mwxxc7gdj183mrxc

126downloads

0stars

1versions

Updated 1mo ago

v1.0.0

MIT-0

SKILL.md - Research Assistant

Description

Your personal research department. Multi-source synthesis that turns scattered information into actionable intelligence â not just summaries, but insights you can act on.

Price

Free â or $5 to support development.

Prerequisites

DuckDuckGo Search (built-in, no key needed)
YouTube Content tool (built-in, no key needed)
arXiv skill (built-in, no key needed)
Reddit Readonly skill (built-in, no key needed)
Browser tool (built-in, for paywall bypass)
Optional: NewsAPI key (free tier: 100 requests/day) â current events
Optional: OpenWeather API key (free tier: 1,000 calls/day) â location context
Optional: ExchangeRate-API key (free tier: 1,500 requests/month) â finance data
Optional: REST Countries API (no key needed) â demographics

Quick Start

Configure optional APIs: "Set up my research assistant with NewsAPI"
Research: "Research [topic]" or "Deep dive into [question]"

Commands

"Research [topic]" â Quick synthesis from multiple sources
"Deep dive into [question]" â Comprehensive analysis
"Compare [A] vs [B]" â Competitive/feature analysis
"What's new in [field] this month?" â Temporal research
"Research for [format]: [topic]" â Brief, thread, blog, or decision matrix
"Show my research history" â Previous queries and findings

Tool Selection Matrix

Source Type	Tool to Use	Fallback
Web search	`duckduckgo_search`	None needed
YouTube transcripts	`youtube-content` skill	Browser tool
Academic papers	`arxiv` skill	`duckduckgo_search` with site:arxiv.org
Forums/Reddit	`reddit-readonly` skill	None needed
Paywalled articles	`browser_navigate` + archive.org	12ft.io, textise dot iitty
Current events	NewsAPI (if configured)	`duckduckgo_search` news filter
Weather data	OpenWeather API	`duckduckgo_search`
Financial data	ExchangeRate-API	`duckduckgo_search`

Paywall Bypass Strategy

When you hit a paywall:

Try archive.org: https://webcache.googleusercontent.com/search?q=URL or https://archive.org/web/*/URL
Try 12ft.io: https://12ft.io/URL (works for Medium, Substack, etc.)
Try textise dot iitty: https://r.jina.ai/http://URL (extracts article text)
Use browser tool: Navigate and extract text directly
Skip only if all fail â mark as "paywalled, unverified"

Core Workflows

1. Quick Research (2-3 minutes)

Input: Any question or topic

Process:

Parallel search across sources using tool matrix above
Fetch top 3-5 results per source
Bypass paywalls using strategy above
Extract key points from each
Synthesize into structured brief
Cite all sources with links

Output:

## Research Brief: [Topic]

### Executive Summary
[3-5 sentences covering the landscape]

### Key Findings
1. **[Finding]** â [Source type: web/video/paper/forum]
2. **[Finding]** â [Source type]
3. **[Finding]** â [Source type]

### Sources
- [Title](URL) â Web article, [Date]
- [Title](URL) â YouTube video, [Channel]
- [Title](URL) â arXiv paper, [Authors]
- [Title](URL) â Reddit discussion, [Subreddit]

### Confidence Score: [High/Medium/Low]
**Why:** [Source quality, recency, consensus level]

### Suggested Next Steps
- [Specific follow-up question]
- [Related topic to explore]
- [Deeper source to check]

2. Deep Dive Research (5-10 minutes)

Input: Complex question requiring comprehensive analysis

Process:

Multi-query expansion (break topic into sub-questions)
10-15 sources across all channels
Apply paywall bypass as needed
Temporal analysis (what's new vs. established)
Credibility scoring per source
Bias detection and flagging
Synthesis with uncertainty levels

Stopping Conditions â When to End:

Saturation: New sources repeat what you already found
Diminishing returns: 10+ sources but confidence still Low
Contradiction ceiling: >50% of sources disagree
Time limit: 15 minutes max for Deep Dive
Confidence achieved: High confidence with 3+ Tier 1 sources

Output:

## Deep Dive: [Topic]

### One-Paragraph Summary
[The TL;DR for busy decision-makers]

### Current State (What's Happening Now)
[Recent developments, 0-6 months]

### Established Knowledge (What We Know)
[Consensus views, foundational concepts]

### Points of Contention
- **[Claim A]** â [Evidence for] vs [Evidence against]
- **[Claim B]** â [Evidence for] vs [Evidence against]

### Source Quality Breakdown
| Source | Type | Credibility | Recency | Bias |
|--------|------|-------------|---------|------|
| [Name] | Academic | High | 2024 | Neutral |
| [Name] | News | Medium | 2025 | Center-left |
| [Name] | Forum | Low | 2025 | N/A |

### Confidence Calibration
**Level:** [High/Medium/Low]
**Reasoning:** [Why this level based on criteria below]

### Actionable Insights
1. **[Insight]** â [Specific action to take]
2. **[Insight]** â [Specific action to take]

### Knowledge Gaps
[What we still don't know]

### Recommended Follow-Up
- [Specific research question]
- [Expert to consult]
- [Primary source to find]

3. Comparative Analysis

Input: "Compare X vs Y" or "Feature gap analysis"

Process:

Research both subjects independently using tool matrix
Extract features/capabilities/attributes
Build comparison matrix
Identify gaps and differentiators
Score on key dimensions

Structured Data Extraction:

Pricing extraction pattern:
- Search: "[Product] pricing cost $"
- Look for: $XXX/month, $XXX/year, free tier limits
- Source: Official pricing page (bypass paywall if needed)

Feature extraction pattern:
- Search: "[Product] features vs [Competitor]"
- Look for: Feature lists, comparison tables
- Use: Browser tool to extract structured data

Sentiment extraction pattern:
- Reddit: Search r/[topic] for "[Product] review"
- Look for: Specific pros/cons with reasoning
- Score: Count positive vs negative mentions

Output:

## Comparison: [A] vs [B]

### At a Glance
| Dimension | [A] | [B] | Winner |
|-----------|-----|-----|--------|
| Price | $X | $Y | [A/B/Tie] |
| Key Feature | [Desc] | [Desc] | [A/B/Tie] |
| User Sentiment | [Score] | [Score] | [A/B/Tie] |

### Detailed Breakdown

**[A] Strengths:**
- [Point with source]
- [Point with source]

**[B] Strengths:**
- [Point with source]
- [Point with source]

**[A] Weaknesses:**
- [Point with source]

**[B] Weaknesses:**
- [Point with source]

### Feature Gap Analysis
- [Feature A]: [A] has it, [B] doesn't
- [Feature B]: Both have it, [A] does it better
- [Feature C]: Neither has it (opportunity)

### Verdict
[Recommendation with reasoning]

### Sources
[All citations]

4. Temporal Research (What's New)

Input: "What's new in [field] this [timeframe]?"

Process:

Filter sources by date using search filters
Compare to baseline (previous period)
Identify new developments, trends, shifts
Flag emerging vs. fading topics

Output:

## [Field] Update: [Timeframe]

### New Developments
1. **[Development]** â [Impact level] â [Source]
2. **[Development]** â [Impact level] â [Source]

### Trends to Watch
- [Trend]: [Evidence] â [Trajectory: rising/stable/falling]
- [Trend]: [Evidence] â [Trajectory]

### What's Fading
- [Topic]: [Why it's declining]

### Predictions (Speculative)
- [Prediction] â [Based on]

### Sources from This Period
[All recent citations]

5. Format-Specific Output

Brief Mode: Executive summary only (2-3 paragraphs)

Thread Mode: Twitter/X thread format

ð§µ [Topic]: [Hook]

1/ [Point]
2/ [Point]
3/ [Point]

[Sources]

Blog Mode: H2 outline with key points

## [Title]

### Introduction
[Hook]

### [Section 1]
[Key points]

### [Section 2]
[Key points]

### Conclusion
[Takeaway]

### Sources
[Citations]

Decision Matrix Mode: Pros/cons table with scoring

| Option | Pros | Cons | Score |
|--------|------|------|-------|
| [A] | [List] | [List] | X/10 |
| [B] | [List] | [List] | X/10 |

Confidence Calibration System

Don't guess â use these criteria:

High Confidence

Sources: 3+ Tier 1 (academic, official, expert) OR 5+ Tier 2
Recency: All sources <6 months old OR established consensus
Contradictions: Zero major contradictions
Corroboration: Findings confirmed by independent sources

Medium Confidence

Sources: 2+ Tier 2 (industry pubs, established blogs)
Recency: Mix of recent and established
Contradictions: Minor contradictions resolved
Gaps: Some uncertainty acknowledged

Low Confidence

Sources: Single source OR mostly Tier 4-5
Recency: Old data (>1 year) OR no date
Contradictions: Major contradictions unresolved
Gaps: Significant unknowns

Flag language:

High: "Research shows...", "Evidence confirms..."
Medium: "Sources suggest...", "It appears that..."
Low: "One source claims...", "Limited research indicates..."

Source Quality Scoring

Tier 1: Highest Credibility (Weight: 3x)

Peer-reviewed journals (Nature, Science, etc.)
Official documentation (gov, corporate)
SEC filings, regulatory documents
Direct primary sources

Tier 2: High Credibility (Weight: 2x)

Established news (Reuters, AP, BBC)
Expert blogs with track record
Industry analysts (Gartner, McKinsey)
Technical publications (IEEE, ACM)

Tier 3: Medium Credibility (Weight: 1x)

Industry publications
Established YouTube channels
Well-moderated forums
Think tank reports

Tier 4: Low Credibility (Weight: 0.5x)

General news coverage
Encyclopedia entries (Wikipedia â follow citations)
Content aggregators

Tier 5: Use Cautiously (Weight: 0.25x)

Anonymous forums
Unverified social posts
Personal blogs without track record

Auto-Skip

Known misinformation sources
Circular references (A cites B cites A)
Paywalled AND can't bypass

Bias Detection

Political Spectrum

Left / Center-left / Center / Center-right / Right
Flagged when source consistently leans one direction

Commercial Bias

None: No financial stake in topic
Disclosed: Affiliate links, sponsorships noted
Undisclosed: Potential conflicts not mentioned

Confirmation Bias Warning

"This source only presents one side"
"Contradictory evidence exists"
"Consensus vs. outlier view"

Domain-Specific Research Patterns

Product/Competitive Research (ClawMart/Gumroad)

Search: "[Product] vs [competitor] review"
Check: Pricing pages, feature lists
Mine: Reddit for real user experiences
Extract: Structured comparison table

Scientific/Medical Research

Start: PubMed, Cochrane Library for clinical
Check: arXiv for preprints
Verify: Primary sources, not news summaries
Flag: Single studies vs. meta-analyses

Financial Research

SEC filings (10-K, 10-Q) for public companies
Earnings call transcripts
Analyst reports (paid) â use summaries from news
ExchangeRate-API for currency data

Legal Research

Court documents via PACER or public dockets
Legal databases (Westlaw, Lexis â paywalled)
Law reviews and journals
Regulatory agency guidance

Configuration

research-config.json

{
  "apis": {
    "newsapi": {
      "key": "YOUR_NEWSAPI_KEY",
      "tier": "free"
    },
    "openweather": {
      "key": "YOUR_OPENWEATHER_KEY",
      "tier": "free"
    },
    "exchangerate": {
      "key": "YOUR_EXCHANGERATE_KEY",
      "tier": "free"
    }
  },
  "defaults": {
    "max_sources": 10,
    "recency_preference": "balanced",
    "output_format": "brief",
    "confidence_threshold": "medium"
  },
  "quality_thresholds": {
    "min_credibility": "medium",
    "max_age_days": 365,
    "require_diversity": true
  },
  "stopping_conditions": {
    "max_sources": 15,
    "max_time_minutes": 15,
    "saturation_threshold": 3
  }
}

Examples

Example 1: Quick Research

User: "Research AI video generation trends"

Agent:
## Research Brief: AI Video Generation Trends

### Executive Summary
AI video generation has exploded in 2024-2025 with Runway, Pika, and Kling leading consumer tools. Quality improved dramatically â 4-second clips now rival professional footage. Open-source alternatives (Stable Video Diffusion) gaining traction. Key limitation: temporal consistency in longer sequences.

### Key Findings
1. **Runway Gen-2** is market leader for quality â $15/mo, 4-second clips
2. **Pika Labs** offers budget alternative at $8/mo with good results
3. **Kling AI** (China-based) emerging as quality competitor
4. **Open source** (Stable Video Diffusion) viable for technical users

### Sources
- [The State of AI Video 2025](URL) â Web, Jan 2025
- [Runway vs Pika: Comparison](URL) â YouTube, TechChannel
- [Stable Video Diffusion Paper](URL) â arXiv, 2024
- [r/MachineLearning discussion](URL) â Reddit, Feb 2025

### Confidence Score: High
Multiple Tier 2+ sources confirm, recent data, established consensus. Zero contradictions.

### Suggested Next Steps
- Deep dive into temporal consistency solutions
- Compare pricing for high-volume production
- Research enterprise alternatives (Sora, etc.)

Example 2: Competitive Analysis with Paywall Bypass

User: "Compare ElevenLabs vs Play.ht for voice generation"

Agent:
## Comparison: ElevenLabs vs Play.ht

### At a Glance
| Dimension | ElevenLabs | Play.ht | Winner |
|-----------|------------|---------|--------|
| Price | $5/mo | $30/mo | ElevenLabs |
| Quality | Excellent | Good | ElevenLabs |
| Languages | 29 | 142 | Play.ht |
| API | Yes | Yes | Tie |

### Pricing Research
- ElevenLabs: $5/mo Starter, $22/mo Creator (source: pricing page)
- Play.ht: $30/mo Personal, $99/mo Pro (source: pricing page)

### User Sentiment (Reddit r/elevenlabs, r/speech synthesis)
- ElevenLabs: 85% positive (quality praised, occasional latency issues)
- Play.ht: 70% positive (good languages, expensive for quality)

### Verdict
ElevenLabs for quality/price, Play.ht if you need obscure languages.

[Full breakdown with sources...]

Guardrails

Always cite sources â never present synthesis as original research
Flag uncertainty with calibrated language ("likely" vs "confirms")
Bypass paywalls before skipping â don't leave knowledge on the table
Respect rate limits â cache results, batch when possible
Verify recency â old data can mislead
Acknowledge gaps â "limited research exists on..."
Stop when conditions met â don't over-research

Troubleshooting

Error: "No results found"

Try broader search terms
Check tool availability
Verify internet connection

Error: "Rate limit exceeded"

Wait 60 seconds, retry
Switch to fallback tools
Use cached results when available

Error: "Source quality too low"

Broaden search terms
Remove recency filter
Try alternative sources from tool matrix
Accept "Low confidence" finding

Error: "Paywall blocking access"

Try archive.org
Try 12ft.io
Try textise dot iitty
Use browser tool to extract
Only skip if all methods fail

Version History

V1.0: Multi-source search, synthesis, 4 output formats
V1.1: Bias detection, temporal research, competitive analysis
V1.2: Source quality scoring, citation export, research history
V1.3: Public APIs integration â NewsAPI, OpenWeather, ExchangeRate-API, REST Countries
V1.4:
- Added tool selection matrix (platform-agnostic)
- Added paywall bypass strategy
- Added confidence calibration system
- Added stopping conditions
- Added structured data extraction patterns
- Added domain-specific research patterns

Turn information into intelligence.

Comments

Loading comments...