Skill flagged — suspicious patterns detected

ClawHub Security flagged this skill as suspicious. Review the scan results before using.

llms.txt File Builder

Generate, validate, and optimize llms.txt files for AI crawler accessibility. Creates structured markdown files that help AI platforms (ChatGPT, Perplexity,...

MIT-0 · Free to use, modify, and redistribute. No attribution required.
by GEOLY AI (@geoly-geo)
Security Scan
VirusTotal: Suspicious
OpenClaw: Benign (high confidence)
Purpose & Capability
The name/description (llms.txt generator + validator) aligns with the included Python scripts and reference docs. Network fetches, sitemap parsing, page analysis, and optional URL checks are expected capabilities for this purpose. No unrelated credentials, binaries, or config paths are requested.
Instruction Scope
SKILL.md instructs the agent to run the provided scripts, which only read user-provided inputs (domain, file paths, interactive responses) and make HTTP requests to the target site(s). The scripts do not read unrelated local files, environment variables, or transmit data to third-party endpoints beyond fetching/analyzing the target URLs.
Install Mechanism
There is no automated install spec (instruction-only). The Python scripts require third-party packages (requests, beautifulsoup4) but only print an error and exit if missing; nothing is auto-downloaded or executed from remote URLs. This is a low-risk model but users should install dependencies themselves in a controlled environment.
Credentials
No environment variables, credentials, or config paths are required. The scripts only perform unauthenticated HTTP requests to target domains (and optionally HEAD requests when validating). The requested access is proportional to the stated crawling/validation task.
Persistence & Privilege
The skill is not always-enabled and does not persist or modify other skills or system-wide settings. It runs on-demand and does not store credentials or enable autonomous background behavior.
Assessment
This skill appears coherent and matches its stated purpose. Before installing or running:
1) Review and run the Python scripts locally or in a sandbox. They perform HTTP(S) requests to any domain you supply, so avoid pointing them at internal or private endpoints or embedding secrets in URLs.
2) Install dependencies (requests, beautifulsoup4) manually in a controlled environment.
3) The generator crawls pages (limited to ~40 pages, with a 0.5s delay) and the validator can HEAD-check live URLs; be mindful of target site rate limits and robots/crawl policies.
4) No credentials are requested and there is no obvious exfiltration. If you plan to use authenticated pages, expect the scripts to fail or to require modification (do not add secrets unless you trust and have audited the code).
5) If you need higher assurance, have someone with Python/security knowledge audit the code before giving the tool access to non-public hosts.

Like a lobster shell, security has layers — review code before you run it.

Current version: v1.0.1
Tags: ai-crawl, geo, latest, llms-txt

SKILL.md

llms.txt File Builder

Methodology by GEOly AI (geoly.ai) — GEO infrastructure for the AI search era.

Generate well-structured llms.txt files to help AI platforms understand and cite your content.

Quick Start

Generate an llms.txt file for any website:

python scripts/generate_llms_txt.py <domain> [--output llms.txt]

Example:

python scripts/generate_llms_txt.py example.com --output llms.txt

What is llms.txt?

The llms.txt standard helps AI crawlers understand:

  • What your brand/company does
  • Which pages contain the most valuable information
  • How content is organized (products, docs, blog, etc.)
  • Where to find key facts and data

Full standard details: See references/standard.md

Standard Format

# [Brand Name]

> [One-sentence brand description]

[2-3 paragraph overview: what you do, who it's for, key differentiators]

## Key Pages

- [Page Title](URL): One-line description
- [Page Title](URL): One-line description

## Products / Services

- [Product Name](URL): What it does and who it's for

## Documentation

- [Doc Title](URL): What this doc explains

## Blog / Resources

- [Article Title](URL): Key insight or topic covered

## About

- [About Us](URL): Company background and mission
- [Contact](URL): How to reach the team
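
As a rough illustration of how the template above could be filled in from structured data, here is a minimal sketch. The `Link`, `Section`, and `render_llms_txt` names are hypothetical and are not taken from `generate_llms_txt.py`; the sketch only mirrors the layout shown above (H1, blockquote description, overview, then one H2 per section).

```python
from dataclasses import dataclass, field


@dataclass
class Link:
    title: str
    url: str
    description: str


@dataclass
class Section:
    heading: str
    links: list[Link] = field(default_factory=list)


def render_llms_txt(brand: str, tagline: str, overview: str,
                    sections: list[Section]) -> str:
    """Render the template: H1, blockquote tagline, overview, one H2 per section."""
    lines = [f"# {brand}", "", f"> {tagline}", "", overview.strip(), ""]
    for section in sections:
        if not section.links:
            continue  # skip empty sections rather than emit a bare heading
        lines.append(f"## {section.heading}")
        lines.append("")
        for link in section.links:
            lines.append(f"- [{link.title}]({link.url}): {link.description}")
        lines.append("")
    # Collapse trailing blank lines into a single final newline.
    return "\n".join(lines).rstrip() + "\n"
```

Skipping empty sections is a design choice of this sketch; the actual script may handle them differently.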

Generation Methods

Method 1: From Sitemap (Automated)

python scripts/generate_llms_txt.py example.com --from-sitemap

Automatically fetches sitemap.xml, analyzes each page, and generates descriptions.
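
The sitemap step can be pictured roughly as follows. This is a sketch under stated assumptions, not the script's actual code: the `urls_from_sitemap` helper is hypothetical, it works on sitemap XML that has already been downloaded, and the default limit of 40 simply mirrors the page cap mentioned in the security assessment.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def urls_from_sitemap(xml_text: str, limit: int = 40) -> list[str]:
    """Return unique <loc> URLs from sitemap XML, in document order, up to `limit`."""
    root = ET.fromstring(xml_text)
    urls: list[str] = []
    for loc in root.iter(f"{SITEMAP_NS}loc"):
        url = (loc.text or "").strip()
        if url and url not in urls:
            urls.append(url)
        if len(urls) >= limit:
            break
    return urls
```

Note that a sitemap index file also uses `<loc>` elements, but they point at child sitemaps rather than pages; this sketch does not follow them.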

Method 2: Interactive (Guided)

python scripts/generate_llms_txt.py example.com --interactive

Prompts you for brand info and key URLs, then drafts descriptions.

Method 3: From URL List

python scripts/generate_llms_txt.py example.com --urls urls.txt

Where urls.txt contains one URL per line.

Validation

Validate an existing llms.txt file:

python scripts/validate_llms_txt.py llms.txt

Checks for:

  • Proper markdown structure
  • Valid URLs
  • No duplicate entries
  • Optimal link count (15-40 pages)
  • Factual tone (not promotional)
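
To make the structural checks above concrete, here is a minimal sketch of how a few of them might be implemented. It is not the code in `validate_llms_txt.py`; the `check_llms_txt` function and its messages are hypothetical. It covers the H1 line, link syntax, absolute URLs, duplicates, and the 15-40 link range, and leaves out the tone check and any live HEAD requests.

```python
import re
from urllib.parse import urlparse

# Matches "- [Title](URL)" with an optional ": description" suffix.
LINK_RE = re.compile(r"^- \[([^\]]+)\]\(([^)\s]+)\)(?::\s*(.*))?$")


def check_llms_txt(text: str, min_links: int = 15, max_links: int = 40) -> list[str]:
    """Return a list of human-readable problems; an empty list means no issues found."""
    problems: list[str] = []
    lines = text.splitlines()
    if not lines or not lines[0].startswith("# "):
        problems.append("first line should be an H1 with the brand name")
    seen: set[str] = set()
    count = 0
    for number, line in enumerate(lines, start=1):
        if not line.startswith("- "):
            continue  # only list items are expected to carry links
        match = LINK_RE.match(line)
        if match is None:
            problems.append(f"line {number}: list item is not '- [Title](URL): description'")
            continue
        url = match.group(2)
        parsed = urlparse(url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            problems.append(f"line {number}: not an absolute http(s) URL: {url}")
        if url in seen:
            problems.append(f"line {number}: duplicate URL: {url}")
        seen.add(url)
        count += 1
    if not min_links <= count <= max_links:
        problems.append(f"link count {count} is outside {min_links}-{max_links}")
    return problems
```

Returning a list of messages rather than raising on the first failure lets a caller report every problem in one pass.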

Quality Criteria

| Aspect | Good | Bad |
|---|---|---|
| Brand description | "GEOly AI is a GEO monitoring platform tracking brand visibility across ChatGPT, Perplexity, Gemini." | "We are the best AI SEO tool ever!" |
| Page descriptions | "Explains how to set up MCP integration with Claude Desktop" | "Our awesome docs page" |
| Link count | 15–40 curated pages | 500+ URLs (sitemap dump) |
| Tone | Factual, entity-focused | Promotional, keyword-stuffed |
| Structure | Clear sections by content type | Flat list or random order |

Output Formats

  • Markdown (default): Ready-to-deploy llms.txt
  • JSON: Structured data for programmatic use
  • HTML: Styled preview for stakeholder review

Advanced Usage

Custom Sections

python scripts/generate_llms_txt.py example.com \
  --sections "Products,API Reference,Case Studies,Changelog"

Exclude Patterns

python scripts/generate_llms_txt.py example.com \
  --exclude "/admin/,/private/,/draft/"
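
How the exclude patterns are matched is not documented here. One plausible reading, shown as a sketch only, is a plain substring match against each URL's path; the `filter_excluded` helper below is hypothetical, and the real script may use prefixes, globs, or regular expressions instead.

```python
from urllib.parse import urlparse


def filter_excluded(urls: list[str], exclude: str) -> list[str]:
    """Drop URLs whose path contains any comma-separated pattern (assumed substring match)."""
    patterns = [p.strip() for p in exclude.split(",") if p.strip()]
    kept: list[str] = []
    for url in urls:
        path = urlparse(url).path
        if any(pattern in path for pattern in patterns):
            continue
        kept.append(url)
    return kept
```

Under this assumption the trailing slash matters: `/admin/` excludes `/admin/login` but not `/administrators`.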

Multi-language Support

python scripts/generate_llms_txt.py example.com \
  --language zh-CN \
  --output llms-zh.txt

Deployment

Once generated, place the file at:

https://[your-domain]/llms.txt

Ensure it:

  • Returns HTTP 200
  • Is accessible without authentication
  • Has Content-Type: text/plain or text/markdown
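
The three deployment conditions above can be checked with the Python standard library. The following is a minimal sketch, not part of the skill's scripts: `evaluate_response` and `check_deployment` are hypothetical names, the User-Agent string is made up, and treating 401/403 as "requires authentication" is an assumption.

```python
import urllib.error
import urllib.request

ACCEPTED_TYPES = ("text/plain", "text/markdown")


def evaluate_response(status: int, content_type: str) -> list[str]:
    """Compare a status code and Content-Type header against the deployment checklist."""
    problems: list[str] = []
    if status in (401, 403):
        problems.append(f"HTTP {status}: file appears to require authentication")
    elif status != 200:
        problems.append(f"expected HTTP 200, got {status}")
    # Ignore parameters such as "; charset=utf-8".
    media_type = content_type.split(";", 1)[0].strip().lower()
    if media_type not in ACCEPTED_TYPES:
        problems.append(f"unexpected Content-Type: {content_type or '(missing)'}")
    return problems


def check_deployment(domain: str, timeout: float = 10.0) -> list[str]:
    """Fetch https://<domain>/llms.txt and report any checklist problems."""
    url = f"https://{domain}/llms.txt"
    request = urllib.request.Request(url, headers={"User-Agent": "llms-txt-check/0.1"})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return evaluate_response(response.status,
                                     response.headers.get("Content-Type", ""))
    except urllib.error.HTTPError as error:
        return evaluate_response(error.code, error.headers.get("Content-Type", ""))
    except urllib.error.URLError as error:
        return [f"could not reach {url}: {error.reason}"]
```

Because `urlopen` follows redirects, the status reported is that of the final response, so a redirect to a login page would surface as a Content-Type problem rather than a status problem.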
