Genai Toolkit

v1.0.0

Bridge AI models to databases through MCP with config and evaluation tools. Use when setting up DB tools, comparing engines, or evaluating prompt quality.

0· 143·1 current·1 all-time
bybytesagain4@xueyetianya

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for xueyetianya/genai-toolkit.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Genai Toolkit" (xueyetianya/genai-toolkit) from ClawHub.
Skill page: https://clawhub.ai/xueyetianya/genai-toolkit
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install genai-toolkit

ClawHub CLI

Package manager switcher

npx clawhub@latest install genai-toolkit
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description promise is a CLI toolkit for logging and evaluating generative-AI experiments. The included shell script implements the described commands and stores data under ~/.local/share/genai-toolkit, so required capabilities match the stated purpose.
Instruction Scope
SKILL.md instructs only local logging, exporting, searching, and status checks. The script reads/writes only within the data directory and standard system utilities (date, grep, tail, wc, du). There are no instructions to read unrelated system files, environment secrets, or to transmit data externally.
Install Mechanism
No install spec is provided (instruction-only skill). A single bash script is included; it is a plain shell implementation and does not download or extract remote code. Risk from install mechanism is low.
Credentials
The skill requests no environment variables or credentials. It does use $HOME to create ~/.local/share/genai-toolkit (expected). SKILL.md suggests logging items like 'API keys or environment settings' — this is a usage note rather than an automatic collection, but users should avoid storing secrets in plaintext log files.
Persistence & Privilege
Skill is not marked always:true and does not modify other skills or system-wide settings. It persists only its own data under the user's home directory and does not request elevated privileges.
Assessment
This appears to be a simple local logging CLI. Before installing: (1) Review the script if you want to be certain it won’t be changed; (2) be aware it stores all entries in plaintext at ~/.local/share/genai-toolkit — do not log secrets (API keys, passwords) there; (3) consider setting restrictive permissions on that directory (chmod 700) or using an encrypted workspace if you must record sensitive data; (4) exports (json/csv/txt) are created as files and could be shared accidentally, so handle exported files carefully.

Like a lobster shell, security has layers — review code before you run it.

latestvk97apc4gb20psrq7a9vvmynhhh8367j7
143downloads
0stars
1versions
Updated 1mo ago
v1.0.0
MIT-0

Genai Toolkit

Genai Toolkit v2.0.0 — an AI toolkit for managing generative AI workflows from the command line. Log configurations, benchmarks, prompts, evaluations, fine-tuning runs, cost tracking, and optimization notes. Each entry is timestamped and persisted locally. Works entirely offline — your data never leaves your machine.

Why Genai Toolkit?

  • Works entirely offline — your data never leaves your machine
  • Simple command-line interface with no GUI dependency
  • Export to JSON, CSV, or plain text at any time for sharing or archival
  • Automatic activity history logging across all commands
  • Each domain command doubles as both a logger and a viewer

Commands

Domain Commands

Each domain command works in two modes: log mode (with arguments) saves a timestamped entry, view mode (no arguments) shows the 20 most recent entries.

CommandDescription
genai-toolkit configure <input>Log a configuration note such as model parameters, API keys, or environment settings. Use this to record setup changes and track which configurations were active during experiments.
genai-toolkit benchmark <input>Log a benchmark result or performance observation. Record latency, throughput, accuracy, or other metrics to compare across runs and model versions.
genai-toolkit compare <input>Log a comparison note between models, configurations, or approaches. Useful for side-by-side evaluations like GPT-4 vs Claude on specific tasks.
genai-toolkit prompt <input>Log a prompt template or prompt engineering note. Track iterations on prompt design, record what worked, and document prompt versioning.
genai-toolkit evaluate <input>Log an evaluation result or quality metric. Record accuracy scores, F1 metrics, human ratings, or any qualitative assessment of model outputs.
genai-toolkit fine-tune <input>Log a fine-tuning run or hyperparameter note. Track epochs, learning rates, dataset sizes, and resulting model performance after fine-tuning.
genai-toolkit analyze <input>Log an analysis observation or insight. Record patterns found in data, failure mode analysis, or trends across experiments.
genai-toolkit cost <input>Log cost tracking data including API costs, compute expenses, and token consumption. Essential for budget monitoring across projects and providers.
genai-toolkit usage <input>Log usage metrics or consumption data. Track request volumes, token counts, rate limit encounters, and daily/monthly consumption patterns.
genai-toolkit optimize <input>Log optimization attempts or performance improvements. Record what was changed, the expected vs actual impact, and next steps.
genai-toolkit test <input>Log test results or test case notes. Record pass/fail outcomes, edge cases discovered, and regression test results.
genai-toolkit report <input>Log a report entry or summary finding. Capture weekly summaries, milestone reports, or executive-level findings from AI workflows.

Utility Commands

CommandDescription
genai-toolkit statsShow summary statistics across all log files, including entry counts per category and total data size on disk.
genai-toolkit export <fmt>Export all data to a file in the specified format. Supported formats: json, csv, txt. Output is saved to the data directory.
genai-toolkit search <term>Search all log entries for a term using case-insensitive matching. Results are grouped by log category for easy scanning.
genai-toolkit recentShow the 20 most recent entries from the unified activity log, giving a quick overview of recent work across all commands.
genai-toolkit statusHealth check showing version, data directory path, total entry count, disk usage, and last activity timestamp.
genai-toolkit helpShow the built-in help message listing all available commands and usage information.
genai-toolkit versionPrint the current version (v2.0.0).

Data Storage

All data is stored locally at ~/.local/share/genai-toolkit/. Each domain command writes to its own log file (e.g., configure.log, benchmark.log). A unified history.log tracks all actions across commands. Use export to back up your data at any time.

Requirements

  • Bash (4.0+)
  • No external dependencies — pure shell script
  • No network access required

When to Use

  • Tracking AI model benchmarks and comparisons across different providers and versions over time
  • Logging prompt engineering iterations to understand what improvements actually moved the needle
  • Monitoring API costs and token usage across multiple projects and billing periods
  • Evaluating fine-tuning experiments with detailed hyperparameter and metric tracking
  • Building a searchable knowledge base of optimization attempts and analysis insights

Examples

# Log a benchmark result
genai-toolkit benchmark "GPT-4o latency: avg 1.2s, p99 3.8s on summarization task, 500 samples"

# Track a cost entry
genai-toolkit cost "March batch processing: $42.50 across 15k requests, avg $0.0028/req"

# Compare two models
genai-toolkit compare "Claude 3.5 vs GPT-4o on code generation — Claude 15% faster, GPT-4o 5% more accurate"

# Log a prompt iteration
genai-toolkit prompt "v3: Added chain-of-thought instruction, reduced hallucination rate from 12% to 3%"

# Record a fine-tuning run
genai-toolkit fine-tune "SQL-gen model epoch 5: accuracy=0.96, loss=0.12, lr=2e-5, dataset=50k rows"

# View all statistics
genai-toolkit stats

# Export everything to JSON
genai-toolkit export json

# Search for entries mentioning latency
genai-toolkit search latency

# Check recent activity
genai-toolkit recent

# Health check
genai-toolkit status

Powered by BytesAgain | bytesagain.com | hello@bytesagain.com

Comments

Loading comments...