Install
openclaw skills install cluster
Perform data clustering analysis using k-means and hierarchical algorithms. Use when you need to group, classify, or segment datasets.
Cluster is a command-line data clustering analysis tool that supports k-means and hierarchical clustering algorithms. It reads numerical data from CSV/JSONL sources, performs clustering, evaluates cluster quality, and exports results.
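To illustrate the hierarchical option, here is a minimal Python sketch of agglomerative clustering. The single-linkage criterion and the function name `single_linkage` are assumptions made for illustration; this document does not say which linkage the tool uses.

```python
import math

def single_linkage(points, k):
    """Merge the two closest clusters until only k clusters remain."""
    clusters = [[i] for i in range(len(points))]  # start with one cluster per point
    while len(clusters) > k:
        best = None  # (distance, index_a, index_b) of the closest pair found so far
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Single linkage: cluster distance is the closest pair of members.
                d = min(math.dist(points[i], points[j])
                        for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a].extend(clusters[b])
        del clusters[b]
    return clusters

# Hypothetical data: two well-separated pairs of points.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
groups = single_linkage(points, k=2)
```

Each entry in `groups` is a list of point indices. This brute-force version is only meant to show the idea and would be slow on large datasets.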
Data is stored in ~/.cluster/data.jsonl as JSONL records. Each record represents a clustering run with its parameters, assignments, centroids, and evaluation metrics.
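Because the store is plain JSONL, it can be inspected outside the tool. The following Python sketch reads it one record per line; the helper names `parse_runs` and `load_runs` are hypothetical, and the field names `id`, `algorithm`, and `k` follow the record layout this document lists for the data file.

```python
import json
from pathlib import Path

DATA_FILE = Path.home() / ".cluster" / "data.jsonl"  # location stated in this document

def parse_runs(lines):
    """Parse JSONL text: one JSON object per non-empty line."""
    return [json.loads(line) for line in lines if line.strip()]

def load_runs(path=DATA_FILE):
    """Return all stored runs, or an empty list if the file does not exist yet."""
    if not path.exists():
        return []
    with path.open(encoding="utf-8") as handle:
        return parse_runs(handle)

# Hypothetical record; the values are made up for illustration.
sample = ['{"id": "abc123", "algorithm": "kmeans", "k": 3}\n', "\n"]
runs = parse_runs(sample)
```

Calling `load_runs()` with no arguments would read the real store, if one exists.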
run
Run a clustering algorithm on input data.
Environment Variables:
INPUT (required): Path to input CSV/JSONL file with numerical data
K: Number of clusters (default: 3)
ALGORITHM: Algorithm to use, either kmeans or hierarchical (default: kmeans)
MAX_ITER: Maximum iterations for k-means (default: 100)
SEED: Random seed for reproducibility
Example:
INPUT=/path/to/data.csv K=5 ALGORITHM=kmeans bash scripts/script.sh run
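To show what K, MAX_ITER, and SEED control, here is a minimal Python sketch of k-means (Lloyd's algorithm). It is an illustration under assumptions, not the tool's actual code: the function name `kmeans`, the initialisation from randomly sampled data points, and the handling of empty clusters are all choices made for this example.

```python
import math
import random

def kmeans(points, k=3, max_iter=100, seed=None):
    """Lloyd's algorithm: alternate assignment and centroid update steps."""
    rng = random.Random(seed)          # a fixed seed makes the run reproducible
    centroids = rng.sample(points, k)  # assumption: initialise from k distinct data points
    assignments = []
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        if new_assignments == assignments:
            break  # converged: no point changed cluster
        assignments = new_assignments
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return centroids, assignments

# Hypothetical data: two visually obvious groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, assignments = kmeans(points, k=2, max_iter=100, seed=42)
```

The cluster labels themselves are arbitrary. What matters is which points share a label.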
assign
Assign new data points to existing clusters from a previous run.
Environment Variables:
RUN_ID (required): ID of the clustering run to use
INPUT (required): Path to new data points (CSV/JSONL)
Example:
RUN_ID=abc123 INPUT=/path/to/new_data.csv bash scripts/script.sh assign
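One plausible way to assign new points to existing clusters is a nearest-centroid lookup, sketched below in Python. Whether the tool works this way, especially for hierarchical runs, is an assumption, and `assign_points` is a hypothetical helper name.

```python
import math

def assign_points(new_points, centroids):
    """Return, for each new point, the index of its nearest centroid."""
    return [
        min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
        for p in new_points
    ]

# Hypothetical centroids from an earlier run and two new observations.
centroids = [(1.0, 1.0), (8.0, 9.0)]
labels = assign_points([(0.5, 1.5), (9.0, 8.0)], centroids)
```

The centroids are not recomputed here, so the stored run stays unchanged.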
centroids
Display or export centroid coordinates for a clustering run.
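As a rough idea of how centroid coordinates could be rendered in different output formats, consider the Python sketch below. The exact layout the tool prints is not documented here, so the table header and column widths are assumptions, as is the helper name `format_centroids`.

```python
import csv
import io
import json

def format_centroids(centroids, fmt="table"):
    """Render centroid coordinates as 'table', 'json', or 'csv' text."""
    if fmt == "json":
        return json.dumps(centroids)
    if fmt == "csv":
        buffer = io.StringIO()
        csv.writer(buffer, lineterminator="\n").writerows(centroids)
        return buffer.getvalue()
    if fmt == "table":
        # One row per cluster: index, then fixed-width coordinates.
        rows = [f"{i:>7}  " + "  ".join(f"{v:8.3f}" for v in c)
                for i, c in enumerate(centroids)]
        return "\n".join(["cluster  coordinates"] + rows)
    raise ValueError(f"unknown format: {fmt}")

centroids = [[1.0, 2.0], [8.5, 9.0]]  # hypothetical values
as_csv = format_centroids(centroids, "csv")
as_json = format_centroids(centroids, "json")
as_table = format_centroids(centroids)
```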
Environment Variables:
RUN_ID (required): ID of the clustering run
FORMAT: Output format, one of table, json, csv (default: table)
evaluate
Evaluate clustering quality with silhouette score, inertia, and Davies-Bouldin index.
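Two of these metrics can be sketched in a few lines of Python using their textbook definitions with Euclidean distance. This is an illustration only: the distance measure and the treatment of single-point clusters are assumptions, and the Davies-Bouldin index is not covered.

```python
import math

def inertia(points, assignments, centroids):
    """Sum of squared distances from each point to its assigned centroid."""
    return sum(math.dist(p, centroids[a]) ** 2 for p, a in zip(points, assignments))

def silhouette(points, assignments):
    """Mean silhouette coefficient; needs at least two clusters."""
    labels = set(assignments)
    scores = []
    for i, p in enumerate(points):
        # Mean distance from p to the members of each cluster, excluding p itself.
        mean_dist = {}
        for label in labels:
            others = [q for j, q in enumerate(points)
                      if assignments[j] == label and j != i]
            if others:
                mean_dist[label] = sum(math.dist(p, q) for q in others) / len(others)
        own = assignments[i]
        if own not in mean_dist:
            scores.append(0.0)  # p is alone in its cluster
            continue
        a = mean_dist[own]  # cohesion: mean distance within own cluster
        b = min(d for label, d in mean_dist.items() if label != own)  # nearest other cluster
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)

# Hypothetical data: two tight, well-separated clusters.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
assignments = [0, 0, 1, 1]
centroids = [(0.0, 0.5), (10.0, 0.5)]
total_inertia = inertia(points, assignments, centroids)
score = silhouette(points, assignments)
```

Lower inertia means tighter clusters. The silhouette score ranges from -1 to 1, and values near 1 indicate well-separated clusters.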
Environment Variables:
RUN_ID (required): ID of the clustering run to evaluate
visualize
Generate a text-based or ASCII visualization of cluster assignments.
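A text-based cluster plot can be as simple as a character grid with one digit per cluster. The Python sketch below shows the idea for two dimensions; the grid size, the digit labels, and the function name `ascii_scatter` are assumptions and need not match the tool's output.

```python
def ascii_scatter(points, assignments, width=40, height=12):
    """Plot 2-D points on a character grid, one digit per cluster label."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    x_min, y_min = min(xs), min(ys)
    x_span = (max(xs) - x_min) or 1.0  # avoid division by zero for constant columns
    y_span = (max(ys) - y_min) or 1.0
    grid = [[" "] * width for _ in range(height)]
    for (x, y), label in zip(points, assignments):
        col = round((x - x_min) / x_span * (width - 1))
        # Row 0 is the top of the plot, so larger y values map to smaller rows.
        row = (height - 1) - round((y - y_min) / y_span * (height - 1))
        grid[row][col] = str(label % 10)
    return "\n".join("".join(line) for line in grid)

# Hypothetical points and cluster labels.
points = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 9.5)]
plot = ascii_scatter(points, [0, 0, 1, 1])
print(plot)
```

Points that fall into the same grid cell overwrite each other, which is an accepted limitation of this kind of plot.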
Environment Variables:
RUN_ID (required): ID of the clustering run
DIMS: Dimensions to plot, comma-separated (default: first two)
export
Export clustering results to a file.
Environment Variables:
RUN_ID (required): ID of the run to export
OUTPUT: Output file path (default: stdout)
FORMAT: Export format, one of json, csv, jsonl (default: json)
import
Import a previously exported clustering run.
Environment Variables:
INPUT (required): Path to the file to import
config
View or update configuration settings.
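The Python sketch below shows one way stored settings could be combined with built-in defaults. The default values are the ones this document lists for the config file; the merge behaviour (unknown keys ignored) and the helper names `merge_config` and `load_config` are assumptions.

```python
import json
from pathlib import Path

CONFIG_FILE = Path.home() / ".cluster" / "config.json"  # location stated in this document

# Defaults as listed in this document.
DEFAULTS = {
    "default_k": 3,
    "default_algorithm": "kmeans",
    "max_iterations": 100,
    "random_seed": 42,
}

def merge_config(stored):
    """Overlay stored settings on the defaults, ignoring unknown keys."""
    return {key: stored.get(key, value) for key, value in DEFAULTS.items()}

def load_config(path=CONFIG_FILE):
    """Read the config file if present; otherwise fall back to the defaults."""
    stored = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    return merge_config(stored)

# Hypothetical stored settings: only default_k was changed.
config = merge_config({"default_k": 5})
```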
Environment Variables:
KEY: Configuration key to set
VALUE: Configuration value
list
List all stored clustering runs with summary info.
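Sorting and truncating the stored runs might look like the Python sketch below. Several details are assumptions: that "score" refers to the silhouette value under `metrics`, that results are ordered from highest to lowest, and that timestamps share one ISO 8601 format so they sort correctly as strings. `list_runs` is a hypothetical helper name.

```python
def list_runs(runs, limit=20, sort="date"):
    """Return up to `limit` runs, highest value first for the chosen field."""
    keys = {
        "date": lambda r: r.get("timestamp", ""),
        "k": lambda r: r.get("k", 0),
        # Assumption: "score" means the silhouette value stored under metrics.
        "score": lambda r: r.get("metrics", {}).get("silhouette", float("-inf")),
    }
    if sort not in keys:
        raise ValueError(f"unknown sort field: {sort}")
    return sorted(runs, key=keys[sort], reverse=True)[:limit]

# Hypothetical records; the values are made up for illustration.
runs = [
    {"id": "a", "timestamp": "2024-01-01T00:00:00Z", "k": 3, "metrics": {"silhouette": 0.4}},
    {"id": "b", "timestamp": "2024-02-01T00:00:00Z", "k": 5, "metrics": {"silhouette": 0.7}},
    {"id": "c", "timestamp": "2024-01-15T00:00:00Z", "k": 2, "metrics": {"silhouette": 0.1}},
]
by_date = [r["id"] for r in list_runs(runs)]
best = list_runs(runs, limit=1, sort="score")[0]["id"]
```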
Environment Variables:
LIMIT: Maximum runs to display (default: 20)
SORT: Sort field, one of date, k, score (default: date)
stats
Show aggregate statistics across all clustering runs.
help
Display usage information and available commands.
version
Display the current version of the cluster tool.
All clustering runs are stored in ~/.cluster/data.jsonl. Each line is a JSON object with fields:
id: Unique run identifier
timestamp: ISO 8601 creation time
algorithm: Algorithm used
k: Number of clusters
centroids: List of centroid coordinates
assignments: Mapping of data point indices to cluster IDs
metrics: Evaluation metrics (silhouette, inertia, etc.)
input_file: Source data file path
num_points: Number of data points clustered
Config is stored in ~/.cluster/config.json. Available keys:
default_k: Default number of clusters (default: 3)
default_algorithm: Default algorithm (default: kmeans)
max_iterations: Default max iterations (default: 100)
random_seed: Default random seed (default: 42)
Powered by BytesAgain | bytesagain.com | hello@bytesagain.com