Install
openclaw skills install dedupeDeduplication reference — exact matching, fuzzy matching, hash-based dedup, bloom filters, and data quality. Use when removing duplicate records, files, or data entries.
openclaw skills install dedupeQuick-reference skill for deduplication strategies, algorithms, and data quality patterns.
introscripts/script.sh intro
Overview of deduplication — types, strategies, and tradeoffs.
exactscripts/script.sh exact
Exact deduplication — hash-based, key-based, and sorting approaches.
fuzzyscripts/script.sh fuzzy
Fuzzy deduplication — similarity measures, blocking, and record linkage.
filesscripts/script.sh files
File-level deduplication — fdupes, jdupes, rdfind, and storage dedup.
algorithmsscripts/script.sh algorithms
Dedup algorithms — bloom filters, HyperLogLog, MinHash, SimHash.
sqlscripts/script.sh sql
SQL deduplication patterns — ROW_NUMBER, DISTINCT, GROUP BY strategies.
cliscripts/script.sh cli
Command-line dedup tools — sort, uniq, awk, and stream processing.
checklistscripts/script.sh checklist
Deduplication quality checklist and validation steps.
helpscripts/script.sh help
versionscripts/script.sh version
| Variable | Description |
|---|---|
DEDUPE_DIR | Data directory (default: ~/.dedupe/) |
Powered by BytesAgain | bytesagain.com | hello@bytesagain.com