{"skill":{"slug":"data-toolkit","displayName":"Data Toolkit","summary":"Complete data conversion, validation, and cleaning toolkit. Convert between JSON/CSV/YAML/XML, validate schemas, clean duplicates and nulls. Essential utilit...","description":"---\nname: data-toolkit\ndescription: Complete data conversion, validation, and cleaning toolkit. Convert between JSON/CSV/YAML/XML, validate schemas, clean duplicates and nulls. Essential utilities for data processing workflows.\nversion: 1.0.0\nauthor: Forge\nmetadata:\n  {\n    \"openclaw\":\n      {\n        \"requires\": { \"bins\": [\"node\", \"python3\"] },\n        \"install\": []\n      }\n  }\n---\n\n# Data Toolkit\n\nComplete data processing utilities for OpenClaw agents.\n\n## Features\n\n### Converters\n- **JSON ↔ CSV** - Bidirectional conversion with schema inference\n- **JSON ↔ YAML** - Clean formatting, comment preservation\n- **JSON ↔ XML** - Configurable root elements and attributes\n- **CSV ↔ YAML** - Direct conversion without intermediate steps\n- **Multi-format batch conversion** - Process entire directories\n\n### Validators\n- **JSON Schema validation** - Validate against JSON Schema specs\n- **CSV structure validation** - Check headers, columns, data types\n- **Data type inference** - Automatic type detection and validation\n- **Custom rules** - Define business logic validations\n\n### Cleaners\n- **Duplicate removal** - Smart deduplication with configurable keys\n- **Null/empty handling** - Remove or replace null values\n- **Data normalization** - Standardize formats (dates, numbers, strings)\n- **Whitespace cleanup** - Trim, collapse multiple spaces\n- **Column operations** - Remove, rename, reorder columns\n\n## Get Data Toolkit\n\n🛒 **Gumroad (€10):** https://nexusatlas.gumroad.com/l/bsyacx  \n📦 **ClawHub:** https://clawhub.ai/skills/data-toolkit\n\nMIT License — Python 3.8+, zero dependencies.\n\n## Usage\n\n### Convert Data\n\n```bash\n# JSON to CSV\n./src/convert.py --input data.json --output data.csv --format csv\n\n# CSV to JSON\n./src/convert.py --input data.csv --output data.json --format json\n\n# JSON to YAML\n./src/convert.py --input data.json --output data.yaml --format yaml\n\n# XML to JSON\n./src/convert.py --input data.xml --output data.json --format json\n\n# Batch conversion\n./src/convert.py --input-dir ./raw --output-dir ./processed --format json\n```\n\n### Validate Data\n\n```bash\n# Validate against JSON schema\n./src/validate.py --input data.json --schema schema.json\n\n# Validate CSV structure\n./src/validate.py --input data.csv --check-headers --check-types\n\n# Custom validation rules\n./src/validate.py --input data.json --rules validation-rules.yaml\n```\n\n### Clean Data\n\n```bash\n# Remove duplicates\n./src/clean.py --input data.json --dedupe --key id\n\n# Handle nulls\n./src/clean.py --input data.csv --remove-nulls\n./src/clean.py --input data.csv --replace-nulls \"N/A\"\n\n# Normalize data\n./src/clean.py --input data.json --normalize dates,numbers,strings\n\n# Full cleanup pipeline\n./src/clean.py --input messy.csv --dedupe --remove-nulls --normalize all --output clean.csv\n```\n\n## API Usage (Python)\n\n```python\nfrom data_toolkit import convert, validate, clean\n\n# Convert\nconvert.json_to_csv('input.json', 'output.csv')\nconvert.csv_to_yaml('input.csv', 'output.yaml')\n\n# Validate\nis_valid = validate.json_schema('data.json', 'schema.json')\nerrors = validate.csv_structure('data.csv')\n\n# Clean\nclean.remove_duplicates('data.json', key='id')\nclean.normalize_dates('data.csv', format='ISO8601')\n```\n\n## Examples\n\nSee `examples/` directory for complete workflows:\n- `examples/etl-pipeline.sh` - Full ETL workflow\n- `examples/api-data-processing.py` - API response processing\n- `examples/batch-conversion.sh` - Bulk file conversion\n\n## Installation\n\nDependencies are minimal and common:\n- Python 3.8+\n- PyYAML\n- pandas (optional, for advanced CSV operations)\n\n```bash\npip install pyyaml pandas\n```\n\n## Requirements\n\n- Node.js (for JSON/YAML parsing)\n- Python 3.8+\n- 10MB disk space\n\n## License\n\nMIT\n\n## Support\n\nIssues: https://github.com/forge-agent/data-toolkit\nDocs: See `docs/` directory\n","topics":["CSV","Data Processing","Json"],"tags":{"latest":"1.0.1"},"stats":{"comments":0,"downloads":403,"installsAllTime":15,"installsCurrent":0,"stars":0,"versions":2},"createdAt":1777912031038,"updatedAt":1778492846193},"latestVersion":{"version":"1.0.1","createdAt":1777920925086,"changelog":"Added Gumroad purchase link","license":"MIT-0"},"metadata":{"setup":[],"os":null,"systems":null},"owner":{"handle":"atlasnexusops","userId":"s1768dx4c1e9cwh7qyzb69dfmx86377b","displayName":"AtlasNexusOps","image":"https://avatars.githubusercontent.com/u/263571615?v=4"},"moderation":null}