{"skill":{"slug":"data-validator-pro","displayName":"Data Validator Pro","summary":"Data quality validation and profiling toolkit for tabular data. Use when checking data completeness, detecting anomalies, validating schemas, profiling datas...","description":"---\nname: data-quality-validator\ndescription: Data quality validation and profiling toolkit for tabular data. Use when checking data completeness, detecting anomalies, validating schemas, profiling datasets, or assessing data cleanliness. Triggers on phrases like \"data quality\", \"data validation\", \"schema validation\", \"data profiling\", \"missing data\", \"anomaly detection\", \"data completeness\", \"dirty data\".\n---\n\n# Data Quality Validator\n\nToolkit for validating and profiling tabular data quality.\n\n## Features\n\n- **Schema validation** - Check column types, constraints, and rules\n- **Completeness analysis** - Missing value detection and reporting\n- **Anomaly detection** - Statistical outlier detection\n- **Profiling** - Summary statistics and distribution analysis\n- **Constraint checking** - Range checks, uniqueness, regex patterns\n\n## Quick Start\n\n```python\nimport pandas as pd\n\nfrom scripts.data_profiler import DataProfiler\nfrom scripts.schema_validator import SchemaValidator\n\n# Illustrative sample data (hypothetical values); replace with your own DataFrame\ndf = pd.DataFrame({\n    \"id\": [1, 2, 3],\n    \"age\": [34, 29, 41],\n    \"email\": [\"a@example.com\", \"b@example.com\", \"c@example.com\"]\n})\n\n# Profile a dataset\nprofiler = DataProfiler()\nreport = profiler.profile(df)  # pandas DataFrame\nprint(report[\"missing\"])\nprint(report[\"outliers\"])\n\n# Validate against schema\nschema = {\n    \"age\": {\"type\": \"int\", \"min\": 0, \"max\": 150},\n    \"email\": {\"type\": \"str\", \"regex\": r\"^\\S+@\\S+\\.\\S+$\"},\n    \"id\": {\"type\": \"int\", \"unique\": True}\n}\nvalidator = SchemaValidator(schema)\nerrors = validator.validate(df)\nfor err in errors:\n    print(err)\n```\n\n## Scripts\n\n- `scripts/data_profiler.py` - Dataset profiling and summary stats\n- `scripts/schema_validator.py` - Schema-based validation engine\n- `scripts/anomaly_detector.py` - Statistical anomaly detection\n\n## References\n\n- `references/validation_rules.md` - Common validation patterns\n","tags":{"latest":"1.0.0"},"stats":{"comments":0,"downloads":398,"installsAllTime":15,"installsCurrent":0,"stars":0,"versions":1},"createdAt":1778115166502,"updatedAt":1778492864292},"latestVersion":{"version":"1.0.0","createdAt":1778115166502,"changelog":"Initial release: Data profiling, schema validation, and anomaly detection for tabular data","license":"MIT-0"},"metadata":null,"owner":{"handle":"kaiyuelv","userId":"s171c2g5qpr0rdyra8xbm4srbh84mjjm","displayName":"Lv Lancer","image":"https://avatars.githubusercontent.com/u/29176686?v=4"},"moderation":null}