Data Cleaner Skill

v1.1.0

AI-powered tool for cleaning Excel/CSV data by removing duplicates, handling missing values, standardizing formats, detecting outliers, and batch processing...

0· 280·1 current·1 all-time

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for shenghoo123-png/data-cleaner-tool.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Data Cleaner Skill" (shenghoo123-png/data-cleaner-tool) from ClawHub.
Skill page: https://clawhub.ai/shenghoo123-png/data-cleaner-tool
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install data-cleaner-tool

ClawHub CLI

Package manager switcher

npx clawhub@latest install data-cleaner-tool
Security Scan
VirusTotalVirusTotal
Pending
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
The name/description match the included code: a Python/pandas-based data cleaner. The README and SKILL.md mention batch processing and a batch_clean.py example, but no batch_clean.py is included in the package — a minor inconsistency. Overall, required tools (pandas/openpyxl) are appropriate for the stated purpose.
Instruction Scope
Runtime instructions and examples only run local Python scripts on local files and request installing pandas/openpyxl. They do not instruct reading unrelated system files or exporting data to external endpoints. One mismatch: the SKILL.md and README call this an 'AI-driven' tool, but the provided script is heuristic/deterministic (no external model/API calls).
Install Mechanism
No install spec; dependencies are installed via standard pip (pandas, openpyxl) as documented. No downloads from untrusted URLs or archive extraction are present.
Credentials
The skill requests no environment variables, no credentials, and no special config paths — proportional to a local data-cleaning tool.
Persistence & Privilege
Skill is not marked always:true and is user-invocable (normal). It does not request persistent system-wide privileges or modify other skills/config.
Assessment
This package appears to be a straightforward local data-cleaner and does not request credentials or make network calls, so risk is low. Before installing/running: 1) note that README/SKILL.md mention batch_clean.py but that file is missing — batch-processing example is incomplete; 2) the script is deterministic (pandas/regex rules), not actually calling any external AI service despite the 'AI-driven' wording; 3) there are small code/documentation mismatches (e.g., --fix-date handling in the CLI and the function signature) but these are bugs, not malicious behavior; 4) sanitize or review outputs if your data may contain untrusted spreadsheet formulas: the tool writes CSV/Excel but does not sanitize leading characters (e.g., =,+,-,@) which can be a CSV/Excel formula-injection risk when opening in spreadsheet apps; 5) run it first on non-sensitive test data or in a sandbox, and inspect the generated _cleaned files to confirm behavior. If you need batch processing, request the missing batch script from the author or implement your own wrapper.

Like a lobster shell, security has layers — review code before you run it.

latestvk97fafb5spf2q5bg75jbyc78cx84723x
280downloads
0stars
4versions
Updated 3w ago
v1.1.0
MIT-0

Data Cleaner Pro - 数据清洗工具

简介

AI驱动的数据清洗工具,自动处理Excel/CSV数据中的重复、缺失、格式错误等问题。

适用场景:运营报表、财务数据、客户名单清洗

功能特性

  • 智能去重 - 保留最新/最早记录
  • 缺失值处理 - 填充/删除/预测
  • 格式标准化 - 手机号、邮箱、日期等
  • 异常值检测 - 统计方法 + 规则方法
  • 批量处理 - 多个文件同时清洗

使用方式

触发词

  • "清洗数据"
  • "去重处理"
  • "整理Excel"

输入

  • CSV/Excel 文件路径
  • 清洗规则(可选,默认智能模式)

输出

  • 清洗后的文件
  • 清洗报告(处理了xx条,删除了xx条)

技术栈

  • pandas:数据处理
  • Python:核心逻辑
  • openpyxl:Excel支持

安装依赖

pip install pandas openpyxl

使用示例

# 智能清洗
python scripts/clean_data.py data.csv

# 自定义规则
python scripts/clean_data.py data.csv --dedup --fillna mean --fix-phone

# 批量处理
python scripts/batch_clean.py folder/

使用说明

  • 免费使用
  • 欢迎反馈问题和建议

适用人群

  • 运营人员:整理用户数据
  • 财务人员:清洗账单数据
  • 行政人员:整理员工名单
  • 销售人员:客户名单去重

Comments

Loading comments...