Install
openclaw skills install smart-data-insightsEnterprise-grade data analysis assistant. Clean, analyze, and visualize data automatically. **Triggers when user mentions:** - Data cleaning: "数据清洗", "整理数据", "清理数据", "数据预处理" - Data analysis: "分析数据", "数据分析", "数据报表", "生成报告" - Visualization: "画图", "图表", "可视化", "生成图表" - Excel/CSV: "处理Excel", "分析CSV", "读取表格" - Insights: "数据洞察", "发现规律", "趋势分析" Supports Excel (.xlsx), CSV, JSON formats. Generates reports, charts, and insights.
openclaw skills install smart-data-insightsAutomatically clean, analyze, and visualize enterprise data.
| Feature | Description | Reference |
|---|---|---|
| Data Cleaning | Remove duplicates, handle missing values, standardize formats | references/data_cleaning.md |
| Data Analysis | Statistics, trends, correlations | references/data_analysis.md |
| Visualization | Charts, graphs, dashboards | references/visualization.md |
| Report Generation | Automated insights and recommendations | references/report_generation.md |
Place your data file (Excel/CSV/JSON) in a known location.
# Basic analysis
{baseDir}/tools/analyze.py data.csv
# With specific options
{baseDir}/tools/analyze.py data.xlsx --clean --visualize --report
Output includes:
| Tool | Function | Input | Output |
|---|---|---|---|
analyze.py | Main analysis entry point | Data file | Summary + options |
clean.py | Data cleaning | Raw data | Clean data |
visualize.py | Generate charts | Data | PNG/PDF charts |
report.py | Generate reports | Analysis results | Markdown report |
"帮我分析这个销售数据"
# Place your file as sales_data.csv
{baseDir}/tools/analyze.py sales_data.csv
Output:
✅ Data loaded: 1,234 rows, 8 columns
📊 Summary statistics generated
📈 Visualization: sales_trend.png
💡 3 key insights found
"清洗并分析客户数据"
{baseDir}/tools/analyze.py customer_data.xlsx --clean --visualize
"生成完整的数据报告"
{baseDir}/tools/analyze.py data.csv --report --output report.md
| Format | Read | Write | Notes |
|---|---|---|---|
| CSV | ✅ | ✅ | Universal format |
| Excel (.xlsx) | ✅ | ✅ | Requires openpyxl |
| JSON | ✅ | ✅ | Structured data |
| TSV | ✅ | ✅ | Tab-separated |
| File | Description |
|---|---|
*_cleaned.csv | Cleaned data |
*_summary.txt | Statistical summary |
*_chart_*.png | Visualizations |
*_report.md | Full analysis report |
# Specific columns only
{baseDir}/tools/analyze.py data.csv --columns "sales,date,region"
# Time series analysis
{baseDir}/tools/analyze.py data.csv --timeseries --date-column "date"
# Group by category
{baseDir}/tools/analyze.py data.csv --group-by "region" --aggregate "sum,mean"
# Chart types
{baseDir}/tools/visualize.py data.csv --type bar
{baseDir}/tools/visualize.py data.csv --type line
{baseDir}/tools/visualize.py data.csv --type scatter
{baseDir}/tools/visualize.py data.csv --type pie
# Styling
{baseDir}/tools/visualize.py data.csv --style professional
{baseDir}/tools/visualize.py data.csv --colors "blue,green,red"
# Install dependencies
pip3 install pandas openpyxl matplotlib seaborn
# Verify installation
python3 -c "import pandas, matplotlib; print('Dependencies OK')"
"Module not found"
pip3 install pandas openpyxl matplotlib
"File encoding error"
--encoding gbk"Memory error with large files"
--chunk-size 10000--sample 0.1