Community Data Process

v1.0.0

北汽社群数据导出:数据清洗 → 数据校对 → 数据合并。 从客户群导出文件中筛选温冷一期和试点店标签数据, 校对关键指标一致性,增量合并到 BI_社群数据上传表。 统计日期使用源文件创建时间(下载日期)。 每天 00:55 自动执行。

0· 139·1 current·1 all-time

Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for yuhui435/community-data-process.

Previewing Install & Setup.
Prompt PreviewInstall & Setup
Install the skill "Community Data Process" (yuhui435/community-data-process) from ClawHub.
Skill page: https://clawhub.ai/yuhui435/community-data-process
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install community-data-process

ClawHub CLI

Package manager switcher

npx clawhub@latest install community-data-process
Security Scan
VirusTotalVirusTotal
Benign
View report →
OpenClawOpenClaw
Benign
high confidence
Purpose & Capability
Name/description describe exporting, cleaning, auditing and merging Excel data from customer-group exports; the included run.py implements precisely those actions (find latest export in Downloads, filter by labels, validate numeric columns, map and append rows to BI file). No unrelated credentials, binaries, or services are requested.
Instruction Scope
SKILL.md and run.py both operate on local Excel files in the user's Downloads folder and on a local BI Excel file. The script uses file creation time as the 'download date', reads/writes only local files, and writes a small result text file to Downloads. There is a minor robustness issue (column selection by index and a hard-coded Windows-style Downloads path) that could cause incorrect behavior if source files differ, but it is scope-consistent rather than malicious.
Install Mechanism
No install spec and no external downloads — instruction-only plus a Python script. Dependencies are standard (Python 3.11+, pandas, openpyxl). Nothing is fetched from arbitrary URLs or installed system-wide.
Credentials
The skill requires no environment variables, credentials, or config paths. The script exclusively accesses local files in the Downloads directory and writes output there; requested access is proportional to the stated file-processing task.
Persistence & Privilege
always:false (default) and no mechanism to persist or modify other skills or global agent settings. The script writes its own result file in Downloads but does not alter other skills or request elevated privileges. Autonomous invocation is allowed by platform default but does not combine with other privilege escalations here.
Assessment
This skill appears to do what it says: local Excel processing (filter → audit → merge) with no network calls or credential access. Before installing or enabling autonomous runs, test it on copies of your files: it uses the Windows-style Downloads path and selects columns by index (position), so mismatched column orders or non-Windows environments may cause errors or incorrect mappings. It appends rows without de-duplication, so running multiple times can create duplicates — keep backups of the BI file. Also note it writes a small result file (.baic_community_result.txt) to Downloads. If you plan to run it automatically, schedule only after confirming mappings are correct and that the Downloads folder contains the expected filenames and column layout.

Like a lobster shell, security has layers — review code before you run it.

latestvk970b3nkj3hb38ykrkvsc9bn6983at8q
139downloads
0stars
1versions
Updated 1mo ago
v1.0.0
MIT-0

北汽社群数据导出

每日社群数据导出 → 清洗 → 校对 → 合并到 BI 系统。

流程概览

1. 数据清洗 → 2. 数据校对 → 3. 数据合并 → 4. 最终验证

使用方式

一键执行全流程

python ~/.openclaw/workspace-pm/skills/community-data-process/run.py

分步执行

# 第 1 步:数据清洗
python ~/.openclaw/workspace-pm/skills/community-data-process/run.py clean

# 第 2 步:数据校对
python ~/.openclaw/workspace-pm/skills/community-data-process/run.py audit

# 第 3 步:数据合并
python ~/.openclaw/workspace-pm/skills/community-data-process/run.py merge

# 第 4 步:最终验证
python ~/.openclaw/workspace-pm/skills/community-data-process/run.py verify

规则

清洗规则

  • 源文件:Downloads 目录下按**创建时间(下载时间)**排序,取最新的 客户群导出*.xlsx
  • 筛选条件:O 列(群标签)= 温冷一期 或 试点店
  • 数字列格式转换:群人数、员工人数、客户人数、今日入群、今日退群、今日消息 → int

校对规则

  • 三方对比:源文件 vs 清理后 vs 合并后
  • 8 个指标全部一致才通过
  • 数据质量:无空值、无负值

合并规则

  • 模式:增量添加(不去重)
  • 列映射:源文件 A-O 列 → 目标文件 E-S 列
  • D 列统计日期 = 源文件的创建时间(下载日期),文件是哪天下载的就填哪天(不是脚本执行日期)
  • A-C 列留空

列映射

源文件目标文件列名
AE群 ID
BF群名称
CG群主
DH群管理员
EI群人数
FJ群活跃
GK群类型
HL员工人数
IM客户人数
JN今日入群
KO今日退群
LP今日消息
MQ入群时间
NR最后发言时间
OS群标签

输出文件

文件说明
客户群导出_清理后_温冷一期 + 试点店_YYYYMMDD.xlsx清洗后数据
BI_社群数据上传_已更新_YYYYMMDD.xlsx合并后最终文件
数据校对报告_YYYYMMDD.txt校对报告

常见问题

Q1: 今日退群数据不一致

检查是否使用了正确的源文件。不同日期导出的文件数据不同。

Q2: 统计日期错误

脚本自动读取源文件的下载日期,不需要手动指定。

Q3: 数字列显示为文本

脚本自动转换 6 个数字列为 int 格式。

依赖

  • Python 3.11+
  • pandas
  • openpyxl

Comments

Loading comments...