Kafka write to Hive and SR (kafka写入hive和sr)

v1.0.1

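Generates a new Flink monitoring job for the bethune project that dual-writes from Kafka to Hive and StarRocks. It follows the pattern of Bus_Search_ReplacePrice_KafkaToStarRock_34 and the neighboring tasks 33/35/36, and automatically produces the Job class, MessageModel, PO, 4 config.p...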


by@printsky

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for printsky/flink-kafka-dual-write1.


Install the skill "kafka写入hive和sr" (printsky/flink-kafka-dual-write1) from ClawHub.
Skill page: https://clawhub.ai/printsky/flink-kafka-dual-write1
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install flink-kafka-dual-write1

ClawHub CLI


npx clawhub@latest install flink-kafka-dual-write1

Security Scan

VirusTotal

Suspicious


OpenClaw

Benign

high confidence

✓

Purpose & Capability

Name and description claim to generate Flink Job/MessageModel/Po and update four config files and optionally run a compile; SKILL.md only requires reading repo artifacts, writing Java and properties files, and optionally running 'mvn -DskipTests compile'. No unrelated credentials, binaries, or installs are requested, so the declared requirements match the claimed purpose.

ℹ

Instruction Scope

Instructions are narrowly focused on reading reference artifacts, producing three Java files and four config changes, and optionally running mvn compile. It does assume the agent has read/write access to the target repository and the ability to run Maven; SKILL.md does not declare required binaries but this is an operational expectation rather than a security mismatch. There are no directives to read unrelated system files or exfiltrate data.

✓

Install Mechanism

Instruction-only skill with no install spec and no code files to execute. That minimizes disk-write and remote-download risk.

✓

Credentials

No environment variables, credentials, or config paths are requested. The tasks described (code generation and config updates) typically do not require secrets, so the absence of requested credentials is proportionate.

✓

Persistence & Privilege

always is false and the skill does not request persistent/system-wide changes beyond writing files in the target repository. Autonomous invocation is allowed (platform default) but not combined with broad credentials or elevated privileges.

Assessment

This skill appears coherent and limited to generating Flink job code and updating configs. Before installing, confirm the agent will run in a safe environment and that you are comfortable granting it read/write access to the target repository (it will modify files). If you enable the optional compile step, remember 'mvn compile' may fetch dependencies and run build scripts (network access and arbitrary build-time code can run), so prefer running compile on a fork or CI sandbox and review diffs before merging. No credentials are requested by the skill; if the agent later asks for API keys or repo credentials, treat that as unexpected and investigate.

Like a lobster shell, security has layers — review code before you run it.

latestvk979za1hmcqn1jk935rjpfv2158375wa

220 downloads

0 stars

2 versions

Updated 10h ago


MIT-0

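Flink Kafka Dual-Write Job Generation

Follow the workflow below. By default it targets Kafka log monitoring jobs in the bethune repository.

What to do first

First confirm which inputs the user has provided. If information is missing, ask only for the minimum required items:

Task number
Reference task; if the user says "refer to task 34", reuse the skeleton of task 34 first
Kafka topic
module filter value
Target table name or business name
Message field structure, in particular whether it contains nested objects or list fields that must be expanded

If the user has already said "same pattern as task 34", assume by default:

A single message usually produces one row and is not expanded per list item
Keep the full Kafka -> filter -> flatMap -> Hive -> StarRocks chain
Reuse parseAndFormatLogTime(), safe(), the Hive partition fill-in, and the synchronized update of all 4 config files

If the task is closer to the list-expansion pattern of tasks 35 or 36, apply the list-expansion rules. See references/bethune-patterns.md for the detailed patterns.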
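The Kafka -> filter -> flatMap -> Hive -> StarRocks chain can be pictured with a plain-Java stand-in that uses no Flink classes. This is only a sketch under stated assumptions: `DualWritePipelineSketch`, `LogRecord`, the `BUS_EXAMPLE_MODULE` value, and the two in-memory lists that stand in for the sinks are all hypothetical and are not taken from the bethune repository.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;

/** Plain-Java stand-in for the Kafka -> filter -> flatMap -> Hive -> StarRocks chain. */
final class DualWritePipelineSketch {
    // Hypothetical module value; hardcoded as a class constant rather than read from config.
    static final String MODULE = "BUS_EXAMPLE_MODULE";

    /** Hypothetical minimal log envelope: a module tag plus the raw message body. */
    static final class LogRecord {
        final String module;
        final String message;

        LogRecord(String module, String message) {
            this.module = module;
            this.message = message;
        }
    }

    /** filter step: keep only this job's module and drop records with an empty message. */
    static boolean accept(LogRecord r) {
        return r != null && MODULE.equals(r.module)
                && r.message != null && !r.message.isEmpty();
    }

    /** flatMap step in single-row mode: one accepted record yields exactly one row. */
    static List<String> toRows(LogRecord r) {
        List<String> rows = new ArrayList<>();
        rows.add(r.message.trim());
        return rows;
    }

    /** Runs filter -> flatMap once, then hands the same rows to both sinks. */
    static void run(List<LogRecord> source, List<String> hiveSink, List<String> starRocksSink) {
        List<String> rows = source.stream()
                .filter(DualWritePipelineSketch::accept)
                .flatMap(r -> toRows(r).stream())
                .collect(Collectors.toList());
        hiveSink.addAll(rows);
        starRocksSink.addAll(rows);
    }
}
```

In a real job the two lists would be the Hive and StarRocks sinks attached to the same stream; the point of the sketch is only that both sinks receive the output of a single filter and flatMap pass.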

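Implementation steps

Read the reference task and its related MessageModel, Po, and config.properties keys first, and confirm naming and field order.
Generate or update 3 Java files: MessageModel, Po, Job.
Update the 4 config files in sync:
- src/main/resources/config.properties
- src/main/resources/dev/config.properties
- src/main/resources/product/config.properties
- src/main/resources/stage/config.properties
If the repository compiles, run mvn -DskipTests compile to verify the new job.
Report back to the user the new files, the config keys, and whether compilation passed; add Hive and StarRocks DDL if the user asks for it.

Constraints that must be followed

Keep the field order exactly identical across TableSchema, StarRocksSinkRowBuilder, and toHiveRow().
st is always the first output column; year, month, and day are always appended at the end of the Hive row.
The module filter value is hardcoded as a constant in the Job class and is not written to config.
logTime always follows option A: if it is empty or fails to parse, log an error and drop the message.
If message is empty, drop it immediately.
id is taken from skyNetVo.getId() first; if that is empty, generate a UUID.
cnt is usually fixed at 1.
String fields should go through safe() as a fallback; numeric fields keep their original numeric type.
If a list field is null or an empty collection, drop the whole message.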
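Several of these constraints can be sketched as small helpers. The block below is an illustration, not the repository's code: the real parseAndFormatLogTime(), safe(), and toHiveRow() templates live in references/bethune-patterns.md. The timestamp pattern `yyyy-MM-dd HH:mm:ss`, the empty-string default in `safe()`, the zero-padded partition values, and the names `RowRulesSketch` and `resolveId` are all assumptions.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;
import java.util.UUID;

final class RowRulesSketch {
    // Assumed timestamp layout; the real pattern is defined by the repository template.
    private static final DateTimeFormatter FMT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss");

    /** String fallback: null becomes an empty string (assumed default). */
    static String safe(String s) {
        return s == null ? "" : s;
    }

    /** Option A: empty or unparseable logTime is logged and yields no value, so the caller drops the message. */
    static Optional<String> parseAndFormatLogTime(String logTime) {
        if (logTime == null || logTime.trim().isEmpty()) {
            System.err.println("logTime is empty, dropping message");
            return Optional.empty();
        }
        try {
            return Optional.of(LocalDateTime.parse(logTime.trim(), FMT).format(FMT));
        } catch (DateTimeParseException e) {
            System.err.println("logTime parse failed, dropping message: " + logTime);
            return Optional.empty();
        }
    }

    /** Prefer the upstream id; generate a UUID when it is missing. */
    static String resolveId(String skyNetId) {
        return (skyNetId == null || skyNetId.isEmpty())
                ? UUID.randomUUID().toString()
                : skyNetId;
    }

    /** Hive row = the StarRocks columns in the same order (st first) + year, month, day taken from st. */
    static List<Object> toHiveRow(List<Object> starRocksColumns) {
        String st = (String) starRocksColumns.get(0);
        List<Object> row = new ArrayList<>(starRocksColumns);
        row.add(st.substring(0, 4));   // year
        row.add(st.substring(5, 7));   // month
        row.add(st.substring(8, 10));  // day
        return row;
    }
}
```

Deriving the Hive row from the StarRocks column list, rather than building the two independently, is one way to make it harder for the field orders to drift apart.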

Naming rules

MessageModel: src/main/java/com/ly/tms/po/carSupply/SkynetLog{BizName}MessageModel.java
Po: src/main/java/com/ly/tms/po/carSupply/SkynetLog{BizName}Po.java
Job: src/main/java/com/ly/tms/job/Bus_{BizName}_KafkaToStarRock_{task number}.java

Config keys follow the existing bethune grouping:

topic key: kafka.bus.{biz}.topic
group key: travel.car.{biz}.group
StarRocks key: starrocks.fe.travel.common.{tableKey}
Hive key: hive.hive_train_ops.{tableKey}
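The templates above are simple string substitutions, which the following sketch spells out. `NamingSketch` and its method names are hypothetical; only the path and key templates come from the rules above. The sketch also assumes the same {BizName} value is passed to all three file templates, which may not hold for every existing task.

```java
final class NamingSketch {
    static final String PO_DIR = "src/main/java/com/ly/tms/po/carSupply/";
    static final String JOB_DIR = "src/main/java/com/ly/tms/job/";

    static String messageModelPath(String bizName) {
        return PO_DIR + "SkynetLog" + bizName + "MessageModel.java";
    }

    static String poPath(String bizName) {
        return PO_DIR + "SkynetLog" + bizName + "Po.java";
    }

    static String jobPath(String bizName, int taskNo) {
        return JOB_DIR + "Bus_" + bizName + "_KafkaToStarRock_" + taskNo + ".java";
    }

    static String topicKey(String biz) {
        return "kafka.bus." + biz + ".topic";
    }

    static String groupKey(String biz) {
        return "travel.car." + biz + ".group";
    }

    static String starRocksKey(String tableKey) {
        return "starrocks.fe.travel.common." + tableKey;
    }

    static String hiveKey(String tableKey) {
        return "hive.hive_train_ops." + tableKey;
    }
}
```

For instance, `NamingSketch.jobPath("Search_ReplacePrice", 34)` yields the file name of the existing class Bus_Search_ReplacePrice_KafkaToStarRock_34.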

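Decision rules when generating code

If the user provides top-level fields plus a few nested objects: write single-row output following the task 34 pattern.
If the user provides lists such as datas or fullPriceList: expand item by item inside flatMap() following the task 35/36 pattern.
If a JSON field name differs from the Java field name, add @JSONField(name = "...") on the MessageModel.
If the reference task has a "nested field first, top-level field as fallback" business rule, keep that priority; do not simplify it to reading a single field directly.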
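The difference between the two modes, and the "nested first, top-level fallback" rule, can be sketched in plain Java without Flink or fastjson. Everything here except the list name fullPriceList is hypothetical: the `Message`, `Detail`, and `PriceItem` types, the `city`, `carType`, and `price` fields, and the pipe-joined string that stands in for an output row.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

final class ExpandSketch {
    static final class PriceItem {
        final String carType;
        final double price;

        PriceItem(String carType, double price) {
            this.carType = carType;
            this.price = price;
        }
    }

    static final class Detail {
        final String city;

        Detail(String city) {
            this.city = city;
        }
    }

    static final class Message {
        final String city;                    // top-level fallback value
        final Detail detail;                  // nested object, preferred when populated
        final List<PriceItem> fullPriceList;  // list field used in expansion mode

        Message(String city, Detail detail, List<PriceItem> fullPriceList) {
            this.city = city;
            this.detail = detail;
            this.fullPriceList = fullPriceList;
        }
    }

    /** Nested field first, top-level field as fallback, empty string as last resort. */
    static String pickCity(Message m) {
        if (m.detail != null && m.detail.city != null && !m.detail.city.isEmpty()) {
            return m.detail.city;
        }
        return m.city == null ? "" : m.city;
    }

    /** Single-row mode: one message, one row. */
    static List<String> singleRow(Message m) {
        return Collections.singletonList(pickCity(m));
    }

    /** List-expansion mode: one row per item; a null or empty list drops the whole message. */
    static List<String> expandRows(Message m) {
        if (m.fullPriceList == null || m.fullPriceList.isEmpty()) {
            return Collections.emptyList();
        }
        List<String> rows = new ArrayList<>();
        for (PriceItem item : m.fullPriceList) {
            if (item == null) {
                continue; // skipping null items is an assumption of this sketch
            }
            String carType = item.carType == null ? "" : item.carType;
            rows.add(pickCity(m) + "|" + carType + "|" + item.price);
        }
        return rows;
    }
}
```

Inside a Flink flatMap() the same logic would emit each row through the collector instead of returning a list; returning an empty list here corresponds to emitting nothing.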

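Output requirements

When finished, report at least:

Which files were added or modified
Which config keys were added
Whether this task is "single-row mode" or "list-expansion mode"
Whether compile verification was completed

References

Read references/bethune-patterns.md for the following:

Differences between tasks 33/34/35/36
A summary of the full skeleton of task 34
The fixed templates for parseAndFormatLogTime() and toHiveRow()
The insertion group positions in the 4 config files

    cnt INT,
    traceid STRING,
    {remaining fields in PO order}
)
PARTITIONED BY (year STRING, month STRING, day STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\001'
STORED AS TEXTFILE;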


### StarRocks
```sql
CREATE TABLE TCTravelStreamData_db.{table name} (
    st            DATETIME,
    apmtraceid    VARCHAR(256),
    id            VARCHAR(256),
    cnt           INT,
    traceid       VARCHAR(256),
    {remaining fields: STRING→VARCHAR(512), INT→INT, DOUBLE→DOUBLE}
)
DUPLICATE KEY(st, apmtraceid)
DISTRIBUTED BY HASH(id) BUCKETS 8
PROPERTIES ("replication_num" = "3");
```

Reference examples (implemented tasks)

Task	Class name	Topic	Module	List expansion field	SR table name
33	Bus_Search_Abtest_KafkaToStarRock_33	skynet_log_Public_SFC_ABTest_Monitor	BUS_Public_SFC_ABTest_Monitor	None	bus_sfc_abtest_monitor
34	Bus_Search_ReplacePrice_KafkaToStarRock_34	skynet_log_Public_SFC_Replace_Price_Monitor	BUS_Public_SFC_Replace_Price_Monitor	None (nested ReferPriceBean)	bus_sfc_replace_price_monitor
35	Bus_Carpool_CalEnter_KafkaToStarRock_35	skynet_log_3304590_CallEnter	BUS_PUBLIC_CARPOOL_PRICING_CallEnter	fullPriceList	bus_carpool_calenter_monitor
36	Bus_Metric_Collection_KafkaToStarRock_36	skynet_log_3309435_bus_travelmetrics	BUS_METRIC_COLLECTION	datas	bus_metric_collection_monitor
