v3.5 Test Report

v1.0.0

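Analyzes v3.5 production deployer logs to generate test reports and statistics. Supports runtime status monitoring, version comparison, and strategy analysis; suited to AI Agent performance evaluation and experiment report generation.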


Install

OpenClaw Prompt Flow

Install with OpenClaw

Best for remote or guided setup. Copy the exact prompt, then paste it into OpenClaw for rf-ai-wh/v35-test-report.

Install the skill "v3.5 Test Report" (rf-ai-wh/v35-test-report) from ClawHub.
Skill page: https://clawhub.ai/rf-ai-wh/v35-test-report
Keep the work scoped to this skill only.
After install, inspect the skill metadata and help me finish setup.
Use only the metadata you can verify from ClawHub; do not invent missing requirements.
Ask before making any broader environment changes.

Command Line

CLI Commands

Use the direct CLI path if you want to install manually and keep every step visible.

OpenClaw CLI

Bare skill slug

openclaw skills install v35-test-report

ClawHub CLI

npx clawhub@latest install v35-test-report

Security Scan

VirusTotal: Benign
OpenClaw: Benign (high confidence)
Purpose & Capability
Name/description describe log analysis and reporting; included script and SKILL.md only read a local log file and compute stats—these requirements match the stated purpose.
Instruction Scope
SKILL.md directs the agent to run the provided Python script and documents the data source (/tmp/agent_v35_production.log). Instructions do not request reading unrelated files, environment variables, or sending data to external endpoints.
Install Mechanism
No install spec; skill is instruction-only with a small local script. No downloads or package installs are specified, which minimizes installation risk.
Credentials
The skill declares no required environment variables, credentials, or config paths beyond the single log path it uses. Requested access is proportional to the task.
Persistence & Privilege
Skill does not request always:true, does not modify other skills or global settings, and has no built-in persistence. Autonomous invocation is allowed by default (normal).
Assessment
This skill appears coherent and low-risk: it runs a small local Python script that parses /tmp/agent_v35_production.log and prints statistics. Before installing or running it, verify that the log path is correct and that the log file does not contain sensitive data you don't want processed. If you plan to run this in a shared environment, consider copying a sanitized sample of the log to a safe location and running the script there. Also review and, if needed, modify the simple regexes in scripts/generate_report.py if your log format differs (they assume specific text patterns like '使用: V35' and '准确度: XX%').
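To make the regex caveat concrete, here is a minimal sketch of how patterns like '使用: V35' and '准确度: XX%' might be matched. The two regexes, the `parse_line` helper, and the sample lines are all assumptions for illustration; the actual expressions in scripts/generate_report.py may differ.

```python
import re

# Assumed patterns, modelled on the text fragments named in the assessment.
# The real regexes in scripts/generate_report.py may differ.
VERSION_RE = re.compile(r"使用:\s*(V\d+)")
ACCURACY_RE = re.compile(r"准确度:\s*(\d+(?:\.\d+)?)%")


def parse_line(line):
    """Return (version, accuracy) found on a line; either may be None."""
    version = VERSION_RE.search(line)
    accuracy = ACCURACY_RE.search(line)
    return (
        version.group(1) if version else None,
        float(accuracy.group(1)) if accuracy else None,
    )


# Hypothetical sample lines, not taken from a real log.
sample = [
    "2025-03-26 15:00:00 使用: V35 准确度: 80%",
    "2025-03-26 15:05:00 使用: V30",
    "unrelated line",
]
parsed = [parse_line(line) for line in sample]
```

If your log writes these fields differently (for example with a full-width colon), only the two compiled patterns would need to change.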

Like a lobster shell, security has layers — review code before you run it.

latest: vk97bp42gamyj6zt0y3gmc7bbbs83nzk2
Downloads: 133
Stars: 0
Versions: 1
Updated 1mo ago
v1.0.0
License: MIT-0

Test Report Skill

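Summary

Analyzes v3.5 production deployer logs to generate test reports and statistics.

Use Cases

  • 📊 Monitor v3.5 runtime status and performance metrics
  • 📈 Compare the performance of v3.0 and v3.5
  • 🎯 Analyze strategy distribution and usage frequency
  • 📝 Generate experiment reports and data visualizations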

Quick Start

Generate a test report

python3 scripts/generate_report.py

Sample output

============================================================
v3.5 Production Deployer Test Report
============================================================
Generated: 2025-03-26 15:00:00
------------------------------------------------------------

📊 Overall statistics:
   Total runs: 100
   v3.5 runs: 85 (85%)
   v3.0 runs: 15 (15%)

🎯 Average likes:
   v3.5: 35.2 likes
   v3.0: 18.5 likes
   Improvement: +90.3%

📈 Prediction accuracy:
   Average accuracy: 75.5%
   MAE: 2.3

⚙️ Strategy distribution:
   learning + insights: 45%
   exploration + engagement: 30%
   trending + concise: 25%
============================================================
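The +90.3% figure in the sample output is consistent with a plain relative improvement of the v3.5 average over the v3.0 average. The sketch below reproduces that arithmetic; `relative_improvement` is a hypothetical helper name, and whether the script computes the figure exactly this way is an assumption.

```python
def relative_improvement(new, baseline):
    """Percentage change of `new` relative to `baseline`."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (new - baseline) / baseline * 100


# Averages taken from the sample output: 35.2 (v3.5) and 18.5 (v3.0).
uplift = relative_improvement(35.2, 18.5)
formatted = f"{uplift:+.1f}%"
print(formatted)  # +90.3%
```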

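Features

1. Run statistics

  • Total number of runs
  • Share of v3.5 vs. v3.0 runs
  • Weight change trend

2. Performance comparison

  • Average likes comparison
  • Accuracy statistics
  • Prediction deviation analysis

3. Strategy analysis

  • Strategy combination distribution
  • Exploration vs. exploitation ratio
  • Causal score distribution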
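The report's MAE line is a mean absolute error. As a sketch, and assuming it compares predicted against actual like counts (the source does not say which quantities are compared), it could be computed as follows. The function name and the sample numbers are illustrative only.

```python
def mean_absolute_error(predicted, actual):
    """Average of |predicted - actual| over paired observations."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not predicted:
        raise ValueError("at least one observation is required")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(predicted)


# Hypothetical like counts, not taken from a real log.
predicted_likes = [30, 40, 20]
actual_likes = [32, 37, 22]
mae = mean_absolute_error(predicted_likes, actual_likes)  # (2 + 3 + 2) / 3
```

A lower MAE means predictions sit closer to the observed values on average, in the same unit as the data itself.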
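A strategy distribution like the one in the sample output can be derived by counting one strategy label per run. This is a minimal sketch using `collections.Counter`; the `strategy_distribution` helper and the way labels are extracted from the log are assumptions, not the script's confirmed behavior.

```python
from collections import Counter


def strategy_distribution(strategies):
    """Return (strategy, percentage) pairs, most frequent first."""
    counts = Counter(strategies)
    total = sum(counts.values())
    if total == 0:
        return []
    return [(name, 100 * n / total) for name, n in counts.most_common()]


# Hypothetical per-run labels matching the proportions in the sample output.
runs = (
    ["learning + insights"] * 45
    + ["exploration + engagement"] * 30
    + ["trending + concise"] * 25
)
distribution = strategy_distribution(runs)
for name, share in distribution:
    print(f"{name}: {share:.0f}%")
```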

Data Source

Reads from /tmp/agent_v35_production.log by default.
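Since the default path lives under /tmp, it can help to read from a sanitized copy instead. The sketch below shows one way to load log lines with an overridable path and a graceful fallback when the file is missing. `read_log_lines` is a hypothetical helper; the source does not say whether scripts/generate_report.py accepts a custom path.

```python
from pathlib import Path

# Default location documented by the skill.
DEFAULT_LOG = Path("/tmp/agent_v35_production.log")


def read_log_lines(path=DEFAULT_LOG):
    """Return the log's lines without trailing newlines, or [] if missing."""
    path = Path(path)
    if not path.is_file():
        return []
    # errors="replace" keeps parsing going if the log has stray bytes.
    with path.open(encoding="utf-8", errors="replace") as handle:
        return [line.rstrip("\n") for line in handle]
```

Calling `read_log_lines("/path/to/sanitized.log")` would read the copy, while calling it with no argument falls back to the documented default.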

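Working with Other Skills

Skill                  How it works together
v35-monitor-center     Input source for monitoring data
auto-weekly-system     Embeds test data in weekly reports
weekly-report-skill    Generates periodic test reports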

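Changelog

v1.0.0 (2025-03-26)

  • ✅ Basic report generation
  • ✅ v3.5/v3.0 comparison analysis
  • ✅ Strategy distribution statistics

Feedback and Contributions

Questions and suggestions are welcome.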
