Byted Bytehouse Data Asset Analyzer

ReviewAudited by ClawScan on May 10, 2026.

Overview

The skill appears to do the advertised ByteHouse schema analysis, but it runs a mutable remote MCP server and forwards environment-based credentials, so it should be reviewed before use.

Use this only if you trust the referenced MCP server source and can tolerate broad schema metadata being written locally. Prefer a pinned dependency, a read-only ByteHouse account, and a restricted environment containing only the needed ByteHouse variables before running it.

Findings (3)

Artifact-based informational review of SKILL.md, metadata, install specs, static scan signals, and capability signals. ClawScan does not execute the skill or run runtime probes.

What this means

The code that handles the database connection could change upstream without the user noticing, and that code runs in the skill workflow.

Why it was flagged

The skill launches an MCP server by fetching code from a GitHub repository at the mutable main branch during runtime. This is not pinned to a version or commit and is not reflected in the install specification.

Skill content
command='/root/.local/bin/uvx', args=['--from', 'git+https://github.com/volcengine/mcp-server@main#subdirectory=server/mcp_server_bytehouse', 'mcp_bytehouse', '-t', 'stdio']
Recommendation

Pin the MCP server dependency to a reviewed version or commit, declare it in the install/metadata, and let users review or approve the dependency before running it.

What this means

Credentials or other secrets present in the agent environment may be available to the MCP process, beyond the database settings needed for the analysis.

Why it was flagged

The spawned MCP process receives the entire environment, not only the ByteHouse variables needed for this task. Since the documentation tells users to set BYTEHOUSE_PASSWORD and other credentials, unrelated environment secrets could also be exposed to the child process.

Skill content
env = os.environ.copy() ... StdioServerParameters(... env=env)
Recommendation

Pass only the required ByteHouse environment variables to the MCP server, and use a read-only, least-privilege ByteHouse account for schema analysis.

What this means

The generated files may reveal table names, column names, comments, and data-model relationships if shared or retained unintentionally.

Why it was flagged

The skill intentionally persists full database schema, catalog, and lineage reports to local JSON files.

Skill content
输出文件(保存在 `output/` 目录):`schema_{database}_{timestamp}.json` - 完整的数据库schema ... `catalog_{database}_{timestamp}.json` ... `lineage_{database}_{timestamp}.json`
Recommendation

Store the output directory securely, avoid sharing the reports broadly, and delete them when they are no longer needed.