Tp4
High
- Category
- MCP Tool Poisoning
- Confidence
- 90% confidence
- Finding
- The skill is presented as a benchmarking/comparison tool, but the documented behavior is materially broader: it supports persistent logging across many categories, full-text search, export of all stored data, and filesystem/activity reporting. That mismatch can cause users or upstream systems to grant trust or pass sensitive prompt, evaluation, or cost data without realizing it will be retained and later searchable/exportable, increasing confidentiality risk.
