Context-Inappropriate Capability
Medium
- Confidence
- 95% confidence
- Finding
- The manifest frames this skill as verifying math-heavy code for correctness, numerical stability, and standards alignment. However, the documented workflow instructs the agent to run shell commands (`git`, `pytest`, `jupyter nbconvert`) to inspect repo state and execute tests/notebooks, which is an operational capability not inherently justified by a narrowly scoped review skill description.
