Vague Triggers
Medium
- Confidence
- 84% confidence
- Finding
- The trigger list includes phrases such as 「读人识人」 ("reading and judging people") and 「大项目哲学」 ("big-project philosophy") that are generic enough to appear in ordinary conversation, which raises the chance of accidental skill activation. An unintended activation can override the assistant's normal behavior and impose a roleplay framing the user did not explicitly request. That is a prompt-safety issue even though the content is not overtly malicious.
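
The risk can be illustrated with a minimal sketch. It assumes the skill is activated by plain substring matching against the trigger list; the source does not show the actual matching logic. The `/skill` prefix and both function names are hypothetical and stand in for any explicit-invocation convention.

```python
# Hypothetical sketch: the real skill loader's matching logic is not shown in the finding.
GENERIC_TRIGGERS = ["读人识人", "大项目哲学"]  # trigger phrases quoted in the finding
EXPLICIT_PREFIX = "/skill"  # assumed explicit-invocation marker, not from the source


def naive_match(message: str, triggers: list[str]) -> bool:
    """Activate whenever any trigger phrase appears anywhere in the message."""
    return any(trigger in message for trigger in triggers)


def explicit_match(
    message: str, triggers: list[str], prefix: str = EXPLICIT_PREFIX
) -> bool:
    """Activate only when the message starts with the prefix followed by a trigger."""
    stripped = message.strip()
    if not stripped.startswith(prefix):
        return False
    rest = stripped[len(prefix):].strip()
    return any(rest.startswith(trigger) for trigger in triggers)


# Ordinary discussion that mentions a trigger phrase without asking for the skill.
casual = "我最近在看一本讲读人识人的书"
# A deliberate request using the assumed prefix.
deliberate = "/skill 大项目哲学"

print(naive_match(casual, GENERIC_TRIGGERS))        # substring match fires on casual text
print(explicit_match(casual, GENERIC_TRIGGERS))     # explicit form does not
print(explicit_match(deliberate, GENERIC_TRIGGERS)) # explicit form fires on a deliberate request
```

Requiring an explicit marker, or narrowing the triggers to phrases unlikely to occur in normal discussion, is one way to keep activation tied to user intent. Whether either fits this skill depends on how its loader actually matches triggers.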
