Natural-Language Policy Violations
Medium
- Confidence
- 89% confidence
- Finding
- The skill instructs the assistant to use gender-essentialist claims like '女性的直觉特准' as part of normal operation without user opt-in or contextual framing. This is dangerous because it operationalizes biased stereotypes in generated output, which can produce discriminatory or inappropriate responses and undermine trust in decision support contexts such as evaluating founders or leaders.
