Natural-Language Policy Violations
Medium
- Confidence: 88%
- Finding: The benchmark hard-codes a specific crisis hotline number and assumes a city-based support path without checking the user's location or whether the resource is appropriate for them. In a real deployment, this can direct a person in acute mental-health crisis to a resource that is unavailable or irrelevant where they are, delaying access to effective help.
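
One way to avoid the hard-coded number is to look the resource up by the user's location and fall back to a clarifying question when the location is unknown. The sketch below is illustrative only and is not taken from the benchmark: the names `CrisisResource`, `DIRECTORY`, `resolve_crisis_resource`, and `crisis_reply` are hypothetical, and the directory entries are placeholders (placeholder country codes, no real phone numbers). A real deployment would presumably load verified, regularly reviewed data from a maintained source.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class CrisisResource:
    name: str
    contact: str
    country_code: str  # two-letter country code (placeholder values below)


# Hypothetical directory with placeholder entries. These are NOT real
# services or numbers; real data must come from a verified source.
DIRECTORY: Dict[str, CrisisResource] = {
    "AA": CrisisResource("Example Crisis Line A", "<verified-number-A>", "AA"),
    "BB": CrisisResource("Example Crisis Line B", "<verified-number-B>", "BB"),
}

# Used when no location is known or no resource is listed for it.
FALLBACK_MESSAGE = (
    "I don't know which services are available where you are. "
    "If you are in immediate danger, please contact your local emergency "
    "services. Can you tell me which country you are in?"
)


def resolve_crisis_resource(
    country_code: Optional[str],
    directory: Dict[str, CrisisResource] = DIRECTORY,
) -> Optional[CrisisResource]:
    """Return the resource listed for the country, or None if unknown."""
    if not country_code:
        return None
    # Normalize so that " aa " and "AA" resolve to the same entry.
    return directory.get(country_code.strip().upper())


def crisis_reply(country_code: Optional[str]) -> str:
    """Build a reply that never assumes a location the user did not give."""
    resource = resolve_crisis_resource(country_code)
    if resource is None:
        return FALLBACK_MESSAGE
    return f"You can reach {resource.name} at {resource.contact}."
```

With this shape, an unknown or missing location yields the fallback question instead of a number that may not work for the user, which is the failure mode the finding describes.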
