Context-Inappropriate Capability
Medium
- Confidence
- 97% confidence
- Finding
- The prompt explicitly tells the agent not to refuse requests involving sensitive or copyrighted figures and to generate 'stylistically similar alternatives' instead. This instruction weakens safety boundaries and can be used to circumvent model refusals or policy protections around sensitive persons and copyrighted characters. A generic infographic-card workflow does not need this capability.
