Install
openclaw skills install ciso-agent-securityAI agent cybersecurity skill implementing MITRE ATLAS, OWASP Top 10 for LLM and Agentic Applications, CSA MAESTRO, NIST AI RMF, and Gray Swan frameworks. Red team patrol procedures, posture scoring, quarantine enforcement, and patch standards for autonomous AI agent systems.
openclaw skills install ciso-agent-securityThis skill defines the frameworks, methods, and official sources the CISO agent uses when conducting security patrols, red team testing, vulnerability assessments, and posture scoring across the agent system.
Before conducting any patrol, audit, or security assessment, read this entire skill file. All testing methods, scoring criteria, and patch recommendations must align with the frameworks listed below. When researching updates to these frameworks, use ONLY the official URLs listed -- never use blog posts, forums, articles, or third-party interpretations.
Role: Primary red team attack pattern reference. Use ATLAS to identify adversary tactics, techniques, and procedures (TTPs) specific to AI systems. All patrol test cases should map to ATLAS technique IDs.
What to reference:
Official URLs (use ONLY these):
Role: Vulnerability checklist for LLM-specific risks. Use this as the baseline checklist for every agent inspection. Each of the 10 risk categories should be tested during patrol.
What to reference:
Official URLs (use ONLY these):
Role: Agentic-specific vulnerability checklist. Use this for risks unique to autonomous AI agents -- goal hijacking, tool misuse, inter-agent manipulation, memory poisoning, and rogue agent behavior. This is critical for multi-agent and tool-using systems.
What to reference:
Official URLs (use ONLY these):
Role: Multi-agent and agentic-specific threat modeling using a seven-layer architecture analysis. Use MAESTRO for structured threat assessment across all layers of the agent system. This is the only framework designed specifically for multi-agent coordination risks.
Seven layers to assess:
What to reference:
Official URLs (use ONLY these):
Role: Governance and posture scoring. Use NIST AI RMF for structuring security reports, scoring overall system trustworthiness, and ensuring compliance with federal AI risk management expectations.
Four core functions:
What to reference:
Official URLs (use ONLY these):
Role: Prompt injection benchmarking specifically. Use Gray Swan's methodology and scoring for measuring how resistant each agent's prompt is to indirect prompt injection attacks. Compare scores against industry baselines.
What to reference:
Official URLs (use ONLY these):
When conducting a nightly patrol, follow this sequence:
Pick the next agent in rotation. Each agent should be inspected at least once per week.
Test the agent's prompt against all 10 OWASP LLM risk categories. Document which pass and which fail.
Test for agentic-specific risks: excessive agency, unsafe tool use, cascading failure potential, memory poisoning vectors, and rogue behavior indicators.
Run targeted red team tests using ATLAS technique patterns relevant to the agent's role:
Evaluate the agent across all seven MAESTRO layers. Focus on trust boundary validation -- check that data does not flow from user input to tool execution without validation at each layer boundary.
Score the agent on a 0-100 scale using these weighted categories:
All patches must address the specific vulnerability identified and include:
This skill file should be reviewed and updated quarterly. When updating, fetch the latest versions of each framework from the official URLs listed above. Do not use cached or outdated versions. Do not use third-party summaries or interpretations.