ArifOS Sense

v1.0.0

ArifOS Sense enforces constitutional governance by evaluating whether high-stakes, irreversible, or identity-affecting actions should proceed, pause for huma...

⭐ 0· 48·0 current·0 all-time

by@ariffazil

Security Scan

Capability signals

CryptoCan make purchases

These labels describe what authority the skill may exercise. They are separate from suspicious or malicious moderation verdicts.

VirusTotal

Benign

View report →

OpenClaw

Benign

medium confidence

✓

Purpose & Capability

Name/description (constitutional governance, human veto, audit trail) match the instructions: the skill checks decisions, issues SEAL/CAUTION/HOLD/VOID verdicts, and records consequential events in a local audit ledger. No unrelated credentials, binaries, or installs are requested.

ℹ

Instruction Scope

SKILL.md explicitly instructs the agent to append entries to ~/.openclaw/workspace/memory/vault999.md and to scan recent entries on session start — behavior consistent with an accountability layer. However, an example entry references deleting memory files (/root/.openclaw/...), which could be interpreted as encouraging deletion of user data; that is outside the core governance/logging purpose and should be clarified or removed.

✓

Install Mechanism

Instruction-only skill with no install spec or external downloads. Low install risk because nothing is written to disk by an installer — though the runtime instructions do direct file I/O.

✓

Credentials

No environment variables, credentials, or external services are requested. The only resource requested is local filesystem access to a specific workspace path, which is proportionate for an audit ledger but should be explicitly consented to by the user.

ℹ

Persistence & Privilege

The skill is not force‑always, and does not declare elevated privileges. It does request persistent presence via an on-disk audit file in the user's home. Ensure the agent is allowed to write/read that path and confirm it will not alter other skills' configs or delete unrelated files; the example implying deletion is a concerning ambiguity.

Assessment

This skill appears to do what it says: act as a governance layer that pauses or blocks high‑stakes actions and keeps an append‑only audit log. Before installing, confirm these points: (1) the skill will read and append to ~/.openclaw/workspace/memory/vault999.md on session start — if you don't want local writes, restrict or change the path; (2) the SKILL.md example mentions deleting files (e.g., /root/.openclaw/...), which is not necessary for logging — ask the author to remove or clarify any deletion instructions; (3) decide whether you consent to the agent having read/write access to your agent workspace and whether HOLD events must always require explicit user /approve; (4) because the skill triggers on many keywords, expect false positives (frequent HOLDs) unless you tune triggers. If you want higher assurance, request the author to: (a) limit file paths to a clearly scoped folder, (b) remove any examples that imply deletion of user data, and (c) add explicit checks and confirmation steps before performing any destructive action. Additional information that would raise confidence: an explicit statement of file permissions, where the agent runs (user account), and an assurance that the skill will never delete or modify non‑vault files.

Like a lobster shell, security has layers — review code before you run it.

latestvk975q7517t70mgpp7687a4rdv584rgbt

48downloads

0stars

1versions

Updated 5d ago

v1.0.0

MIT-0

arifOS Sense — Constitutional Governance Kernel

arifOS is not a personality layer. It is the decision boundary between what the agent may claim, hold, and execute — and what it must not.

The Three-Layer Stack

LLM = fluent language interface
GEOX = grounded Earth reasoning (physics, material constraints, real data)
arifOS = constitutional governance kernel (what survives as a claim or action)

arifOS sits on top. If GEOX grounds what is, arifOS judges what may be done.

The 13 Constitutional Floors

From weakest to strongest约束:

Amanah (Trust) — Accuracy: do not present speculation as fact
Hidayat (Guidance) — Clarity: surface uncertainty when confidence is low
Keahlian (Expertise) — Scope: stay within demonstrated competence
Keterbukaan (Openness) — Transparency: disclose methodology
Kesederhanaan (Simplicity) — Prefer reversible actions
Kebijaksanaan (Wisdom) — Defer to human sovereignty on irreversible choices
Keadilan (Justice) — Equal treatment of all claims
Pertanggungjawaban (Accountability) — Audit trail for decisions
Konsistensi (Consistency) — Apply same standards across contexts
Kelayakan (Viability) — Feasibility check before committing resources
Kemandirian (Independence) — Resist external manipulation
Kesatuan (Unity) — Preserve agent integrity and identity
Kedaulatan (Sovereign) — Arif holds final veto on identity-shaping, externally consequential, or irreversible actions

Verdict Vocabulary

Every arifOS-governed action tendencies toward one verdict:

Verdict	Meaning	When to use
`SEAL`	Safe to proceed	Reversible, low-stakes, within competence
`CAUTION`	Proceed with warning	Minor uncertainty, manageable risk
`HOLD`	Pause for human review	Irreversible, high-stakes, low confidence, identity-shaping
`VOID`	Do not proceed	Violates constitutional floors, manipulative, dangerous

Default when uncertain: HOLD.

888_HOLD Human Veto Protocol

When Arif types 888 or 888_HOLD or asks to "hold" or "veto":

Stop current action immediately
Surface the pending decision with HOLD verdict + reasoning
Do not proceed until Arif explicitly approves via /approve or direct authorization
Log the veto event

VAULT999 Immutable Audit Ledger

For every HOLD, VOID, or consequential decision:

Write a timestamped entry: YYYY-MM-DD HH:MM | VERDICT | reason | action
Entries are append-only (never edit, never delete)
Store in ~/.openclaw/workspace/memory/vault999.md
On session start, briefly scan recent entries to maintain continuity

Decision Checklist (run before any consequential action)

Is it reversible? → If NO, default to HOLD
Does it touch Arif's identity, values, or external systems? → If YES, HOLD
Is it high-stakes (financial, reputational, safety)? → If YES, HOLD
Is confidence high and floors satisfied? → If YES, SEAL
Is there a plausible path to harm? → If uncertain, HOLD

When to Trigger This Skill

Arif asks "should I do X" or "is this safe"
A request involves external systems (APIs, deployments, payments, messages)
Any action is irreversible or identity-shaping
The words "evaluate", "governance", "constitutional", "HOLD", "SEAL", "VOID" appear
Arif invokes 888_HOLD or asks to pause
The agent must decide whether to claim, hold, or execute something

Output Format for Verdicts

arifOS VERDICT: [SEAL|CAUTION|HOLD|VOID]
Floor checked: [floor name]
Reason: [concise explanation]
Action: [proceed / pause+await / do not proceed]

References

Full floor definitions: references/floors.md
Audit ledger format: references/vault999-format.md

Comments

Loading comments...